Category: BAScraper

Clean and analyze social media usage data with Python (Coursera)

maxjo, 2024-02-112024-02-11

BAnalyzer Importing data In [ ]: import pandas from collections import Counter from os import path import json from datetime import datetime cwd = ‘../data’ data_path = path.join(cwd, ’01.json’) student_path = path.join(cwd, ‘students.txt’) exclude_author = [‘AutoModerator’, ‘BlueArchive-ModTeam’] with open(data_path, “r”, encoding=’utf-8′) as outfile: data = json.load(outfile) data_df = pandas.DataFrame.from_dict(data, orient=’index’) # [print(col)…

First release of BAScraper

maxjo, 2024-01-31

So after a bit I released the proper version of BAScraper on PYPI. I can’t guarantee it’ll properly work since I don’t have a testing method yet but I’ll fix stuffs asap since I also use it quite often. The following is the docs for the current version(as of writing)…

So, about the BAscraper

maxjo, 2024-01-092024-01-09

So, after having no luck in finding a good way to scrape reddit for submissions and posts, I luckily managed to find out about PullPush.io. It was a godsend for me since it basically had all usefull functions from pushshift.io which is much appreciated. It uses web api like usual…