BAnalyzer Importing data In [ ]: import pandas from collections import Counter from os import path import json from datetime import datetime cwd = ‘../data’ data_path = path.join(cwd, ’01.json’) student_path = path.join(cwd, ‘students.txt’) exclude_author = [‘AutoModerator’, ‘BlueArchive-ModTeam’] with open(data_path, “r”, encoding=’utf-8′) as outfile: data = json.load(outfile) data_df = pandas.DataFrame.from_dict(data, orient=’index’) # [print(col)…
Category: BAScraper
First release of BAScraper
So after a bit I released the proper version of BAScraper on PYPI. I can’t guarantee it’ll properly work since I don’t have a testing method yet but I’ll fix stuffs asap since I also use it quite often. The following is the docs for the current version(as of writing)…
So, about the BAscraper
So, after having no luck in finding a good way to scrape reddit for submissions and posts, I luckily managed to find out about PullPush.io. It was a godsend for me since it basically had all usefull functions from pushshift.io which is much appreciated. It uses web api like usual…