start_date = convert_date_to_timestamp(start_date) end_date = convert_date_to_timestamp(end_date) url = "https://api.pushshift.io/reddit/{}/search?limit=1000&sort=desc&subreddit={}&before={}&after={}" print (f"Saving {object_type}s to {filename}") dfs =[] previous_end_date...
OneNote's collaboration features allow team members to share notebooks, edit notes together in real-time, and leave comments and feedback. Plus, OneNote integrates with other Microsoft tools, including Outlook, Word, and Excel, making it easy to incorporate your notes into your workflow. Looking...
To create the BOW model, we first preprocessed all comments from the r/insomnia subreddit using the gensim simple preprocessor function, which removed symbols and numbers and converted the comments into lists of words. The natural language toolkit “wordNetLemmatizer” was used to convert words ...
(1) extractRawTxt.py - Extracts raw text comments, tokenizes comments at sentence level, and extracts select meta info from reddit text files, and convert to CSV files in rawtxt directory Reads in the name XXX of a directory with .txt files from the Reddit corpus (e.g. "txtFiles1")...