Skip to content

Prep

WhompingWillow.prep

prep_data(corpus) -> pd.DataFrame

Process the data :param corpus: PD of the cleaned data. (Needs to have "data" column with the values for LDA.) :return processed_docs: Processed PD

build_BoW(processed_docs) -> dict

Build the Bog of Words and Gensim Dictionary. :param processed_docs: PD of processed information. :return BoW, id2word: Bag of Words and Gensim Dictionary