The project combines several natural language processing approaches. Word embeddings such as Word2Vec (w2v) are used, and training a model from scratch is compared with using a pre-trained one. Neural networks, including convolutional and LSTM architectures, are used for classification.
The w2v notebook explores the feasibility of training a custom word embedding model, as well as continuing to train the resulting model on new data.
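A minimal sketch of such a workflow, using gensim with hypothetical toy corpora (the notebook's actual data and hyperparameters may differ):

```python
# Sketch only: gensim Word2Vec trained from scratch, then updated on new data.
from gensim.models import Word2Vec

# Tokenized sentences (placeholder corpora).
initial_corpus = [["the", "movie", "was", "great"], ["terrible", "acting", "overall"]]
new_corpus = [["surprisingly", "good", "plot"], ["the", "pacing", "felt", "slow"]]

# Train a custom Word2Vec model from scratch.
model = Word2Vec(sentences=initial_corpus, vector_size=100, window=5, min_count=1, workers=4)

# Continue training the same model on new data.
model.build_vocab(new_corpus, update=True)
model.train(new_corpus, total_examples=len(new_corpus), epochs=model.epochs)

# Query the learned embeddings.
print(model.wv.most_similar("good", topn=3))
```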
The pretrained_lstm_sentiment_analysis notebook works with GloVe (Global Vectors for Word Representation), a pre-trained word embedding model, which is used to compare the performance of three neural networks: a simple (classical) network, a convolutional network, and an LSTM.
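A rough sketch of that comparison with TensorFlow/Keras (the GloVe file name, example reviews, and hyperparameters below are placeholders, not the notebook's exact values):

```python
# Sketch only: three classifiers built on top of frozen GloVe embeddings.
import numpy as np
from tensorflow.keras import Input, initializers, layers, models
from tensorflow.keras.preprocessing.text import Tokenizer

vocab_size, embed_dim, max_len = 10_000, 100, 200

# Placeholder reviews; the notebook uses its own dataset.
texts = ["the movie was great", "terrible acting overall"]
tokenizer = Tokenizer(num_words=vocab_size)
tokenizer.fit_on_texts(texts)

# Load pre-trained GloVe vectors into an embedding matrix (hypothetical file name).
embedding_matrix = np.zeros((vocab_size, embed_dim))
with open("glove.6B.100d.txt", encoding="utf-8") as f:
    for line in f:
        word, *vec = line.split()
        idx = tokenizer.word_index.get(word)
        if idx is not None and idx < vocab_size:
            embedding_matrix[idx] = np.asarray(vec, dtype="float32")

def build_model(kind):
    """Return one of the three architectures on top of the frozen GloVe embeddings."""
    net = models.Sequential([Input(shape=(max_len,))])
    net.add(layers.Embedding(vocab_size, embed_dim, trainable=False,
                             embeddings_initializer=initializers.Constant(embedding_matrix)))
    if kind == "simple":      # plain dense classifier
        net.add(layers.Flatten())
        net.add(layers.Dense(32, activation="relu"))
    elif kind == "cnn":       # 1D convolution over the token sequence
        net.add(layers.Conv1D(64, 5, activation="relu"))
        net.add(layers.GlobalMaxPooling1D())
    elif kind == "lstm":      # recurrent encoder
        net.add(layers.LSTM(64))
    net.add(layers.Dense(1, activation="sigmoid"))  # binary sentiment output
    net.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
    return net

models_to_compare = {kind: build_model(kind) for kind in ("simple", "cnn", "lstm")}
```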
The vader_roberta notebook focuses on the RoBERTa (Robustly Optimized Bidirectional Encoder Representations from Transformers) and VADER (Valence Aware Dictionary and sEntiment Reasoner) models, which are used to determine the overall emotional background of every review in the dataset.
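A minimal sketch of scoring reviews with both models, assuming nltk and transformers are installed (the RoBERTa checkpoint name and example reviews are assumptions, not necessarily what the notebook uses):

```python
# Sketch only: rule-based VADER vs. transformer-based RoBERTa sentiment scores.
import nltk
from nltk.sentiment import SentimentIntensityAnalyzer
from transformers import pipeline

nltk.download("vader_lexicon")  # one-time download of the VADER lexicon

reviews = ["Absolutely loved this product!", "Broke after two days, very disappointing."]

# VADER: the compound score in [-1, 1] summarizes the emotional background.
vader = SentimentIntensityAnalyzer()
vader_scores = [vader.polarity_scores(text) for text in reviews]

# RoBERTa sentiment classifier (hypothetical but commonly used checkpoint).
roberta = pipeline("sentiment-analysis",
                   model="cardiffnlp/twitter-roberta-base-sentiment-latest")
roberta_scores = roberta(reviews)

for text, v, r in zip(reviews, vader_scores, roberta_scores):
    print(text, "| VADER compound:", v["compound"], "| RoBERTa:", r["label"], round(r["score"], 3))
```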
The main goal was to learn NLP pipelines, develop an understanding of the topic, and build core skills.