A Python pipeline that generates responses for a dataset of 250 datapoints (gender, age, ethnicity) using GPT-3 (`text-davinci-003`), maps them to a 768-dimensional dense vector space with the T5-XXL sentence transformer, reduces the dimensionality of the embeddings with PCA and UMAP, and then produces Plotly visualizations and TextBlob sentiment analysis.
- Add your OpenAI API key to `keys.py` (a minimal sketch follows this list).
- Run `pip install matplotlib seaborn umap-learn sentence_transformers openai plotly textblob`.
- Run `python3 gen.py`, changing the `PROMPT` global variable if you want to change the dataset in `fake_data/fake_people.csv` (see the generation sketch after this list).
- Change the `STORY_START` and `STORY_END` global variables in `stories.py` to control what kind of answers GPT-3 generates (see the story sketch after this list).
- Run `python3 stories.py`.
- Run `python3 strans.py` (CAUTION: this will likely use significant GPU resources while the sentence transformer is running; see the embedding sketch after this list).
- Run `python3 vis.py` to generate the Plotly dimensionality-reduction plots and sentence-level sentiment analysis graphs (see the visualization sketch after this list).
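The only thing `keys.py` needs to provide is the OpenAI key. A minimal sketch (the variable name `OPENAI_API_KEY` is an assumption; match whatever name the scripts actually import):

```python
# keys.py -- minimal sketch; OPENAI_API_KEY is an assumed name,
# match whatever gen.py / stories.py import.
OPENAI_API_KEY = "sk-..."  # your OpenAI API key
```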
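How the generation step roughly works, as a hedged sketch rather than the repository's exact code: `PROMPT` drives a legacy Completions call to `text-davinci-003` (pre-1.0 `openai` client assumed), and the results are written to `fake_data/fake_people.csv`. The prompt wording, parsing, and CSV layout below are illustrative assumptions.

```python
# Sketch of the dataset-generation step (gen.py). Assumes the pre-1.0 openai
# client, since text-davinci-003 is a legacy Completions model.
import csv
import os
import openai
from keys import OPENAI_API_KEY  # variable name is an assumption

openai.api_key = OPENAI_API_KEY

# PROMPT drives what kind of fake people are generated; edit it to change the
# dataset written to fake_data/fake_people.csv.
PROMPT = "Generate a fake person as 'gender, age, ethnicity':"

def generate_person():
    response = openai.Completion.create(
        model="text-davinci-003",
        prompt=PROMPT,
        max_tokens=32,
        temperature=1.0,
    )
    # Naive parsing: assumes the model answers in "gender, age, ethnicity" form.
    return [field.strip() for field in response.choices[0].text.split(",")]

if __name__ == "__main__":
    os.makedirs("fake_data", exist_ok=True)
    with open("fake_data/fake_people.csv", "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["gender", "age", "ethnicity"])
        for _ in range(250):
            writer.writerow(generate_person())
```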
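Similarly, `STORY_START` and `STORY_END` wrap each datapoint into the prompt sent to GPT-3, so editing them changes what kind of answer the model writes. A hedged sketch of that framing (the example values, file names, and output format are assumptions):

```python
# Sketch of the story-generation step (stories.py). STORY_START / STORY_END
# frame each datapoint into the prompt; adjust them to steer the answers.
import csv
import openai
from keys import OPENAI_API_KEY  # variable name is an assumption

openai.api_key = OPENAI_API_KEY

STORY_START = "Write a short story about a person who is "
STORY_END = ". Focus on their daily life."

def generate_story(gender, age, ethnicity):
    prompt = f"{STORY_START}{gender}, {age} years old, {ethnicity}{STORY_END}"
    response = openai.Completion.create(
        model="text-davinci-003",
        prompt=prompt,
        max_tokens=256,
    )
    return response.choices[0].text.strip()

if __name__ == "__main__":
    with open("fake_data/fake_people.csv") as f:
        people = list(csv.DictReader(f))
    stories = [generate_story(p["gender"], p["age"], p["ethnicity"]) for p in people]
    # Keep one story per line so later scripts can read them back easily.
    with open("fake_data/stories.txt", "w") as f:
        f.write("\n".join(s.replace("\n", " ") for s in stories))
```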
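The embedding step maps each generated story to a 768-dimensional vector with the sentence-transformers T5-XXL model; this is the GPU-heavy part. A minimal sketch, assuming the stories are stored one per line and the embeddings are saved as a NumPy array:

```python
# Sketch of the embedding step (strans.py). The sentence-t5-xxl model maps each
# story to a 768-dimensional dense vector. File names are assumptions.
import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("sentence-transformers/sentence-t5-xxl")  # large model; runs best on a GPU

with open("fake_data/stories.txt") as f:
    stories = f.read().splitlines()

embeddings = model.encode(stories, show_progress_bar=True)  # shape: (n_stories, 768)
np.save("fake_data/embeddings.npy", embeddings)
```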
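Finally, the visualization step reduces the 768-dimensional embeddings to 2-D with PCA and UMAP, plots them with Plotly, and scores each story's sentiment with TextBlob. A sketch under the same file-name assumptions as above (coloring by gender is illustrative):

```python
# Sketch of the visualization step (vis.py): PCA and UMAP projections of the
# embeddings plotted with Plotly, plus TextBlob sentiment polarity per story.
import csv
import numpy as np
import plotly.express as px
import umap
from sklearn.decomposition import PCA
from textblob import TextBlob

embeddings = np.load("fake_data/embeddings.npy")
with open("fake_data/stories.txt") as f:
    stories = f.read().splitlines()
with open("fake_data/fake_people.csv") as f:
    people = list(csv.DictReader(f))

# Two 2-D projections of the same 768-d embeddings.
pca_2d = PCA(n_components=2).fit_transform(embeddings)
umap_2d = umap.UMAP(n_components=2, random_state=42).fit_transform(embeddings)

# TextBlob polarity in [-1, 1] for each generated story.
polarity = [TextBlob(s).sentiment.polarity for s in stories]

genders = [p["gender"] for p in people]
px.scatter(x=pca_2d[:, 0], y=pca_2d[:, 1], color=genders, title="PCA").show()
px.scatter(x=umap_2d[:, 0], y=umap_2d[:, 1], color=genders, title="UMAP").show()
px.histogram(x=polarity, color=genders, title="Sentiment polarity").show()
```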