Conditional sentence generation by using a force word

This is a repository for a university project (University of Twente) for the course Natural Language Processing.

Short Project Description

The main idea for the research project is to create a sentence generator. This sentence generator shall generate conditional sentences out of only word together with its’ given speech tag. In exceptional cases without any part of speech tag, the most common (with the highest probability) tag shall be used for the particular word. The structure in which the given word shall be implemented in, will be a simple subject-verb-object structure, e.g., .

We think that with these given features, the model should be able to generate a sentence that makes sense.

Research Questions

How to identify the grammatical part in the sentence for the given word?
How to make sure that the generated sentences actually make sense and are based on a continuous bag of words (CBOW)?
How can we force our model to generate the sentence that contains a given word?
Optional: Does the order of adverbs as well as the sentence order influence the performance of the model?

Datasets

https://lindat.mff.cuni.cz/repository/xmlui/handle/11234/1-5150

References

Almutiri, T., & Nadeem, F. (2022). Markov models applications in natural language processing: a survey. Int. J. Inf. Technol. Comput. Sci, 2, 1-16. Cao, S., Kitaev, N., & Klein, D. (2020). Unsupervised parsing via constituency tests. arXiv preprint arXiv:2010.03146.

Setup and usage

After downloading the code and installing the requirements you have to download the corpus from the link provided in the datesets, create a folder called "Data" in the root directory of the project and place there the UD_English-EWT folder.

The program can be executed with the main.py command.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
website		website
.gitignore		.gitignore
ProjectReport.pdf		ProjectReport.pdf
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Conditional sentence generation by using a force word

Short Project Description

Research Questions

Datasets

References

Setup and usage

About

Releases

Packages

Languages

TheBarbaraIsTaken/SentenceGenerator

Folders and files

Latest commit

History

Repository files navigation

Conditional sentence generation by using a force word

Short Project Description

Research Questions

Datasets

References

Setup and usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages