Skip to content

Commit

Permalink
Update Esperanto-Morphological-Tokenization.md
Browse files Browse the repository at this point in the history
  • Loading branch information
generic-account authored Jun 22, 2024
1 parent 79cc100 commit 3a1dc72
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions Esperanto-Morphological-Tokenization.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,8 @@

# Esperanto Morphological Tokenization

A research project by Gordon Lichtstein using Fairseq to investigate the impact of morphological tokenization on the output of English-Esperanto translations

## Introduction
#### Esperanto Background
Esperanto is an agglutinative constructed international auxiliary language, boasting a unique and regular set of grammatical features along with the largest speaker base of any constructed language. I am one of those speakers. I've been studying Esperanto for about three years at this point, and I’ve been programming for about eight. This 2+ month personal project perfectly merges my fascination with linguistics and computer science, also utilizing another one of my passions - math - for statistical analysis of my results.
Expand Down

0 comments on commit 3a1dc72

Please sign in to comment.