There could be 4 different IDs in a page:
Article id (in the article tag)
Meaning id (in the li of the first ol within an article)
Locution Name id (locutions appear in an h3)
Locution Meaning id (in the li of a locution's ol)
We are currently working with the first, second and fourth types. The third type is not being extracted, since most redirections to locutions point to one of their meanings. However, ~700 of them (~0.5% of the total number of meanings) point to the locution name instead. For now we treat them as exceptions. It is not a big deal, since locutions are rarely asked about in the show, but we could probably handle them by extracting the h3 IDs and building a dict of the relations (key: locution name id, value: locution meaning id) to use in the next stage.
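A minimal sketch of how that dict could be built, using only Python's standard `html.parser`. It is not the project's actual extractor: the class name `LocutionIdParser`, the function `locution_name_to_meaning`, and the sample markup are all hypothetical. It also assumes that each locution's h3 is followed by its own ol, and that mapping the name id to the *first* meaning id of that locution is acceptable.

```python
from html.parser import HTMLParser


class LocutionIdParser(HTMLParser):
    """Collects {locution name id: first locution meaning id} from a page.

    Assumption: a locution's h3 is followed by the li elements that hold
    its meanings, and the first of those is the one we want to map to.
    """

    def __init__(self):
        super().__init__()
        self.relations = {}
        self._pending_h3_id = None  # h3 id still waiting for its first li id

    def handle_starttag(self, tag, attrs):
        element_id = dict(attrs).get("id")
        if tag == "article":
            # A new article starts: forget any h3 from the previous one.
            self._pending_h3_id = None
        elif tag == "h3":
            self._pending_h3_id = element_id
        elif tag == "li" and element_id and self._pending_h3_id:
            # First li after an h3: record the relation, then stop waiting.
            self.relations[self._pending_h3_id] = element_id
            self._pending_h3_id = None


def locution_name_to_meaning(html):
    parser = LocutionIdParser()
    parser.feed(html)
    return parser.relations


# Hypothetical markup, only to illustrate the four ID types.
sample = """
<article id="A1">
  <ol><li id="M1">first meaning</li><li id="M2">second meaning</li></ol>
  <h3 id="N1">a locution</h3>
  <ol><li id="L1">locution meaning</li><li id="L2">another</li></ol>
  <h3 id="N2">another locution</h3>
  <ol><li id="L3">locution meaning</li></ol>
</article>
"""
print(locution_name_to_meaning(sample))  # {'N1': 'L1', 'N2': 'L3'}
```

The li elements of the first ol (M1, M2) are skipped because no h3 has been seen yet in that article, so word meanings never end up in the dict. In the next stage, a redirection that targets a locution name id could be looked up in this dict and replaced by the corresponding meaning id.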
We have left the abbreviations at the end. This issue was meant for context abbreviations, not for word abbreviations, but it is not a problem to have those in the meaning.