Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PI-2526 Remove repeated punctuation strings #4612

Merged
merged 1 commit into from
Feb 3, 2025

Conversation

marcus-bcl
Copy link
Contributor

to prevent model errors due to "too many tokens":

Input validation error: `inputs` must have less than 512 tokens Given: 566

caused by strings such as:

---------------------------------------------------------
Comment added by name on 01/02/2023 at 12:34
Report Edited: 01/02/2023 at 12:34
---------------------------------------------------------
Comment added by name on 01/02/2023 at 12:34
Report Edited: 01/02/2023 at 12:34
...

to prevent model errors due to "too many tokens":
```
Input validation error: `inputs` must have less than 512 tokens Given: 566
```

caused by strings such as:
```
---------------------------------------------------------
Comment added by name on 01/02/2023 at 12:34
Report Edited: 01/02/2023 at 12:34
---------------------------------------------------------
Comment added by name on 01/02/2023 at 12:34
Report Edited: 01/02/2023 at 12:34
...
```
@marcus-bcl marcus-bcl requested a review from a team as a code owner February 3, 2025 13:06
@marcus-bcl marcus-bcl enabled auto-merge (squash) February 3, 2025 13:06
@marcus-bcl marcus-bcl merged commit b7c7a66 into main Feb 3, 2025
59 checks passed
@marcus-bcl marcus-bcl deleted the PI-2526-remove-punctuation branch February 3, 2025 13:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants