Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

torchortho #24

Open
K-H-Ismail opened this issue Feb 4, 2025 · 1 comment
Open

torchortho #24

K-H-Ismail opened this issue Feb 4, 2025 · 1 comment

Comments

@K-H-Ismail
Copy link

KAT used the variance preserving initialization as formulated in the Kaimining initialization for learnable rational activations. This implies calculating the 2nd order moment of a rational function, which has a complicated closed form. We show that this 2nd order moment can be easily computed by considering orthogonal functions. As an example, we used orthogonal polynomials (Hermite) and trigonometric functions (Fourier) and showed that they can be used to achieve better results in image classification on ImageNet using ConvNeXt and next token prediction on OpenWebText using GPT-2.

📄 Paper: Learnable Polynomial, Trigonometric, and Tropical Activations
💻 Code: torchortho on GitHub

@Adamdad
Copy link
Owner

Adamdad commented Feb 4, 2025

Great work! @K-H-Ismail
I will check the paper to see if there is anything I can incoperate into my work!
Thank you so much!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants