Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing documentation on usgae #7

Open
trackme518 opened this issue Feb 2, 2024 · 4 comments
Open

Missing documentation on usgae #7

trackme518 opened this issue Feb 2, 2024 · 4 comments

Comments

@trackme518
Copy link

Hi,
I installed the extension in super-collider and it works. But it is unclear how your code works. Please add extensive description to your example. It is loading the rave model. Than I need to export my own msprior.ts based on the rave model and its training data I am using? Or should that be downloaded from somewhere? I would appreciate already exported build or links to working set of models to at least test it.

How does it function? It feeds noise into rave model which will do what? Generate audio? Why there is msprior.ts? Please explain. Thank you.

@elgiano
Copy link
Owner

elgiano commented Feb 7, 2024

Hi @trackme518
Thanks for reaching out, from your comment I understand that docs can look too cryptic, as they assume a certain workflow without explaining it in detail. I'm looking forward to add more descriptive text in the next few months. Could you help me understand what exactly were your difficulties?

Also, if anyone else is reading this issues and have similar concerns, they are welcome to add them to the discussion here.

I think, and other people using this extension confirmed it, that the usage is quite clear for people who have used RAVE and nn_tilde before. But your comment is an occasion to make it clearer for people who perhaps didn't. What is your previous experience with RAVE? Or is your first experience with it happening through nn.ar?
And would the following kind of information help you?

  1. nn.ar is an interface to load trained RAVE models, and to use in real-time their processing methods. For more informations about RAVE, please refer to its own documentation at https://github.com/acids-ircam/RAVE
  2. nn.ar doesn't include any pre-trained model, but some are shared by the RAVE project itself at https://github.com/acids-ircam/RAVE?tab=readme-ov-file#pretrained-models
  3. RAVE models typically offer three processing methods: encoding (i.e. generating a latent representation of the audio you input), decoding (i.e. the inverse, generating audio from a latent representation you input), and forwarding (generating audio from audio, without providing access to its latent representation).
    [here I could put examples for each method, a bit like RAVE does, so one example for reconstruction, one example for latent manipulation, one example for playing a RAVE decoder like a synth with obscure parameters]
  4. Some rave models include a "prior" method, which achieves unconditional generation. It takes only one input, the "temperature", and generates a stream of sound. [here I could put a rave v1 prior example]
  5. Other RAVE models don't include a prior, and nn.ar also works with generative models produced by the msprior library. msprior is still experimental, and msprior models can have different generative methods, for which refer to its own documentation. [msprior usage example]

@trackme518
Copy link
Author

Hi, thanks for reaching out - yes I did not use RAVE before. I am interested in direct latent space traversing. My idea was to generate N-dimensional noise seeds that I would feed to the network. My confusion stems from the example that works with 2 models - msprior.ts and ~/rave/ravemodel.ts, why? Do I need 2 models? Or is the example actually multiple examples merged into one block of code? Sorry I don't get it.

@elgiano
Copy link
Owner

elgiano commented Feb 9, 2024 via email

@elgiano
Copy link
Owner

elgiano commented May 10, 2024

@trackme518
I'm sorry it took so long, but I've updated documentation on v0.0.4-alpha. If you're still playing with this, I would love to hear if docs are more helpful now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants