-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Epic] OpenM++ and model serving as a platform #2
Comments
More related information that was sent by Steve Gribble on July 22nd 2023
|
Tasks going forward: For service implemented using a StatefulSet workload: For the OpenMPI backend service: |
A few questions / details about the web service that will help to determine the architecture: Assumptions:
If a centralized web service is used for multi-tenancy some questions would be:
I think these questions would determine if we're able to share a web instance between projects or the web instance should be isolated per namespace, or even if multiple instances are required per project based on version / data sensitivity / dev / prod. As far as I can tell the benefits of sharing of a web instance would be:
|
Dropped a mini-poc you can find here to install / launch the Openm++ web interface from the jupyterlab notebook as a shortcut, much better experience than using the remote desktop image and simpler than a centralized interface. |
Let's think about the big picture of how we want to serve Machine learning models and the OpenM++ features. There has also been discussions about converting existing models from ModGen to work with OpenM++. This issue will be the overarching place for information and tasks definition throughout the architecting and scoping of the project.
Note that for the time being it will live in the AAW github project, but assume that it will have its own in the very short term.
It think it makes sense to reiterate some of the stuff that I put down in the performance agreement and separate that into smaller actionable tasks. Here are these components again:
From my reading of the OpenM++ wiki so far, the biggest functionality gap appears to be setting up OpenM++ as a service, and facilitating the uploading and transpiling of Modgen style models into OpenM++ style models. So in my opinion that would a priority in terms of providing a well-rounded service.
Also it might be good for me to engage in some learning tasks about kubernetes and aaw based deployments.
The text was updated successfully, but these errors were encountered: