Objectives
- Allow users to insert images and generate responses using the LLaVa model.
- Enhance user experience by providing a seamless interface for image insertion and prompt creation.
Leads
User Stories
In Scope
-
As a user, I want to easily insert images into Jan's interface, either by dropping files or using buttons:
- Scenario: I aim to effortlessly insert images.
- Acceptance Criteria: The interface should support drag-and-drop functionality or provide clear buttons for uploading images.
-
As a user, I want a user-friendly interface to create text prompts for the LLaVa model linked to the inserted images:
- Scenario: I seek a seamless way to input prompts for the LLaVa model connected to the inserted images.
- Acceptance Criteria: The interface should offer a clear space to input text prompts directly related to the inserted images.
-
As a user, I want to view and interact with LLaVa model responses based on the inserted images and associated prompts:
- Scenario: After inserting images and prompts, I expect to access and interact with the LLaVa model's generated responses.
- Acceptance Criteria: The generated responses should be visible and easily accessible within the interface, showcasing accurate output based on the inserted image and associated prompts.
Out-of-Scope
- As a user, I want to see prompts suggestions based on the capabilities of the LLaVa model.
- As a user, I want to attach many images at the same time.
Design Wireframes
Key Considerations
Figma link

Engineering & Architecture
In Scope
- For @vuonghoainam to input.
- Nitro supports image capability @tikikun
Out-of-Scope
Tasklist
Resources
https://twitter.com/nathanlands/status/1709539003312259172?s=46&t=osxIAvq8ztXuDbNAm11thA
https://twitter.com/LMStudioAI/status/1734640355318944190
Out of scope
Nitro supports speech/hear capability
Objectives
Leads
User Stories
In Scope
As a user, I want to easily insert images into Jan's interface, either by dropping files or using buttons:
As a user, I want a user-friendly interface to create text prompts for the LLaVa model linked to the inserted images:
As a user, I want to view and interact with LLaVa model responses based on the inserted images and associated prompts:
Out-of-Scope
Design Wireframes
Key Considerations
Figma link
Engineering & Architecture
In Scope
Out-of-Scope
Tasklist
model.jsonfor multiple model artifacts #1049model.jsonformat for Nitro inference engine #1088Resources
https://twitter.com/nathanlands/status/1709539003312259172?s=46&t=osxIAvq8ztXuDbNAm11thA
https://twitter.com/LMStudioAI/status/1734640355318944190
Out of scope
Nitro supports speech/hear capability