I'm planning to fine-tune LLaMA 3.2 11B Instruct on a JSONL dataset of domain-specific question-answer pairs (purely text, no images). The goal is to improve its instruction-following behavior for specialized text tasks while retaining its ability to handle multimodal inputs such as OCR and image-based queries.
The examples include a sample .yaml file for this, and I made a similar .yaml file based on it. But when I run it I get the error [...], and when I remove the field [...] or even [...] I get the error [...]. What happened here?
Python version: 3.12, which was previously used to fine-tune Llama 3.1 8B using the following config.yaml: [...]
Thank you.
Hello, you're almost there. The key is [...] What you should do instead is take a regular llama3 config (like the Nous Research one), change the model path, and train on your data with it. That's all :) If you're worried about vision being lost, you can do a short training run on mllama and then verify that you can still load it with some inference tools.
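For illustration, here is a minimal sketch of what such a text-only config might look like. It is not the actual example file: the model ID, dataset path, dataset format, and every hyperparameter are assumptions chosen for readability, and it assumes that "regular llama3 config" means a text-only LoRA setup without the multimodal `processor_type: AutoProcessor` field.

```yaml
# Sketch only: a text-only LoRA config in the style of a regular llama3 example.
# Every value below is an assumption for illustration, not taken from this thread.

# Assumed Hugging Face model ID for LLaMA 3.2 11B Instruct; replace with your own model path.
base_model: meta-llama/Llama-3.2-11B-Vision-Instruct

# No `processor_type: AutoProcessor` here (assumption: it is only needed for image data).

datasets:
  - path: data/qa_pairs.jsonl   # hypothetical local file
    ds_type: json
    type: alpaca                # assumes each row has instruction / input / output fields

val_set_size: 0.05
output_dir: ./outputs/llama32-text-lora   # hypothetical output directory

sequence_len: 2048

adapter: lora
lora_r: 16
lora_alpha: 32
lora_dropout: 0.05
# These module names may also match layers outside the language model;
# narrow them if you want to leave the vision side untouched.
lora_target_modules:
  - q_proj
  - k_proj
  - v_proj
  - o_proj

micro_batch_size: 1
gradient_accumulation_steps: 4
num_epochs: 1
optimizer: adamw_torch
lr_scheduler: cosine
learning_rate: 0.0002
bf16: auto
gradient_checkpointing: true
```

If your JSONL uses different field names (for example `question` and `answer`), the dataset `type` would need to change accordingly. Keeping the first run short makes it cheap to check afterwards that the trained model still loads and handles an image prompt.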
Hi @NanoCode012, thank you very much for the clear and helpful guidance! The note about processor_type: AutoProcessor and switching to a regular LLaMA3 config was exactly what I needed. I appreciate your time and insight.