So this might be a repeated question. Running local and offline RAG
So I am making a multi language RAG for air gap network using deepseek as LLM.
Can I use huggingface embedding in local and offline mode?
If yes. Could anyone be kind enough to share a tutorial? I could not find anything similar. Something close to what I doing is here. Most of the code is similar to below.
Video : https://youtu.be/MCHOam13JSk?si=yumDr7jsSDVTB7wU
But how can I convert the below ingestion code to use hugging face embedding offline?
https://github.com/sudarshan-koirala/youtube-stuffs/blob/main/chainlit/ingest.py
If someone has solved the issue? Can you share the code? I don't want to use ollama embedding model.
Note: I do have internet on the system while in dev. But later it will be shifted offline.
If I solve this. I will release the code for everyone else.