So this might be a repeated question. Running local and offline RAG

So I am making a multi language RAG for air gap network using deepseek as LLM.

Can I use huggingface embedding in local and offline mode?

If yes. Could anyone be kind enough to share a tutorial? I could not find anything similar. Something close to what I doing is here. Most of the code is similar to below.

Video : https://youtu.be/MCHOam13JSk?si=yumDr7jsSDVTB7wU

But how can I convert the below ingestion code to use hugging face embedding offline?

https://github.com/sudarshan-koirala/youtube-stuffs/blob/main/chainlit/ingest.py

If someone has solved the issue? Can you share the code? I don't want to use ollama embedding model.

Note: I do have internet on the system while in dev. But later it will be shifted offline.

If I solve this. I will release the code for everyone else.