This repository contains a Flask-based application that uses Ollama to serve the Phi3 model as an interactive chatbot. The application is designed to provide fast and accurate responses to user queries through a microservices architecture in which the front-end and back-end are isolated. Unlike setups where Streamlit and Ollama might be combined in a single application, our system uses REST APIs for communication between the front-end and back-end, allowing for greater flexibility and modularity.
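The division of responsibilities can be pictured with a minimal sketch of the back-end side. Everything below is illustrative: the route name (`/api/chat`), the JSON fields, and the Ollama hostname (`ollama`, port 11434, as it might be named in a compose file) are assumptions rather than this repository's actual code; only the general pattern, in which Flask receives a REST call and forwards the prompt to Ollama's `/api/generate` endpoint, reflects the architecture described here.

```python
# Hypothetical sketch of the back-end's role: a Flask route accepts a user query
# over REST and forwards it to the Ollama service running Phi3.
# Route name, payload shape, and the Ollama hostname/port are assumptions.
import requests
from flask import Flask, jsonify, request

app = Flask(__name__)
OLLAMA_URL = "http://ollama:11434/api/generate"  # assumed compose service name

@app.route("/api/chat", methods=["POST"])
def chat():
    prompt = request.get_json().get("prompt", "")
    # Forward the prompt to Ollama and return the generated text to the front-end.
    resp = requests.post(
        OLLAMA_URL,
        json={"model": "phi3", "prompt": prompt, "stream": False},
        timeout=120,
    )
    resp.raise_for_status()
    return jsonify({"response": resp.json().get("response", "")})

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=8000)
```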
- Interactive chatbot interface
- Model optimization for faster inference
- Efficient resource management with Docker
- Microservices architecture with isolated front-end and back-end components
- Docker
- Docker Compose
- Flask (for back-end)
- Front-end application framework (e.g., React)
- Clone the repository:

  ```bash
  git clone https://github.com/your-repo/ollama_phi3_chatbot.git
  cd ollama_phi3_chatbot
  ```
- Build and run the Docker containers:

  ```bash
  sudo docker-compose up --build
  # or, with a development env file:
  docker-compose --env-file .env.dev up --build --remove-orphans
  ```
- Pull the Phi3 (or another) model inside the Ollama container; see the Ollama GitHub and Ollama Docker Hub pages for more details:

  ```bash
  docker exec -it ollama ollama run phi3
  ```
- Open the application:
  - The Flask back-end will be running on port `8000`.
  - Your front-end application should make REST API calls to this back-end service.
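Once the containers are up, the back-end can be exercised from any REST client. The snippet below is a hedged example using Python's `requests`; the `/api/chat` path and the `prompt`/`response` fields are assumptions standing in for the repository's actual routes.

```python
# Quick check that the back-end answers on port 8000.
# The endpoint name and JSON fields are assumed for illustration.
import requests

resp = requests.post(
    "http://localhost:8000/api/chat",
    json={"prompt": "Hello, who are you?"},
    timeout=120,
)
print(resp.status_code, resp.json())
```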
- Front-End Interaction:
  - The front-end application communicates with the back-end using REST APIs.
  - It sends user queries to the back-end and displays the responses.
- Session Management:
  - Each session is managed independently, allowing users to interact with different sessions concurrently (a minimal sketch follows below).
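One way such independence can be achieved, as a rough sketch only, is to key conversation history by a client-supplied session ID. The field names and in-memory store below are illustrative assumptions, not the project's actual implementation.

```python
# Hypothetical per-session history keyed by a client-supplied session_id.
# Field names and the in-memory dict are illustrative assumptions.
from collections import defaultdict
from flask import Flask, jsonify, request

app = Flask(__name__)
histories = defaultdict(list)  # session_id -> list of (role, text) turns

@app.route("/api/chat", methods=["POST"])
def chat():
    data = request.get_json()
    session_id = data.get("session_id", "default")
    prompt = data.get("prompt", "")
    histories[session_id].append(("user", prompt))
    reply = f"(model reply to: {prompt})"  # placeholder for the Ollama call
    histories[session_id].append(("assistant", reply))
    return jsonify({"session_id": session_id, "response": reply})
```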
To ensure the chatbot runs efficiently, the following strategies have been employed:
- Model Optimization: Quantization and pruning to reduce model size and improve speed.
- Efficient Loading: Lazy loading and caching of models and predictions (see the sketch after this list).
- Hardware Acceleration: Utilization of GPU/TPU and multi-threading for faster computation.
- Asynchronous Processing: Handling multiple requests concurrently using asynchronous processing.
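As a hedged illustration of the "Efficient Loading" point above, expensive objects can be created lazily on first use and identical prompts served from a small cache. The helper names and the Ollama URL below are assumptions for the sketch, not code from this repository.

```python
# Illustrative lazy loading + response caching (not the repository's actual code).
from functools import lru_cache

import requests

OLLAMA_URL = "http://ollama:11434/api/generate"  # assumed service name/port

@lru_cache(maxsize=1)
def get_session():
    # Created once, on first use, and reused for every request (lazy loading).
    return requests.Session()

@lru_cache(maxsize=256)
def generate(prompt: str) -> str:
    # Identical prompts are answered from the cache instead of re-running the model.
    resp = get_session().post(
        OLLAMA_URL,
        json={"model": "phi3", "prompt": prompt, "stream": False},
        timeout=120,
    )
    resp.raise_for_status()
    return resp.json().get("response", "")
```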
If you wish to contribute to this project, please fork the repository and create a pull request with your changes.
This project is licensed under the MIT License. See the LICENSE file for details.
- Thanks to the creators of the Ollama and Phi3 models.
- Special thanks to the Flask and Docker communities for providing excellent tools for building and managing services.