vision-ai
Here are 36 public repositories matching this topic...
📺 Instill Console for 🔮 Instill Core: https://github.com/instill-ai/instill-core
Updated Nov 2, 2025 - TypeScript
Snappy: Vision-first document retrieval using ColPali embeddings. Search PDFs with FastAPI, Next.js 16, Qdrant, and React 19.2
Updated Nov 3, 2025 - TypeScript
This repository demonstrates YOLOv8-based license plate recognition with GCP Vision AI integration, enabling versatile real-world applications like vehicle identification, traffic monitoring, and geospatial analysis while capturing vital media metadata for enhanced insights.
Updated Feb 1, 2024 - Jupyter Notebook
[CVPRW'25] Official Code For "SK-RD4AD: Skip-Connected Reverse Distillation for One-Class Anomaly Detection"
Updated Jul 7, 2025 - Python
MCQ_Grading_Bot is an AI-powered tool that grades solved MCQ exam sheets from images using Gemini Vision. It extracts student info, checks answers, calculates score, and displays detailed results—all through a simple Gradio interface in Colab.
Updated Jun 19, 2025 - Jupyter Notebook
Bidirectional Markdown↔PDF converter with AI-powered vision. MD→PDF with beautiful themes, PDF→MD with LLaVA - open source & privacy-first
Updated Oct 30, 2025 - TypeScript
MDDenseResNet : Enhanced Malware Detection Using DNNs
Updated Jul 27, 2025 - Jupyter Notebook
Hybrid AI orchestration stack combining local LLMs (Ollama), vector search (Qdrant), and Azure AI Foundry for scalable RAG, Agentic AI, and Vision. Built with .NET 8 and Python.
Updated Oct 12, 2025 - Python
General vision AI defect detection engine for MLOps processes and simulations
Updated Mar 5, 2025 - Python
Backend for the Pinterest project by the OND team
Updated Mar 2, 2024 - Go
Eagle-Eye-AI is a project designed for the Kria KR260 board that enables AI-driven camera tracking and face detection.
Updated Sep 7, 2025 - Tcl
Updated Aug 18, 2025 - Jupyter Notebook
People detection and notifications based on the Raspberry Pi + AI Camera
Updated Feb 3, 2025 - Python
MetaSynAI is an AI‑driven accessibility framework that enables seamless interaction through voice commands, hand gestures, and eye‑tracking, offering a modern and inclusive way to control web interfaces.
Updated Jul 27, 2025 - HTML
Detect text in images using Autogon AI
Updated Jul 9, 2024 - JavaScript
Gemini Vision & Image Generation MCP for Claude Desktop and Claude Code
Updated Sep 3, 2025 - JavaScript
Extract text from images using multiple AI providers - local SmolVLM, Ollama LLaVA, or OpenAI GPT-4o
Updated Oct 16, 2025 - Python
🏡 Instill AI organisation profile and default configuration
Updated Jun 13, 2025