The xInfer Model Zoo API
Welcome to the xInfer::zoo. This is the high-level, "batteries-included" API for xInfer.
The zoo is a collection of pre-packaged, hyper-optimized, and incredibly easy-to-use solutions for the world's most common AI tasks.
The Philosophy: Solutions, Not Just Tools
While the Core Toolkit provides the powerful, low-level "engine parts" for building custom pipelines, the zoo provides the finished "F1 car."
Each class in the zoo is a complete, end-to-end pipeline that abstracts away all the complexity of pre-processing, inference, and post-processing. The goal is to let you solve a complex problem like real-time object detection or image generation with just two lines of C++ code:
- One line to initialize the pipeline from a pre-built engine.
- One line to predict.
This is the power of the zoo. It gives you the full, state-of-the-art performance of a custom C++/CUDA/TensorRT application with the simplicity of a high-level library.
Key Features of All zoo Classes
- Performance by Default: Every
zooclass is built on top of the hyper-performantxInfer::core::InferenceEngineand uses fused CUDA kernels from thepreprocandpostprocmodules wherever possible. - Simple, Task-Oriented API: You don't interact with raw tensors. You provide a
cv::Matimage and get back astd::vector<BoundingBox>or astd::string. The API is designed around the final answer, not the intermediate steps. - Seamless Hub Integration: Most
zooclasses have special constructors that can download a pre-built, perfectly optimized engine for your hardware directly from the Ignition Hub. This provides a "zero-setup" user experience. - Robust and Production-Ready: These classes are designed to be used directly in your final application. They are efficient, safe, and easy to integrate.
The zoo Catalog of Solutions
The zoo is organized by domain. Explore the available pipelines below to find the solution you need.
🖼️ Computer Vision
Tools for understanding and analyzing visual information from images and video. This is the most mature and comprehensive part of the zoo.
- Tasks: Image Classification, Object Detection, Semantic & Instance Segmentation, Pose Estimation, Face Recognition, OCR, and many more.
✨ Generative AI
Powerful pipelines for creating novel content, from images and audio to 3D models.
- Tasks: Text-to-Image (Diffusion), Image Generation (GANs), Super-Resolution, Style Transfer, Text-to-Speech.
📝 Natural Language Processing (NLP)
High-throughput, low-latency solutions for understanding and processing human language.
- Tasks: Text Classification, Named Entity Recognition (NER), Sentence Embeddings for RAG, Summarization, and Translation.
🎧 Audio & Signal Processing
Real-time pipelines for analyzing audio signals, from speech to environmental sounds.
- Tasks: Speech Recognition, Audio Classification, Speaker Identification, Music Source Separation.
➡️ Explore the Audio & DSP API
📈 Time Series
Specialized solutions for forecasting and analyzing sequential data.
- Tasks: Forecasting, Anomaly Detection, and Classification.
➡️ Explore the Time Series API
🧊 3D & Spatial Computing
Cutting-edge pipelines for processing 3D data from sensors like LIDAR.
- Tasks: 3D Reconstruction (Gaussian Splatting), Point Cloud Detection, Point Cloud Segmentation.
🌍 Geospatial
Specialized tools for analyzing satellite and aerial imagery.
- Tasks: Building & Road Segmentation, Change Detection, Maritime Object Detection.
⚕️ Medical Imaging
High-performance pipelines for medical image analysis.
- Tasks: Tumor Detection, Cell Segmentation, Retinal Abnormality Scanning, Artery Analysis.
📄 Document AI
Pipelines for understanding the structure and content of documents.
- Tasks: Table Extraction, Signature Detection, Handwriting Recognition.
➡️ Explore the Document AI API
🚀 Specialized & RL
Hyper-specialized, high-value solutions for specific industries and advanced applications.
- Tasks: Reinforcement Learning Policies, Financial (HFT) Models, Physics Simulation, Genomics.
