Compare the Top Nonprofit Computer Vision Software as of November 2025 - Page 2

  • 1
    FieldDay

    FieldDay

    FieldDay

    Unlock the world of AI and Machine Learning right on your phone with FieldDay. We’ve taken the complexity out of creating machine learning models and turned it into an engaging, hands-on experience that’s as simple as using your camera. FieldDay allows you to create custom AI apps and embed them in your favourite tools, using just your phone. Feed FieldDay examples to learn from, and generate a custom model ready to be embedded in your app/project. A range of projects and apps driven by custom FieldDay machine learning models. Our range of integrations and export options simplifies the process of embedding a machine-learning model into the platform you prefer. With FieldDay, you can collect data directly from your phone’s camera. Our bespoke interface is designed for easy and intuitive annotation during collection, so you can build a custom dataset in no time. FieldDay lets you preview and correct your models in real-time.
    Starting Price: $19.99 per month
  • 2
    Voxel51

    Voxel51

    Voxel51

    FiftyOne by Voxel51 - the most powerful visual AI and computer vision data platform. Without the right data, even the smartest AI models fail. FiftyOne gives machine learning engineers the power to deeply understand and evaluate their visual datasets—across images, videos, 3D point clouds, geospatial, and medical data. With over 2.8 million open source installs and customers like Walmart, GM, Bosch, Medtronic, and the University of Michigan Health, FiftyOne is an indispensable tool for building computer vision systems that work in the real world, not just in the lab. FiftyOne streamlines visual data curation and model analysis with workflows to simplify the labor-intensive processes of visualizing and analyzing insights during data curation and model refinement—addressing a major challenge in large-scale data pipelines with billions of samples. Proven impact with FiftyOne: ⬆️30% increase in model accuracy ⏱️5+ months of development time saved 📈30% boost in productivity
    Starting Price: $0
  • 3
    Azure AI Custom Vision
    Create a custom computer vision model in minutes. Customize and embed state-of-the-art computer vision image analysis for specific domains with AI Custom Vision, part of Azure AI Services. Build frictionless customer experiences, optimize manufacturing processes, accelerate digital marketing campaigns, and more. No machine learning expertise is required. Set your model to perceive a particular object for your use case. Easily build your image identifier model using the simple interface. Start training your computer vision model by simply uploading and labeling a few images. The model tests itself on these and continually improves precision through a feedback loop as you add images. To speed development, use customizable, built-in models for retail, manufacturing, and food. See how Minsur, one of the world's largest tin mines, uses AI Custom Vision for sustainable mining. Rely on enterprise-grade security and privacy for your data and any trained models.
    Starting Price: $2 per 1,000 transactions
  • 4
    Qwen2-VL

    Qwen2-VL

    Alibaba

    Qwen2-VL is the latest version of the vision language models based on Qwen2 in the Qwen model familities. Compared with Qwen-VL, Qwen2-VL has the capabilities of: SoTA understanding of images of various resolution & ratio: Qwen2-VL achieves state-of-the-art performance on visual understanding benchmarks, including MathVista, DocVQA, RealWorldQA, MTVQA, etc. Understanding videos of 20 min+: Qwen2-VL can understand videos over 20 minutes for high-quality video-based question answering, dialog, content creation, etc. Agent that can operate your mobiles, robots, etc.: with the abilities of complex reasoning and decision making, Qwen2-VL can be integrated with devices like mobile phones, robots, etc., for automatic operation based on visual environment and text instructions. Multilingual Support: to serve global users, besides English and Chinese, Qwen2-VL now supports the understanding of texts in different languages inside images
    Starting Price: Free
  • 5
    Prophesee Metavision
    Metavision is an advanced event-based vision software toolkit developed by Prophesee, designed to facilitate the evaluation, design, and commercialization of event-based vision products. The SDK offers a comprehensive suite of tools, including 64 algorithms, 105 code samples, and 17 tutorials, enabling developers to efficiently build and deploy event-based applications. The open source architecture of Metavision SDK ensures full interoperability between software and hardware devices, fostering a rapidly growing event-based vision community. The platform covers a wide range of computer vision fields, such as machine learning, computer vision, camera calibration, and high-performance applications. Developers have access to extensive documentation, including over 300 pages of content, programming guides, and reference data, providing a solid foundation for product development. Metavision SDK5 PRO includes advanced add-ons like high-speed counting, spatter monitoring, and more.
    Starting Price: Free
  • 6
    Qwen2.5-VL

    Qwen2.5-VL

    Alibaba

    Qwen2.5-VL is the latest vision-language model from the Qwen series, representing a significant advancement over its predecessor, Qwen2-VL. This model excels in visual understanding, capable of recognizing a wide array of objects, including text, charts, icons, graphics, and layouts within images. It functions as a visual agent, capable of reasoning and dynamically directing tools, enabling applications such as computer and phone usage. Qwen2.5-VL can comprehend videos exceeding one hour in length and can pinpoint relevant segments within them. Additionally, it accurately localizes objects in images by generating bounding boxes or points and provides stable JSON outputs for coordinates and attributes. The model also supports structured outputs for data like scanned invoices, forms, and tables, benefiting sectors such as finance and commerce. Available in base and instruct versions across 3B, 7B, and 72B sizes, Qwen2.5-VL is accessible through platforms like Hugging Face and ModelScope.
    Starting Price: Free
  • 7
    Rapid Monitor

    Rapid Monitor

    Rapid Global

    Rapid Global’s AI Safety Software is a computer vision platform designed to enhance workplace safety by detecting unsafe acts and hazardous conditions in real time. Compatible with most IP cameras, it seamlessly integrates with existing surveillance systems, ensuring easy deployment and secure, on-site data processing. Users can customize monitoring parameters by selecting specific objects, areas, and timeframes, and set tailored alarm notifications to identify unsafe behaviors as they occur. It detects missing personal protective equipment, tracks forklift-pedestrian near misses, and monitors unauthorized activity within designated zones, such as individuals standing on conveyor belts or moving outside assigned walkways. These capabilities enable organizations to proactively prevent incidents and improve safety outcomes.
    Starting Price: Free
  • 8
    EarthCam

    EarthCam

    EarthCam

    EarthCam offers a comprehensive suite of construction camera solutions designed to monitor, document, and promote projects through high-quality visual content. It provides advanced AI video analytics, enabling real-time insights into jobsite readiness, activity, and stress metrics, akin to a smartwatch biometrics report for your project. EarthCam's innovative webcams facilitate live streaming, 4K time-lapse, and 360° VR tours, enhancing visual collaboration and security with 24/7 recordings. EarthCam identifies over 30 job site materials, integrating seamlessly with Procore for schedule overlays and safety advisories. EarthCam's time-lapse services include image stabilization, enhancement, and customized music, delivering polished videos in multiple formats for marketing and archival purposes.
    Starting Price: Free
  • 9
    RoboRealm

    RoboRealm

    RoboRealm

    RoboRealm is a Windows-based machine vision software designed to simplify vision programming and enable rapid prototyping with advanced modules. It features an intuitive GUI requiring no or low code, making it accessible for both casual users and serious robotic scientists. It supports hundreds of image processing modules and is camera agnostic, allowing for flexibility in hardware choices. Users can experience real-time parameter changes, and the software includes a fully supported server API for integration with other systems. RoboRealm accommodates multiple image sources and offers various output interfaces, including file, web, FTP, and email. Its plugin framework allows for the development of custom modules, and an active online community provides expert assistance. It enables the combination of modules through an easy-to-use pipeline to create tailored solutions for tasks such as surface defect detection, measurement, counting, detection, etc.
    Starting Price: $25 per month
  • 10
    Rosepetal AI

    Rosepetal AI

    Rosepetal AI

    Rosepetal AI is an innovative technology company specializing in advanced artificial vision and deep-learning solutions designed specifically for industrial quality control. Our platform integrates dataset handling, automated labelling and training of adaptive neural networks, enabling real-time defect detection without requiring advanced technical expertise. This intuitive, no-code SaaS solution democratizes access to sophisticated AI, significantly enhancing efficiency, reducing waste, and driving operational excellence across multiple industries such as automotive, food processing, pharmaceuticals, plastics, and electronics. The unique strength of Rosepetal AI lies in its dynamic adaptability and scalability. Our system allows industrial companies to quickly deploy robust AI models directly onto their production lines, continuously adjusting to new product variations and emerging defects. This capability ensures consistent quality, minimizes downtime.
    Starting Price: $195
  • 11
    Scandit

    Scandit

    Scandit

    Scandit is the leader in smart data capture giving superpowers to workers, customers and businesses by providing actionable insights and automating end-to-end processes. Our Smart Data Capture platform enables smart devices, such as smartphones, drones, digital eyewear and robots to interact with physical items by capturing data from barcodes, text, IDs and objects with unmatched speed, accuracy and intelligence. Scandit accurately scans up to 3x faster than dedicated scanners in challenging light or at angles, on damaged labels, across multiple codes on any smart device. We enable innovation that delivers significant cost savings, increases employee retention and customer loyalty. Scandit partners with customers at every step with trials, solution design, integration and customer success support included. Visit scandit.com to learn why many market leaders trust us.
  • 12
    Interplay

    Interplay

    Iterate.ai

    Interplay Platform is a patented low-code platform with 475 pre-built connectors (enterprise, AI, IoT, Startup Technologies). It's used as middleware and as a rapid app building platform by big companies like Circle K, Ulta Beauty, and many others. As middleware, it operates Pay-by-Plate (frictionless payments at the gas pump) in Europe, Weapons Detection (to predict robberies), AI-based Chat, online personalization tools, low price guarantee tools, computer vision applications such as damage estimation, and much more. It also helps companies to go to market with their digital solutions 10X to 17X faster than in old ways.
  • 13
    Amazon Rekognition
    Amazon Rekognition makes it easy to add image and video analysis to your applications using proven, highly scalable, deep learning technology that requires no machine learning expertise to use. With Amazon Rekognition, you can identify objects, people, text, scenes, and activities in images and videos, as well as detect any inappropriate content. Amazon Rekognition also provides highly accurate facial analysis and facial search capabilities that you can use to detect, analyze, and compare faces for a wide variety of user verification, people counting, and public safety use cases. With Amazon Rekognition Custom Labels, you can identify the objects and scenes in images that are specific to your business needs. For example, you can build a model to classify specific machine parts on your assembly line or to detect unhealthy plants. Amazon Rekognition Custom Labels takes care of the heavy lifting of model development for you, so no machine learning experience is required.
  • 14
    Supervisely

    Supervisely

    Supervisely

    The leading platform for entire computer vision lifecycle. Iterate from image annotation to accurate neural networks 10x faster. With our best-in-class data labeling tools transform your images / videos / 3d point cloud into high-quality training data. Train your models, track experiments, visualize and continuously improve model predictions, build custom solution within the single environment. Our self-hosted solution guaranties data privacy, powerful customization capabilities, and easy integration into your technology stack. A turnkey solution for Computer Vision: multi-format data annotation & management, quality control at scale and neural networks training in end-to-end platform. Inspired by professional video editing software, created by data scientists for data scientists — the most powerful video labeling tool for machine learning and more.
  • 15
    Hive Data
    Create training datasets for computer vision models with our fully managed solution. We believe that data labeling is the most important factor in building effective deep learning models. We are committed to being the field's leading data labeling platform and helping companies take full advantage of AI's capabilities. Organize your media with discrete categories. Identify items of interest with one or many bounding boxes. Like bounding boxes, but with additional precision. Annotate objects with accurate width, depth, and height. Classify each pixel of an image. Mark individual points in an image. Annotate straight lines in an image. Measure, yaw, pitch, and roll of an item of interest. Annotate timestamps in video and audio content. Annotate freeform lines in an image.
    Starting Price: $25 per 1,000 annotations
  • 16
    Mobius Labs

    Mobius Labs

    Mobius Labs

    We make it easy to add superhuman computer vision to your applications, devices and processes to give you unassailable competitive advantage. No code, customizable & on-premise AI solutions.
  • 17
    FindFace

    FindFace

    NtechLab

    NtechLab platform processes video and recognizes human faces, bodies and actions, as well as cars and plate numbers. AI-powered technology enables record breaking accuracy and high speed of recognition. The multi-object and analytical capabilities of FindFace Multi unlock new scenarios for responding challenges of public sector and business. FindFace Multi quickly and accurately recognizes faces, human bodies, cars, and license plate numbers in a live video stream or in a video archive. Searching for faces, bodies, and vehicles in a database or in an archive is available both by a photo sample and by specific features, for example, by age, clothes color, or vehicle model. NtechLab developers are constantly improving recognition algorithms, increasing their performance and accuracy. With FindFace Multi it takes less than a second to detect a face in a video stream, recognize it, and search for a match in a database with billions of images.
  • 18
    Unleash live
    Unleash live is an A.I. video analytics enterprise solution provider. We take a vision from any camera and combine it with computer vision to deliver actionable data in real-time so that your organization has immediate insights to drive down costs, improve productivity, increase accuracy, and improve safety. Support for a wide range of cameras. Connect any combination of IP/CCTV, drone, body cam, mobile or robotic cameras. Live stream in the field and share it with your team while operations are in progress, or upload footage into your account. Apply A. I Apps from our app store to detect, inspect and monitor objects and items of interest or create 2D orthomaps and 3D models. Integrate results into your operational workflow, from live dashboards, to notifications and API integrations. Take the complexity and time out of collaboration. Instantly connect any mix of cameras to share over a live stream with stakeholders and 3rd parties. No plugs-in, no downloads, all in the browser.
    Starting Price: $99 per month
  • 19
    SiaSearch

    SiaSearch

    SiaSearch

    We want ML engineers to worry less about data engineering and focus on what they love, building better models in less time. Our product is a powerful framework that makes it 10x easier and faster for developers to explore, understand and share visual data at scale. Automatically create custom interval attributes using pre-trained extractors or any other model. Visualize data and analyze model performance using custom attributes combined with all common KPIs. Use custom attributes to query, find rare edge cases and curate new training data across your whole data lake. Easily save, edit, version, comment and share frames, sequences or objects with colleagues or 3rd parties. SiaSearch, a data management platform that automatically extracts frame-level, contextual metadata and utilizes it for fast data exploration, selection and evaluation. Automating these tasks with metadata can more than double engineering productivity and remove the bottleneck to building industrial AI.
  • 20
    VisionSense
    Real-time computer vision and advanced image processing solution that leverages advanced models of convolutional neural networks. The top application of the product has been in building management, identity verification and fraud detection, manufacturing and quality control. Winjit is one of India’s leading technology providers with over a decade of experience in innovating engineering solutions across industries.
  • 21
    Vyntelligence

    Vyntelligence

    Vyntelligence

    Boost operational efficiency and reduce risk and costs with the power of Vyn SmartVideoNotes. Video-enabled structured data capture into enterprise systems, to enhance and replace manual/text form fields in just 60 seconds. Timely, auto-labeled and rich data to drive higher compliance and productivity to save on costs as leaders gain better insight to act faster. Enterprise-grade security, open API SaaS platform designed for any workflow integration e.g. CRM (Salesforce), FSM and people systems. AI-powered Computer Vision & Natural language processing, video search and analyses deliver quantitative trends from qualitative data for richer, smarter business decisions. Bring your processes to life in a whole new way by quickly building intelligence from your field teams with vyn, so you see what’s happening and why. vyn captures SmartVideoNotes, on the go, by asking the right people the right questions at the right time - all in a minute or less.
  • 22
    Black.ai

    Black.ai

    Black.ai

    Respond to events and make better decisions with the help of AI and your existing IP camera infrastructure. Cameras are almost exclusively used for security and surveillance purposes. We add cutting-edge Machine Vision models to unlock a high-impact resource available to your team daily. We help you to improve operations for your staff and customers without compromising privacy. No facial recognition, or long-term tracking, no exceptions. Fewer people in the loop. A reliance on staff compiling and watching footage is invasive and unscalable. We help you to review only the things that matter and only at the right time. Black.ai creates a privacy layer that sits between security cameras and operations teams, so you can build a better experience for people without breaching their trust. Black.ai interfaces with your existing cameras using parallel streaming protocols. Our system is installed without additional infrastructure cost or any risk of obstructing operations.
  • 23
    Plainsight

    Plainsight

    Plainsight

    Remove the complexity from your machine learning projects with our vision AI platform built from the ground up for fast, effective video analytics application development. With easy, no-code point-and-click features all in one platform, Plainsight slashes your time-to-production and accelerates the success of vision AI-powered solutions across industries. Connect, administer, & control cameras, sensors & edge devices in one interface. Collect accurate training datasets to provide a high-quality training foundation for models. Accelerate labeling with smart polygon selection, predictive labeling, & automated object recognition. Easily train models with a breakthrough process designed to reduce time to vision AI solutions. Quickly deploy & scale applications at the edge, in the cloud, or on-premises to meet business needs.
  • 24
    TuMeke

    TuMeke

    TuMeke Ergonomics

    No need for wearables, goniometers, or other equipment. Measure and automatically track the safety of employees without stopping production. Stop filling out long assessment worksheets so you can focus on giving great recommendations. Manage videos and assessment results across teams and devices. Enterprise features to make the most of your resources. Our platform includes a phone and web app that work together to allow teams to collaborate across locations, get automatic recommendations on postures to investigate and a dashboard to track performance over time.
  • 25
    Amazon Lookout for Vision
    Easily create a machine learning (ML) model to spot anomalies from your live process line with as few as 30 images. Identify visual anomalies in real time to reduce and prevent defects and improve product quality. Prevent unplanned downtime and reduce operational costs by using visual inspection data to spot potential issues and take corrective action. Spot damage to a product’s surface quality, color, and shape during the fabrication and assembly process. Determine what’s missing based on the absence, presence, or placement of objects, like a missing capacitor in a printed circuit board. Detect defects with repeating patterns, such as repeated scratches in the same spot on a silicon wafer. Amazon Lookout for Vision is an ML service that uses computer vision to spot defects in manufactured products at scale. Spot product defects using computer vision to automate quality inspection.
  • 26
    Ai-RGUS

    Ai-RGUS

    Ai-RGUS

    Ai-rgususes Artificial Intelligence and custom-built software to automatically catch camera view problems; camera/NVR/DVR misconfigurations or failures; wrong timestamp; and missing or not enough days of recordings. With Ai-rgus you will save time compared to doing it manually and you will have peace of mind that your camera system has the footage you need before an incident. Efficient: Automated verification, saves time from manually reviewing cameras, and enables hassle-free camera system growth. AI verification is reliable and consistent. Proactive verification, providing confidence that desired image exists including for slip and falls and loss prevention cases. Ai-RGUS makes sure that the task of camera verification is done, with a consistent verification quality, and sends automatic email alerts.
  • 27
    CVEDIA

    CVEDIA

    CVEDIA

    CVEDIA-RT is our AI software stack that comes pre-installed with dozens of video analytics and computer vision solutions. It's easy to configure and customize to your use case, even if you're not a data scientist or developer. For a single low price, you have access to all of our AI solutions now and in the future. This means you can discover new use cases and expand your AI capabilities risk-free! If you couldn't find what you are looking for, or you want to run on another device, no problem. We are happy to develop custom solutions based on your requirements. Reach out to us for a free call! What sets us apart from everyone else is our use of synthetic data. Our analytics are more accurate, faster, and affordable than traditional solutions. Your team is busy and deadlines are near, we get it. If you like, we can take care of everything, from development to integration of the analytics. All you have to do is build a product around it!
    Starting Price: Free
  • 28
    FlyPix AI

    FlyPix AI

    FlyPix AI

    FlyPix AI is an object detection platform designed for analyzing satellite and drone imagery . It allows users to effortlessly detect, segment and localize objects and areas within geospatial data. Users can use FlyPix AI advanced functionalities to track changes and detect anomalies. Plus, it's user-friendly and intuitive interface empowers users without coding expertise to create customized use cases and extract valuable information from earth observation data.
    Starting Price: €890
  • 29
    Yandex Vision
    Yandex Vision OCR recognizes text in an image and outputs it along with automatic punctuation. The service supports and automatically identifies more than 50 languages. Extract standard fields and recognize text in templates and documents, e.g., passports, driver’s licenses, vehicle registration certificates, and license plates. With support for Russian and English, as well as combinations of handwritten and printed texts. The service scans the table structure and outputs text in row and column coordinates. Optical character recognition (OCR), document recognition, and license plate number recognition. Yandex Vision OCR allows you to work with JPEG, PNG, and PDF formats. File sizes should be no larger than 20 MB with no more than 300 pages per file. The service can scan images and find passports from 20 countries, driver’s licenses, vehicle registration documents, and license plates.
  • 30
    Campedia

    Campedia

    Campedia

    Campedia is like ChatGPT but for the real world. You snap a photo and ask any question. Identify a plant, ask about an attraction, or let it create a recipe from ingredients in your refrigerator. Campedia is powered by GPT-4 Vision, which is able to take in images and answer questions about them. It is a breakthrough new technology that enables an AI to see. A revolutionary new AI meets a radically simplified user interface. Campedia turns your entire screen into 1 single button. Simply tap & hold to snap, ask your question, and then release to get an answer. Campedia speaks your language. Currently, we support English, German, French, Italian, Spanish, Japanese, Korean, Portuguese and Chinese. Campedia is an AI camera App that works like ChatGPT but for photos. You simply snap a photo and can ask any question. Campedia can be used for an unlimited array of use cases. Popular examples are detecting plants or animals, and asking for info about a wine or a landmark.
    Starting Price: Free