Suggested Categories:

Disk Imaging Software
Disk imaging software creates an exact copy of a computer's hard drive, including the operating system, applications, settings, and files. It is commonly used for system backups, disaster recovery, and deploying standardized configurations across multiple machines. By capturing a full snapshot of a disk, users can restore their system quickly in case of data loss, hardware failure, or malware attacks. Many disk imaging tools offer incremental backups, compression, and encryption for efficient storage and security. This software is essential for IT professionals, businesses, and individuals looking to safeguard their data and streamline system recovery.
Image Processing API
An image processing API is an API that can be used by developers to add image processing capabilities to applications. Image processing APIs can be used to build features into applications such as image cleanup, OCR, image conversion, image compression, computer vision, image editing, and more.
Medical Imaging Software
Medical imaging software is used to capture, store, process, and analyze medical images such as X-rays, MRIs, CT scans, ultrasounds, and other diagnostic images. These tools assist healthcare professionals in interpreting images for the purpose of diagnosis, treatment planning, and monitoring of patient progress. Medical imaging software often includes features such as image enhancement, 3D visualization, measurement tools, and the ability to integrate with electronic health records (EHR) systems. It may also offer tools for image sharing, collaboration, and comparison across different time points or imaging modalities. These software solutions improve the accuracy and efficiency of medical imaging workflows, supporting clinicians in delivering better patient care.
Image Recognition Software
Image recognition software enables machines to identify and classify objects, people, or features within images using advanced algorithms and machine learning techniques. This technology is capable of processing vast amounts of visual data to recognize patterns, shapes, and specific elements, making it useful in applications like facial recognition, object detection, and optical character recognition (OCR). The software often integrates with artificial intelligence (AI) models to improve accuracy and adaptability over time through training on diverse datasets. Image recognition is widely used in industries such as security, retail, healthcare, and autonomous vehicles, where it helps automate tasks and enhance decision-making. Overall, this software transforms visual data into actionable insights, providing significant value in various real-world applications.
Dental Imaging Software
Dental imaging software is used by dental professionals to capture, manage, and analyze various types of dental images such as X-rays, 3D scans, and intraoral photographs. This software helps in diagnosing and planning treatments for oral health issues by providing clear, detailed images that are critical for assessment and treatment planning. Features often include image enhancement tools, integration with patient records, digital radiography support, 3D visualization, and measurement tools for accurate diagnosis. Dental imaging software can also improve patient communication by allowing them to view their images and understand their treatment plans more effectively.
Process Mining Software
Process mining software enables organizations to analyze events and processes in order to identify processes that can be improved. Process mining software allows companies to visualize how processes are being handled across the organization.
Process Manufacturing Software
Process manufacturing software enables process manufacturers to streamline and manage their process manufacturing business. Process manufacturing software provides a variety of features including ERP, lot management, measurement management, production management, production scheduling, expiration tracking, recipe management, compliance management, inventory management, and more.
Payment Processing Software
Payment processing software is software used to facilitate payments made to and from businesses. It can process these transactions quickly and securely, allowing merchants to run their operations smoothly. The software is often integrated with point-of-sale systems or accounting programs, allowing for seamless tracking of all financial information. Users can easily manage payment gateways, report on transactions, and reconcile accounts with detailed analytics tools.
Claims Processing Software
Claims processing software is software built to help businesses, particularly in insurance and healthcare, automate and streamline the management of claims from submission to resolution. These platforms typically offer features for claim tracking, claim validation, document management, fraud detection, and payout processing. Claims processing software enables companies to improve operational efficiency, reduce manual errors, and enhance customer satisfaction by speeding up the claims approval process. It also helps with compliance by ensuring that all claims are handled according to industry regulations and internal policies.
Natural Language Processing Software
Natural language processing (NLP) software analyzes both written and spoken human languages and interprets them for translation, deep learning and automation purposes. Natural language processing software may also include natural language understanding (NLU) capabilities.
Business Process Management Software
Business process management software is designed to streamline and automate the way businesses manage their processes. It can help organizations easily create, analyze, optimize, and monitor business processes. BPM software typically works by breaking down a workflow into individual tasks and activities, and then tracking them from start to finish. By using this type of tool, businesses can become more efficient by eliminating redundant steps, reducing manual labor for complex tasks, and improving collaboration among employees.
Digital Process Automation Software
Digital process automation (DPA) software provides businesses with the tools to automate their tasks and operations in order to optimize their workflows. Digital process automation platforms let organizations build apps and integrations quickly using automated workflows and processes. Most capable digital process automation tools allow non-technical users the ability to create workflows and applications without knowing how to code.
Image Optimization Software
Image optimization software is a type of software designed to help reduce the file size of digital images while maintaining the image quality. It typically works by compressing file sizes, changing color profiles or resolution, and stripping out unnecessary metadata and other non-visible data. This reduces the overall file size while still allowing users to keep a high-quality picture. Ultimately, this helps to improve website speed and performance by reducing load times from large image files.
Image Compressors
Image compressors are software tools designed to reduce the file size of digital images without significantly compromising visual quality. They use algorithms like lossless or lossy compression to optimize images for faster loading times, reduced storage needs, and improved web performance. Image compressors support various formats such as JPEG, PNG, GIF, and WebP, often allowing users to balance quality and compression level. These tools are essential for photographers, web developers, and marketers aiming to enhance user experience and bandwidth efficiency. Many compressors also include batch processing, automation, and integration capabilities for streamlined workflows.
Image CDN Software
Image CDN (Content Delivery Network) providers are services that optimize and distribute images quickly and efficiently across a global network of servers. They reduce image load times by caching content in multiple geographic locations, ensuring fast delivery to users regardless of their location. These providers often offer features like automatic image optimization, format conversion, and responsive resizing to improve website performance.
Shoppable Image Software
Shoppable image software is a web-based solution that enables businesses to integrate product catalogs with their online images. This allows consumers to buy products they see in photos or videos with just one click.
Image Converters
Image converters are software tools and apps that enable users to convert images, graphics, and photos from one file format to another. Some image converters offer the ability for batch image conversion.
Image Annotation Tools
Image annotation tools are used to automatically process and label digital images using advanced techniques in machine learning, AI, and computer vision. These tools can accurately recognize important features in images, such as objects, characters, or facial expressions. This data can then be used for various purposes such as automatic image tagging and sorting. Image annotation is becoming an increasingly popular tool for organizing large databases of images and videos.
Robotic Process Automation (RPA) Software
Robotic process automation (RPA) software provides businesses with the automation tools to optimize their processes and operations in order to improve productivity. RPA tools enable businesses to improve efficiency, reduce errors, and lower operational costs by automating mundane tasks, freeing up human employees for more complex and value-driven work. RPA software can integrate with existing systems and applications, providing a seamless automation experience across different workflows. It is widely used in industries such as finance, healthcare, and customer service for tasks that require high levels of consistency and accuracy.
Process Server Software
Process server software is software to manage and streamline the delivery of legal documents, such as subpoenas, summons, and court orders. It enables process servers to track assignments, schedule deliveries, and log successful service attempts in real time. The software often includes GPS tracking, automated notifications, and reporting features to ensure transparency and accountability. It enhances efficiency by digitizing paperwork, reducing manual errors, and providing a centralized platform for managing service records. Legal firms, process serving companies, and independent servers use it to ensure compliance with legal requirements and improve operational workflows.
View more categories (20) for "delphi image processing"

41 Products for "delphi image processing" with 2 filters applied:

  • 1
    Google AI Studio
    Google AI Studio is a comprehensive, web-based development environment that democratizes access to Google's cutting-edge AI models, notably the Gemini family, enabling a broad spectrum of users to explore and build innovative applications. This platform facilitates rapid prototyping by providing an intuitive interface for prompt engineering, allowing developers to meticulously craft and refine their interactions with AI. Beyond basic experimentation, AI Studio supports the seamless...
    Starting Price: Free
    View Software
    Visit Website
  • 2
    Llama 4 Scout
    Llama 4 Scout is a powerful 17 billion active parameter multimodal AI model that excels in both text and image processing. With an industry-leading context length of 10 million tokens, it outperforms its predecessors, including Llama 3, in tasks such as multi-document summarization and parsing large codebases. Llama 4 Scout is designed to handle complex reasoning tasks while maintaining high efficiency, making it perfect for use cases requiring long-context comprehension and image grounding. ...
    Starting Price: Free
  • 3
    InstructGPT
    InstructGPT is an open-source framework for training language models to generate natural language instructions from visual input. It uses a generative pre-trained transformer (GPT) model and the state-of-the-art object detector, Mask R-CNN, to detect objects in images and generate natural language sentences that describe the image. InstructGPT is designed to be effective across domains such as robotics, gaming and education; it can assist robots in navigating complex tasks with natural language instructions, or help students learn by providing descriptive explanations of processes or events.
    Starting Price: $0.0200 per 1000 tokens
  • 4
    GLM-4.6V

    GLM-4.6V

    Zhipu AI

    ...It ships in two variants: a full-scale version (106B parameters) for cloud or high-performance clusters, and a lightweight “Flash” variant (9B) optimized for local deployment or low-latency use. GLM-4.6V supports a native context window of up to 128K tokens during training, enabling it to process very long documents or multimodal inputs. Crucially, it integrates native Function Calling, meaning the model can take images, screenshots, documents, or other visual media as input directly (without manual text conversion), reason about them, and trigger tool calls, bridging “visual perception” with “executable action.” This enables a wide spectrum of capabilities; interleaved image-and-text content generation (for example, combining document understanding with text summarization or generation of image-annotated responses).
    Starting Price: Free
  • 5
    GPT-5 nano
    GPT-5 nano is OpenAI’s fastest and most affordable version of the GPT-5 family, designed for high-speed text processing tasks like summarization and classification. It supports text and image inputs, generating high-quality text outputs with a large 400,000-token context window and up to 128,000 output tokens. GPT-5 nano offers very fast response times, making it ideal for applications requiring quick turnaround without sacrificing quality. Pricing is extremely competitive, with input tokens costing $0.05 per million and output tokens $0.40 per million, making it accessible for budget-conscious projects. ...
    Starting Price: $0.05 per 1M tokens
  • 6
    Amazon Nova Lite
    Amazon Nova Lite is a cost-efficient, multimodal AI model designed for rapid processing of image, video, and text inputs. It delivers impressive performance at an affordable price, making it ideal for interactive, high-volume applications where cost is a key consideration. With support for fine-tuning across text, image, and video inputs, Nova Lite excels in a variety of tasks that require fast, accurate responses, such as content generation and real-time analytics.
  • 7
    ERNIE Bot
    ...Built on the ERNIE (Enhanced Representation through Knowledge Integration) model, ERNIE Bot excels at understanding complex queries and generating human-like responses across various domains. Its capabilities include processing text, generating images, and engaging in multimodal communication, making it suitable for a wide range of applications such as customer support, virtual assistants, and enterprise automation. With its advanced contextual understanding, ERNIE Bot offers an intuitive and efficient solution for businesses seeking to enhance their digital interactions and automate workflows.
    Starting Price: Free
  • 8
    Llama 4 Behemoth
    Llama 4 Behemoth is Meta's most powerful AI model to date, featuring a massive 288 billion active parameters. It excels in multimodal tasks, outperforming previous models like GPT-4.5 and Gemini 2.0 Pro across multiple STEM-focused benchmarks such as MATH-500 and GPQA Diamond. As the teacher model for the Llama 4 series, Behemoth sets the foundation for models like Llama 4 Maverick and Llama 4 Scout. While still in training, Llama 4 Behemoth demonstrates unmatched intelligence, pushing the...
    Starting Price: Free
  • 9
    Qwen2.5-VL

    Qwen2.5-VL

    Alibaba

    Qwen2.5-VL is the latest vision-language model from the Qwen series, representing a significant advancement over its predecessor, Qwen2-VL. This model excels in visual understanding, capable of recognizing a wide array of objects, including text, charts, icons, graphics, and layouts within images. It functions as a visual agent, capable of reasoning and dynamically directing tools, enabling applications such as computer and phone usage. Qwen2.5-VL can comprehend videos exceeding one hour in...
    Starting Price: Free
  • 10
    Pixtral Large

    Pixtral Large

    Mistral AI

    ...It integrates a 123-billion-parameter multimodal decoder with a 1-billion-parameter vision encoder, enabling advanced understanding of documents, charts, and natural images while maintaining leading text comprehension capabilities. With a context window of 128,000 tokens, Pixtral Large can process at least 30 high-resolution images simultaneously. The model has demonstrated state-of-the-art performance on benchmarks such as MathVista, DocVQA, and VQAv2, surpassing models like GPT-4o and Gemini-1.5 Pro. ...
    Starting Price: Free
  • 11
    ERNIE X1
    ...Unlike previous versions, ERNIE X1 is designed to be more efficient in understanding and generating human-like responses. It incorporates cutting-edge machine learning techniques to handle complex queries, making it capable of not only processing text but also generating images and engaging in multimodal communication. ERNIE X1 is often used in natural language processing applications such as chatbots, virtual assistants, and enterprise automation, offering significant improvements in accuracy, contextual understanding, and response quality.
    Starting Price: $0.28 per 1M tokens
  • 12
    ERNIE 4.5 Turbo
    ERNIE 4.5 Turbo, unveiled by Baidu at the 2025 Baidu Create conference, is a cutting-edge AI model designed to handle a variety of data inputs, including text, images, audio, and video. It offers powerful multimodal processing capabilities that enable it to perform complex tasks across industries such as customer support automation, content creation, and data analysis. With enhanced reasoning abilities and reduced hallucinations, ERNIE 4.5 Turbo ensures that businesses can achieve higher accuracy and reliability in AI-driven processes. ...
  • 13
    Palmyra LLM
    ...The Palmyra family includes specialized models such as Palmyra X 004, featuring tool-calling capabilities; Palmyra Med, tailored for healthcare; Palmyra Fin, designed for finance; and Palmyra Vision, which offers advanced image and video processing. These models are available through Writer's full-stack generative AI platform, which integrates graph-based Retrieval Augmented Generation (RAG).
    Starting Price: $18 per month
  • 14
    Amazon Nova
    Amazon Nova is a new generation of state-of-the-art (SOTA) foundation models (FMs) that deliver frontier intelligence and industry leading price-performance, available exclusively on Amazon Bedrock. Amazon Nova Micro, Amazon Nova Lite, and Amazon Nova Pro are understanding models that accept text, image, or video inputs and generate text output. They provide a broad selection of capability, accuracy, speed, and cost operation points. Amazon Nova Micro is a text only model that delivers the lowest latency responses at very low cost. Amazon Nova Lite is a very low-cost multimodal model that is lightning fast for processing image, video, and text inputs. ...
  • 15
    Mistral Small 3.1
    ​Mistral Small 3.1 is a state-of-the-art, multimodal, and multilingual AI model released under the Apache 2.0 license. Building upon Mistral Small 3, this enhanced version offers improved text performance, and advanced multimodal understanding, and supports an expanded context window of up to 128,000 tokens. It outperforms comparable models like Gemma 3 and GPT-4o Mini, delivering inference speeds of 150 tokens per second. Designed for versatility, Mistral Small 3.1 excels in tasks such as...
    Starting Price: Free
  • 16
    Qwen

    Qwen

    Alibaba

    Qwen is a powerful, free AI assistant built on the advanced Qwen model series, designed to help anyone with creativity, research, problem-solving, and everyday tasks. While Qwen Chat is the main interface for most users, Qwen itself powers a broad range of intelligent capabilities including image generation, deep research, website creation, advanced reasoning, and context-aware search. Its multimodal intelligence enables Qwen to understand and process text, images, audio, and video simultaneously for richer insights. Qwen is available on web, desktop, and mobile, ensuring seamless access across all devices. For developers, the Qwen API provides OpenAI-compatible endpoints, making integration simple and allowing Qwen’s intelligence to power apps, services, and automation. ...
    Starting Price: Free
  • 17
    Gemini 2.0
    Gemini 2.0 is an advanced AI-powered model developed by Google, designed to offer groundbreaking capabilities in natural language understanding, reasoning, and multimodal interactions. Building on the success of its predecessor, Gemini 2.0 integrates large language processing with enhanced problem-solving and decision-making abilities, enabling it to interpret and generate human-like responses with greater accuracy and nuance. Unlike traditional AI models, Gemini 2.0 is trained to handle multiple data types simultaneously, including text, images, and code, making it a versatile tool for research, business, education, and creative industries. ...
    Starting Price: Free
  • 18
    GPT-5 mini
    GPT-5 mini is a streamlined, faster, and more affordable variant of OpenAI’s GPT-5, optimized for well-defined tasks and precise prompts. It supports text and image inputs and delivers high-quality text outputs with a 400,000-token context window and up to 128,000 output tokens. This model excels at rapid response times, making it suitable for applications requiring fast, accurate language understanding without the full overhead of GPT-5. Pricing is cost-effective, with input tokens at $0.25...
    Starting Price: $0.25 per 1M tokens
  • 19
    Vertex AI
    Build, deploy, and scale machine learning (ML) models faster, with fully managed ML tools for any use case. Through Vertex AI Workbench, Vertex AI is natively integrated with BigQuery, Dataproc, and Spark. You can use BigQuery ML to create and execute machine learning models in BigQuery using standard SQL queries on existing business intelligence tools and spreadsheets, or you can export datasets from BigQuery directly into Vertex AI Workbench and run your models from there. Use Vertex...
    Starting Price: Free ($300 in free credits)
  • 20
    GPT-4o

    GPT-4o

    OpenAI

    GPT-4o (“o” for “omni”) is a step towards much more natural human-computer interaction—it accepts as input any combination of text, audio, image, and video and generates any combination of text, audio, and image outputs. It can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time (opens in a new window) in a conversation. It matches GPT-4 Turbo performance on text in English and code, with significant...
    Starting Price: $5.00 / 1M tokens
  • 21
    Gemini 2.0 Flash
    The Gemini 2.0 Flash AI model represents the next generation of high-speed, intelligent computing, designed to set new benchmarks in real-time language processing and decision-making. Building on the robust foundation of its predecessor, it incorporates enhanced neural architecture and breakthrough advancements in optimization, enabling even faster and more accurate responses. Gemini 2.0 Flash is designed for applications requiring instantaneous processing and adaptability, such as live...
  • 22
    ERNIE 4.5
    ERNIE 4.5 is a cutting-edge conversational AI platform developed by Baidu, leveraging advanced natural language processing (NLP) models to enable highly sophisticated human-like interactions. The platform is part of Baidu’s ERNIE (Enhanced Representation through Knowledge Integration) series, which integrates multimodal capabilities, including text, image, and voice. ERNIE 4.5 enhances the ability of AI models to understand complex context and deliver more accurate, nuanced responses, making it suitable for various applications, from customer service and virtual assistants to content creation and enterprise-level automation.
    Starting Price: $0.55 per 1M tokens
  • 23
    ERNIE 5.0
    ERNIE 5.0 is a next-generation conversational AI platform developed by Baidu, designed to deliver natural, human-like interactions across multiple domains. Built on Baidu’s Enhanced Representation through Knowledge Integration (ERNIE) framework, it fuses advanced natural language processing (NLP) with deep contextual understanding. The model supports multimodal capabilities, allowing it to process and generate text, images, and voice seamlessly. ERNIE 5.0’s refined contextual awareness enables it to handle complex conversations with greater precision and nuance. Its applications span customer service, content generation, and enterprise automation, enhancing both user engagement and productivity. ...
  • 24
    Gemini

    Gemini

    Google

    ...Accessible via the web and mobile apps, Gemini integrates seamlessly with various Google services, including Docs, Drive, and Gmail, enabling users to draft content, summarize information, and manage tasks efficiently. Its multimodal capabilities allow it to process and generate diverse data types, such as text, images, and audio, providing comprehensive assistance across different contexts. As a continuously learning model, Gemini adapts to user interactions, offering personalized and context-aware responses to meet a wide range of user needs.
    Starting Price: Free
  • 25
    Grok 3
    Grok-3, developed by xAI, represents a significant advancement in the field of artificial intelligence, aiming to set new benchmarks in AI capabilities. It is designed to be a multimodal AI, capable of processing and understanding data from various sources including text, images, and audio, which allows for a more integrated and comprehensive interaction with users. Grok-3 is built on an unprecedented scale, with training involving ten times more computational resources than its predecessor, leveraging 100,000 Nvidia H100 GPUs on the Colossus supercomputer. ...
    Starting Price: Free
  • 26
    Reka Flash 3
    ​Reka Flash 3 is a 21-billion-parameter multimodal AI model developed by Reka AI, designed to excel in general chat, coding, instruction following, and function calling. It processes and reasons with text, images, video, and audio inputs, offering a compact, general-purpose solution for various applications. Trained from scratch on diverse datasets, including publicly accessible and synthetic data, Reka Flash 3 underwent instruction tuning on curated, high-quality data to optimize performance. The final training stage involved reinforcement learning using REINFORCE Leave One-Out (RLOO) with both model-based and rule-based rewards, enhancing its reasoning capabilities. ...
  • 27
    Amazon Nova Pro
    Amazon Nova Pro is a versatile, multimodal AI model designed for a wide range of complex tasks, offering an optimal combination of accuracy, speed, and cost efficiency. It excels in video summarization, Q&A, software development, and AI agent workflows that require executing multi-step processes. With advanced capabilities in text, image, and video understanding, Nova Pro supports tasks like mathematical reasoning and content generation, making it ideal for businesses looking to implement cutting-edge AI in their operations.
  • 28
    Gemini 2.5 Pro
    ...Leading common benchmarks, it excels in math, science, and coding, demonstrating strong performance in tasks like web app creation and code transformation. Built on the Gemini 2.5 foundation, it features a 1 million token context window, enabling it to process vast datasets from various sources such as text, images, and code repositories. Available now in Google AI Studio, Gemini 2.5 Pro is optimized for more sophisticated applications and supports advanced users with improved performance for complex problem-solving.
    Starting Price: $19.99/month
  • 29
    Amazon Nova Premier
    Amazon Nova Premier is the most advanced model in their Nova family, designed to handle complex tasks and act as a teacher for model distillation. Available on Amazon Bedrock, Nova Premier can process text, images, and video inputs, making it capable of managing intricate workflows, multi-step planning, and the precise execution of tasks across various data sources. The model features a context length of one million tokens, enabling it to handle large-scale documents and code bases efficiently. Furthermore, Nova Premier allows users to create smaller, faster, and more cost-effective versions of its models, such as Nova Pro and Nova Micro, for specific use cases through model distillation.
  • 30
    Qwen2.5

    Qwen2.5

    Alibaba

    Qwen2.5 is an advanced multimodal AI model designed to provide highly accurate and context-aware responses across a wide range of applications. It builds on the capabilities of its predecessors, integrating cutting-edge natural language understanding with enhanced reasoning, creativity, and multimodal processing. Qwen2.5 can seamlessly analyze and generate text, interpret images, and interact with complex data to deliver precise solutions in real time. Optimized for adaptability, it excels in personalized assistance, data analysis, creative content generation, and academic research, making it a versatile tool for professionals and everyday users alike. ...
    Starting Price: Free
  • Previous
  • You're on page 1
  • 2
  • Next