Compare the Top Artificial Intelligence (AI) APIs in New Zealand as of July 2025

What is Artificial Intelligence (AI) APIs in New Zealand?

Artificial Intelligence APIs are software that provide access to advanced technology, AI, and machine learning algorithms designed to solve complex problems. They allow developers to create applications with smarter artificial intelligence features such as natural language processing, image recognition, and more. Many companies use AI APIs to automate tasks or gain insights into customer data so they can improve their products or services. AI APIs are constantly evolving, enabling businesses to benefit from cutting-edge technologies while decreasing the time required for development. Compare and read user reviews of the best Artificial Intelligence (AI) APIs in New Zealand currently available using the table below. This list is updated regularly.

  • 1
    Google Cloud Speech-to-Text
    The Google Cloud Speech-to-Text service provides a powerful AI API that allows developers to seamlessly integrate speech recognition capabilities into their applications. This API processes audio input in real time and can transcribe it into text, making it suitable for a wide range of applications, including voice search and interactive systems. The API's ability to work with various audio formats and handle different speech patterns further enhances its versatility. Additionally, it provides enhanced capabilities for handling long audio files and multiple speakers, offering more comprehensive transcription solutions. As a bonus, new customers receive $300 in free credits to experiment with these AI tools, giving them the flexibility to explore the API’s full potential without initial financial commitment.
    Leader badge
    Starting Price: Free ($300 in free credits)
    View Software
    Visit Website
  • 2
    Qloo

    Qloo

    Qloo

    Qloo is the “Cultural AI”, decoding and predicting consumer taste across the globe. A privacy-first API that predicts global consumer preferences and catalogs hundreds of millions of cultural entities. Through our API, we provide contextualized personalization and insights based on a deep understanding of consumer behavior and more than 575 million people, places, and things. Our technology empowers you to look beyond trends and uncover the connections behind people’s tastes in the world around them. Look up entities in our vast library spanning categories like brands, music, film, fashion, travel destinations, and notable people. Results are delivered within milliseconds and can be weighted by factors such as regionalization and real-time popularity. Used by companies who want to incorporate best-in-class data in their consumer experiences. Our flagship recommendation API delivers results based on demographics, preferences, cultural entities, metadata, and geolocational factors.
    Leader badge
    View Software
    Visit Website
  • 3
    Speechmatics

    Speechmatics

    Speechmatics

    Best-in-Market Speech-to-Text & Voice AI for Enterprises. Speechmatics delivers industry-leading Speech-to-Text and Voice AI for enterprises needing unrivaled accuracy, security, and flexibility. Our enterprise-grade APIs provide real-time and batch transcription with exceptional precision—across the widest range of languages, dialects, and accents. Powered by Foundational Speech Technology, Speechmatics supports mission-critical voice applications in media, contact centers, finance, healthcare, and more. With on-prem, cloud, and hybrid deployment, businesses maintain full control over data security while unlocking voice insights. Trusted by global leaders, Speechmatics is the top choice for best-in-class transcription and voice intelligence. 🔹 Unmatched Accuracy – Superior transcription across languages & accents 🔹 Flexible Deployment – Cloud, on-prem, and hybrid 🔹 Enterprise-Grade Security – Full data control 🔹 Real-Time & Batch Processing – Scalable transcription
    Starting Price: $0 per month
  • 4
    Dialogflow
    Dialogflow from Google Cloud is a natural language understanding platform that makes it easy to design and integrate a conversational user interface into your mobile app, web application, device, bot, interactive voice response system, and so on. Using Dialogflow, you can provide new and engaging ways for users to interact with your product. Dialogflow can analyze multiple types of input from your customers, including text or audio inputs (like from a phone or voice recording). It can also respond to your customers in a couple of ways, either through text or with synthetic speech. Dialogflow CX and ES provide virtual agent services for chatbots and contact centers. If you have a contact center that employs human agents, you can use Agent Assist to help your human agents. Agent Assist provides real-time suggestions for human agents while they are in conversations with end-user customers.
  • 5
    AI/ML API

    AI/ML API

    AI/ML API

    AI/ML API is a game-changing platform for developers and SaaS entrepreneurs looking to integrate cutting-edge AI capabilities into their products. It offers a single point of access to over 200 state-of-the-art AI models, covering everything from NLP to computer vision. Key Features for Developers: Extensive Model Library: 200+ pre-trained models for rapid prototyping and deployment Developer-Friendly Integration: RESTful APIs and SDKs for seamless incorporation into your stack Serverless Architecture: Focus on coding, not infrastructure management Advantages for SaaS Entrepreneurs: Rapid Time-to-Market: Leverage advanced AI without building from scratch Scalability: From MVP to enterprise-grade solutions, AI/ML API grows with your business Cost-Efficiency: Pay-as-you-go pricing model reduces upfront investment Competitive Edge: Stay ahead with continuously updated AI models
    Starting Price: $4.99/week
  • 6
    MeaningCloud

    MeaningCloud

    MeaningCloud

    MeaningCloud is the easiest, most powerful, and most affordable way to extract the meaning from unstructured content: documents, articles, social conversations, web content, etc. We provide text analytics products to extract the most accurate insights from any content in many languages. And we do it SaaS and On-prem. We work for different industries (pharma, finance, media, retail, hospitality, telco, etc.) developing personalized and industry-oriented solutions.  Pay only for what you use, without any activation fees, minimum time commitment and with the most generous free plan of the market. If you don't like it, you can stop using it, just like that. Without software to install or infrastructure to deploy. All the reliability and scalability of solutions in the cloud, and the possibility of testing it for free.
    Starting Price: $99 per month
  • 7
    Vertex AI Vision
    Easily build, deploy, and manage computer vision applications with a fully managed, end-to-end application development environment that reduces the time to build computer vision applications from days to minutes at one-tenth the cost of current offerings. Quickly and conveniently ingest real-time video and image streams at a global scale. Easily build computer vision applications using a drag-and-drop interface. Store and search petabytes of data with built-in AI capabilities. Vertex AI Vision includes all the tools needed to manage the life cycle of computer vision applications, across ingestion, analysis, storage, and deployment. Easily connect application output to a data destination, like BigQuery for analytics, or live streaming to drive real-time business actions. Ingest thousands of video streams from across the globe. With a monthly pricing model, enjoy up to one-tenth lower costs than previous offerings.
    Starting Price: $0.0085 per GB
  • 8
    Komprehend

    Komprehend

    Komprehend

    Komprehend AI APIs are the most comprehensive set of document classification and NLP APIs for software developers. Our NLP models are trained on more than a billion documents and provide state-of-the-art accuracy on most common NLP use cases such as sentiment analysis and emotion detection. Try our free demo now and see the effectiveness of our Text Analysis API. Maintains high accuracy in the real world, and brings out useful insights from open-ended textual data. Works on a variety of data, ranging from finance to healthcare. Supports private cloud deployments via Docker containers or on-premise deployment ensuring no data leakage. Protects your data and follows the GDPR compliance guidelines to the last word. Understand the social sentiment of your brand, product, or service while monitoring online conversations. Sentiment analysis is contextual mining of text which identifies and extracts subjective information in the source material.
    Starting Price: $79 per month
  • 9
    Allganize

    Allganize

    Allganize

    Allganize's industry-leading AI solutions provide businesses with the best tool to automate customer and employee support. Automate an average of 72% of all monthly support tickets within the first 4 months of implementation. Let our AI automate simple customer requests and free up your agents’ time to handle more complex issues. Employees can ask questions in a conversational way and find answers from multiple document types. Conversational AI chat bot pre-trained for your websites and automates customer service. Intelligent search that extracts accurate answers from any document, instantaneously. Automatically extracts important keywords from any document and categorizes them, providing valuable insights. Understands the context of product reviews using one's natural language to automatically detect positive or negative experiences. Assigns predefined categories from customer support conversions to accurately determine user intent.
    Starting Price: $2 per month
  • 10
    Google AI Edge
    ​Google AI Edge offers a comprehensive suite of tools and frameworks designed to facilitate the deployment of artificial intelligence across mobile, web, and embedded applications. By enabling on-device processing, it reduces latency, allows offline functionality, and ensures data remains local and private. It supports cross-platform compatibility, allowing the same model to run seamlessly across embedded systems. It is also multi-framework compatible, working with models from JAX, Keras, PyTorch, and TensorFlow. Key components include low-code APIs for common AI tasks through MediaPipe, enabling quick integration of generative AI, vision, text, and audio functionalities. Visualize the transformation of your model through conversion and quantification. Overlays the results of the comparisons to debug the hotspots. Explore, debug, and compare your models visually. Overlays comparisons and numerical performance data to identify problematic hotspots.
    Starting Price: Free
  • 11
    IBM Distributed AI APIs
    Distributed AI is a computing paradigm that bypasses the need to move vast amounts of data and provides the ability to analyze data at the source. Distributed AI APIs built by IBM Research is a set of RESTful web services with data and AI algorithms to support AI applications across hybrid cloud, distributed, and edge computing environments. Each Distributed AI API addresses the challenges in enabling AI in distributed and edge environments with APIs. The Distributed AI APIs do not focus on the basic requirements of creating and deploying AI pipelines, for example, model training and model serving. You would use your favorite open-source packages such as TensorFlow or PyTorch. Then, you can containerize your application, including the AI pipeline, and deploy these containers at the distributed locations. In many cases, it’s useful to use a container orchestrator such as Kubernetes or OpenShift operators to automate the deployment process.
  • 12
    GAIMIN AI

    GAIMIN AI

    GAIMIN AI

    Run your AI with our APIs, and only pay for what you use; no idle costs, just unparalleled speed and scalability. Improve your product by integrating AI-driven image generation, offering users high-quality, unique visuals. Create content, automate responses, or personalize experiences with AI text generation. Enhance accessibility and productivity by integrating real-time speech recognition into your products. Use the API to create voiceovers, enhance accessibility, or build interactive experiences. Use our API to sync speech with facial movements for lifelike animations and improved video quality. Automates repetitive tasks and streamlines workflows. Gain valuable insights from data to make informed business decisions. Stay ahead with advanced AI powered by a global, decentralized network of cutting-edge computers. Provides personalized recommendations, improving customer satisfaction.
  • 13
    GPT-Image-1
    OpenAI's Image Generation API, powered by the gpt-image-1 model, enables developers and businesses to integrate high-quality, professional-grade image generation directly into their tools and platforms. This model offers versatility, allowing it to create images across diverse styles, faithfully follow custom guidelines, leverage world knowledge, and accurately render text, unlocking countless practical applications across multiple domains. Leading enterprises and startups across industries, including creative tools, ecommerce, education, enterprise software, and gaming, are already using image generation in their products and experiences. It gives creators the choice and flexibility to experiment with different aesthetic styles. Users can generate and edit images from simple prompts, adjusting styles, adding or removing objects, expanding backgrounds, and more.
    Starting Price: $0.19 per image
  • 14
    Mistral Agents API
    Mistral AI has introduced its Agents API, a significant advancement aimed at enhancing the capabilities of AI by addressing the limitations of traditional language models in performing actions and maintaining context. This new API integrates Mistral's powerful language models with several key features, built-in connectors for code execution, web search, image generation, and Model Context Protocol (MCP) tools; persistent memory across conversations; and agentic orchestration capabilities. The Agents API complements Mistral's Chat Completion API by providing a dedicated framework that simplifies the implementation of agentic use cases, serving as the backbone of enterprise-grade agentic platforms. It enables developers to build AI agents capable of handling complex tasks, maintaining context, and coordinating multiple actions, thereby making AI more practical and impactful for enterprises.
  • 15
    AWS AI Services
    AWS pre-trained AI Services provide ready-made intelligence for your applications and workflows. AI Services easily integrate with your applications to address common use cases such as personalized recommendations, modernizing your contact center, improving safety and security, and increasing customer engagement. Because we use the same deep learning technology that powers Amazon.com and our ML Services, you get quality and accuracy from continuously-learning APIs. And best of all, AI Services on AWS doesn't require machine learning experience. Catalog assets, automate workflows, and extract meaning from your media and applications. Identify missing product components, vehicle and structure damage, and irregularities for comprehensive quality control. Improve operations with automated monitoring to find bottlenecks and assess manufacturing quality and safety. Pull valuable information from millions of documents at speed.
  • 16
    Lexalytics

    Lexalytics

    Lexalytics

    Integrate our text analytics APIs to add world-leading NLP into your product, platform, or application. The most feature-complete NLP feature stack on the market, 19 years in development and constantly being improved with new libraries, configurations, and models. Determine whether a piece of writing is positive, negative, or neutral. Sort and organize documents into customizable groups. Determine the expressed intent of customers and reviewers. Find people, places, dates, companies, products, jobs, titles, and more. Deploy our text analytics and NLP systems across any combination of on-premise, private cloud, hybrid cloud, and public cloud infrastructure. Our core text analytics and natural language processing software libraries are at your command. Suitable for data scientists and architects who want complete access to the underlying technology or who need on-premise deployment for security or privacy reasons.
  • 17
    Mistral OCR

    Mistral OCR

    Mistral AI

    Mistral AI's Document Capabilities provide a powerful set of tools for understanding, summarizing, and generating content from complex documents using advanced AI models. Designed for developers and businesses, these capabilities allow users to process large volumes of text efficiently, extracting key information, generating concise summaries, and even drafting new content based on the original document. By leveraging state-of-the-art language models, Mistral enables organizations to automate document-heavy workflows, from legal reviews and contract analysis to research paper summaries and business reports. The API allows seamless integration into existing systems, enabling real-time document processing and analysis. Mistral’s Document capabilities are especially suited for scenarios where quick comprehension of lengthy or technical materials is critical, reducing the time spent on manual reading and review.
  • 18
    Gemini Live API
    ​The Gemini Live API is a preview feature that enables low-latency, bidirectional voice and video interactions with Gemini. It allows end users to experience natural, human-like voice conversations and provides the ability to interrupt the model's responses using voice commands. The model can process text, audio, and video input, and it can provide text and audio output. New capabilities include two new voices and 30 new languages with configurable output language, configurable image resolutions (66/256 tokens), configurable turn coverage (send all inputs all the time or only when the user is speaking), configurable interruption settings, configurable voice activity detection, new client events for end-of-turn signaling, token counts, a client event for signaling the end of stream, text streaming, configurable session resumption with session data stored on the server for 24 hours, and longer session support with a sliding context window.
  • 19
    Windows AI Foundry
    Windows AI Foundry is a unified, reliable, and secure platform supporting the AI developer lifecycle from model selection, fine-tuning, optimizing, and deployment across CPU, GPU, NPU, and cloud. It integrates tools like Windows ML, enabling developers to bring their own models and deploy them efficiently across the silicon partner ecosystem, including AMD, Intel, NVIDIA, and Qualcomm, spanning CPU, GPU, and NPU. Foundry Local allows developers to pull in their favorite open source models and make their apps smarter. It offers ready-to-use AI APIs powered by on-device models, optimized for efficiency and performance on Copilot+ PC devices with minimal setup required. These APIs include capabilities such as text recognition (OCR), image super resolution, image segmentation, image description, and object erasing. Developers can customize Windows inbox models with their own data using LoRA for Phi Silica.
  • 20
    Clarity AI

    Clarity AI

    Clarity AI

    Invest sustainably, shop sustainably, and report or benchmark for sustainability with easy-to-use, AI-powered technology. With tech building blocks for every sustainability use case, you can cover any needs you have related to data, methodologies, or tools. And, with digitally-native capabilities and a fully modular infrastructure, you can take and use any, or every, piece of our sustainability tech kit. Whether you need a comprehensive, customizable, fully-packaged sustainability tech platform or just one data point to ensure regulatory compliance, Clarity AI empowers you to efficiently and confidently assess, analyze, and report on anything valuable to you or your clients and everything required by regulation. Clarity AI seamlessly integrates into your workflow via API or our web app and is the only scalable and flexible end-to-end SaaS tool able to address any sustainability use case.
  • Previous
  • You're on page 1
  • Next