Compare the Top Data Catalog Software for Cloud as of June 2025

What is Data Catalog Software for Cloud?

Data catalog software is a tool used to organize, manage, and provide easy access to an organization's data assets. It helps businesses create a centralized inventory of all available data, such as databases, datasets, reports, and documents, allowing users to search, classify, and understand their data assets more efficiently. Features often include metadata management, data lineage tracking, data governance, collaboration tools, and integration with data management systems. By providing a clear overview of data sources and their relationships, data catalog software facilitates data discovery, improves data quality, ensures compliance, and enhances collaboration across teams. Compare and read user reviews of the best Data Catalog software for Cloud currently available using the table below. This list is updated regularly.

  • 1
    Composable DataOps Platform

    Composable DataOps Platform

    Composable Analytics

    Composable is an enterprise-grade DataOps platform built for business users that want to architect data intelligence solutions and deliver operational data-driven products leveraging disparate data sources, live feeds, and event data regardless of the format or structure of the data. With a modern, intuitive dataflow visual designer, built-in services to facilitate data engineering, and a composable architecture that enables abstraction and integration of any software or analytical approach, Composable is the leading integrated development environment to discover, manage, transform and analyze enterprise data.
    Starting Price: $8/hr - pay-as-you-go
  • 2
    Pentaho

    Pentaho

    Hitachi Vantara

    With an integrated product suite providing data integration, analytics, cataloging, optimization and quality, Pentaho+ enables seamless data management, driving innovation and informed decision-making. Pentaho+ has helped customers achieve a 3x increase in improved data trust, a 7x increase in impactful business results and most importantly, a 70% increase in productivity.
  • 3
    OvalEdge

    OvalEdge

    OvalEdge

    OvalEdge is a cost-effective data catalog designed for end-to-end data governance, privacy compliance, and fast, trustworthy analytics. OvalEdge crawls your organizations’ databases, BI platforms, ETL tools, and data lakes to create an easy-to-access, smart inventory of your data assets. Using OvalEdge, analysts can discover data and deliver powerful insights quickly. OvalEdge’s comprehensive functionality enables users to establish and improve data access, data literacy, and data quality.
    Starting Price: $1,300/month
  • 4
    DvSum

    DvSum

    DvSum

    DvSum is a AI-powered Data Intelligence platform that makes it remarkably easier for your data and analytics teams to discover, monitor, and govern data. With powerful AI-enabled algorithms, DvSum automatically catalogues, classifies, and curates your data and makes it available as an actionable Data Catalog. Propel your enterprise towards its digital and analytics enabled transformation goals with DvSum Data Intelligence.
    Starting Price: $1000/ per month
  • 5
    K2View

    K2View

    K2View

    At K2View, we believe that every enterprise should be able to leverage its data to become as disruptive and agile as the best companies in its industry. We make this possible through our patented Data Product Platform, which creates and manages a complete and compliant dataset for every business entity – on demand, and in real time. The dataset is always in sync with its underlying sources, adapts to changes in the source structures, and is instantly accessible to any authorized data consumer. Data Product Platform fuels many operational use cases, including customer 360, data masking and tokenization, test data management, data migration, legacy application modernization, data pipelining and more – to deliver business outcomes in less than half the time, and at half the cost, of any other alternative. The platform inherently supports modern data architectures – data mesh, data fabric, and data hub – and deploys in cloud, on-premise, or hybrid environments.
  • 6
    OneTrust Privacy Automation
    Go beyond compliance and build trust through transparency, choice, and control. People demand greater control of their data, unlocking an opportunity for organizations to use these moments to build trust and deliver more valuable experiences. We provide privacy and data governance automation to help organizations better understand their data across the business, meet regulatory requirements, and operationalize risk mitigation to provide transparency and choice to individuals. Achieve data privacy compliance faster and build trust in your organization. Our platform helps break down silos across processes, workflows, and teams to operationalize regulatory compliance and enable trusted data use. Build proactive privacy programs rooted in global best practices, not reactive to individual regulations. Gain visibility into unknown risks to drive mitigation and risk-based decision making. Respect individual choice and embed privacy and security by default into the data lifecycle.
  • 7
    Alation

    Alation

    Alation

    Alation is the first company to bring a data catalog to market. It radically improves how people find, understand, trust, use, and reuse data. Alation pioneered active, non-invasive data governance, which supports both data democratization and compliance at scale, so people have the data they need alongside guidance on how to use it correctly. By combining human insight with AI and machine learning, Alation tackles the toughest challenges in data today. More than 350 enterprises use Alation to make confident, data-driven decisions. American Family Insurance, Exelon, Munich Re, and Pfizer are all proud customers.
  • 8
    Narrative

    Narrative

    Narrative

    Create new streams of revenue using the data you already collect with your own branded data shop. Narrative is focused on the fundamental principles that make buying and selling data easier, safer, and more strategic. Ensure that the data you access meets your standards, whatever they may be. Know exactly who you’re working with and how the data was collected. Easily access new supply and demand for a more agile and accessible data strategy. Own your data strategy entirely with end-to-end control of inputs and outputs. Our platform simplifies and automates the most time- and labor-intensive aspects of data acquisition, so you can access new data sources in days, not months. With filters, budget controls, and automatic deduplication, you’ll only ever pay for the data you need, and nothing that you don’t.
    Starting Price: $0
  • 9
    Datameer

    Datameer

    Datameer

    Datameer revolutionizes data transformation with a low-code approach, trusted by top global enterprises. Craft, transform, and publish data seamlessly with no code and SQL, simplifying complex data engineering tasks. Empower your data teams to make informed decisions confidently while saving costs and ensuring responsible self-service analytics. Speed up your analytics workflow by transforming datasets to answer ad-hoc questions and support operational dashboards. Empower everyone on your team with our SQL or Drag-and-Drop to transform your data in an intuitive and collaborative workspace. And best of all, everything happens in Snowflake. Datameer is designed and optimized for Snowflake to reduce data movement and increase platform adoption. Some of the problems Datameer solves: - Analytics is not accessible - Drowning in backlog - Long development
  • 10
    Azure Data Catalog
    In the new world of data, you can spend more time looking for data than you do analyzing it. Azure Data Catalog is an enterprise-wide metadata catalog that makes data asset discovery straightforward. It’s a fully-managed service that lets you—from analyst to data scientist to data developer—register, enrich, discover, understand, and consume data sources. Work with data in the tool of your choice. Data Catalog lets you find the data you need and use it in the tools you choose. Your data stays where you want it, and Data Catalog helps you discover and work with it where you want, with an intuitive user experience. ncrease broad adoption and continuous value creation across your data ecosystem. Data Catalog helps you get tips, tricks, and unwritten rules into an experience where everyone can get value. With Data Catalog, everyone can contribute. Democratize data asset discovery.
    Starting Price: $1 per user per month
  • 11
    erwin Data Intelligence
    erwin Data Intelligence (erwin DI) combines data catalog and data literacy capabilities for greater awareness of and access to available data assets, guidance on their use, and guardrails to ensure data policies and best practices are followed. Automatically harvest, transform and feed metadata from a wide array of data sources, operational processes, business applications and data models into a central catalog. Then make it accessible and understandable via role-based, contextual views so stakeholders can make strategic decisions based on accurate insights. erwin DI supports enterprise data governance, digital transformation and any effort that relies on data for favorable outcomes. Schedule ongoing scans of metadata from the widest array of data sources. Easily map data elements from source to target, including data in motion, and harmonize data integration across platforms. Enable data consumers to define and discover data relevant to their roles.
    Starting Price: $299 per month
  • 12
    Google Cloud Data Catalog
    A fully managed and highly scalable data discovery and metadata management service. New customers get $300 in free credits to spend on Google Cloud during the Free Trial. All customers get up to 1 MiB of business or ingested metadata storage and 1 million API calls, free of charge. Pinpoint your data with a simple but powerful faceted-search interface. Sync technical metadata automatically and create schematized tags for business metadata. Tag sensitive data automatically, through Cloud Data Loss Prevention (DLP) integration. Get access immediately then scale without infrastructure to set up or manage. Empower any user on the team to find or tag data with a powerful UI, built with the same search technology as Gmail, or via API access. Data Catalog is fully managed, so you can start and scale effortlessly. Enforce data security policies and maintain compliance through Cloud IAM and Cloud DLP integrations.
    Starting Price: $100 per GiB per month
  • 13
    IBM Watson Knowledge Catalog
    Activate business-ready data for AI and analytics with intelligent cataloging, backed by active metadata and policy management. IBM Watson® Knowledge Catalog is a data catalog tool that powers intelligent, self-service discovery of data, models and more. The cloud-based enterprise metadata repository activates information for AI, machine learning (ML) and deep learning. Access, curate, categorize and share data, knowledge assets and their relationships, wherever they reside. Organize, define and manage enterprise data to provide the right context and drive value across needs like regulatory compliance and data monetization. Protect data, manage compliance and audit-readiness, and maintain client trust with active policy management and dynamic masking of sensitive data. Consume and transform data at the speed of business with intuitive dashboards and flows that can be shared with peers or analytics tools.
    Starting Price: $300 per instance
  • 14
    SAP Data Intelligence
    Turn data chaos into data value with data intelligence. Connect, discover, enrich, and orchestrate disjointed data assets into actionable business insights at enterprise scale. SAP Data Intelligence is a comprehensive data management solution. As the data orchestration layer of SAP’s Business Technology Platform, it transforms distributed data sprawls into vital data insights, delivering innovation at scale. Provide your users with intelligent, relevant, and contextual insights with integration across the IT landscape. Integrate and orchestrate massive data volumes and streams at scale. Streamline, operationalize, and govern innovation driven by machine learning. Optimize governance and minimize compliance risk with comprehensive metadata management rules. Connect, discover, enrich, and orchestrate disjointed data assets into actionable business insights at enterprise scale.
    Starting Price: $1.22 per month
  • 15
    iomete

    iomete

    iomete

    Modern lakehouse built on top of Apache Iceberg and Apache Spark. Includes: Serverless lakehouse, Serverless Spark Jobs, SQL editor, Advanced data catalog and built-in BI (or connect 3rd party BI e.g. Tableau, Looker). iomete has an extreme value proposition with compute prices is equal to AWS on-demand pricing. No mark-ups. AWS users get our platform basically for free.
    Starting Price: Free
  • 16
    Decube

    Decube

    Decube

    Decube is a data management platform that helps organizations manage their data observability, data catalog, and data governance needs. It provides end-to-end visibility into data and ensures its accuracy, consistency, and trustworthiness. Decube's platform includes data observability, a data catalog, and data governance components that work together to provide a comprehensive solution. The data observability tools enable real-time monitoring and detection of data incidents, while the data catalog provides a centralized repository for data assets, making it easier to manage and govern data usage and access. The data governance tools provide robust access controls, audit reports, and data lineage tracking to demonstrate compliance with regulatory requirements. Decube's platform is customizable and scalable, making it easy for organizations to tailor it to meet their specific data management needs and manage data across different systems, data sources, and departments.
  • 17
    Secoda

    Secoda

    Secoda

    With Secoda AI on top of your metadata, you can now get contextual search results from across your tables, columns, dashboards, metrics, and queries. Secoda AI can also help you generate documentation and queries from your metadata, saving your team hundreds of hours of mundane work and redundant data requests. Easily search across all columns, tables, dashboards, events, and metrics. AI-powered search lets you ask any question to your data and get a contextual answer, fast. Get answers to questions. Integrate data discovery into your workflow without disrupting it with our API. Perform bulk updates, tag PII data, manage tech debt, build custom integrations, identify the least used resources, and more. Eliminate manual error and have total trust in your knowledge repository.
    Starting Price: $50 per user per month
  • 18
    Datafi

    Datafi

    Datafi

    Datafi provides a unified data platform for business teams. It integrates data siloes, it unifies data security and it enables self-service data workflows for the unique requirements of business users to easily find, use, and share the business information they need. Customers deploy Datafi to expand their organization’s data capabilities and empower more people to make fast and better data-driven decisions. With Datafi, data anywhere is easily accessible and meaningful for everyone. Know for sure how your data is accessed and how your data is used. Data-forward organizations know the value of enabling their data to drive new business outcomes, this starts with enabling data access in a simple and secure way. Novel uses of business data can drive new business outcomes and organizations that increase their data literacy are more likely to discover the data-driven insights that create new outcomes to better serve their customers.
    Starting Price: $0.005 per query
  • 19
    s.360

    s.360

    Samplemed

    s360 is the only life underwriting platform you’ll ever need. A complete underwriting workbench connected to Automated underwriting, predictive models, tele and video interviews, accelerated underwriting, and API-integrated paramedical exams report collection – have full control over your case pipeline and operate elegantly and autonomously. Get deeper underwriting insights because it was designed with a data-focused philosophy. It transforms your medical unstructured data into structured insights. Rich in a variety of risk analysis channels - predictive models, interviews, automated underwriting, accelerated UDW, lab exams, and underwriting manuals, among other incredible features.
    Starting Price: $250,000 per year
  • 20
    Google Cloud Dataplex
    Google Cloud's Dataplex is an intelligent data fabric that enables organizations to centrally discover, manage, monitor, and govern data across data lakes, data warehouses, and data marts with consistent controls, providing access to trusted data and powering analytics and AI at scale. Dataplex offers a unified interface for data management, allowing users to automate data discovery, classification, and metadata enrichment of structured, semi-structured, and unstructured data stored in Google Cloud and beyond. It facilitates the logical organization of data into business-specific domains using lakes and data zones, simplifying data curation, tiering, and archiving. Centralized security and governance features enable policy management, monitoring, and auditing across data silos, supporting distributed data ownership with global oversight. Additionally, Dataplex provides built-in data quality and lineage capabilities, automating data quality assessments and capturing data lineage.
    Starting Price: $0.060 per hour
  • 21
    Catalog

    Catalog

    Coalesce

    Catalog from Coalesce (formerly CastorDoc) is a data catalog designed for mass adoption across the whole company. Have an overview of all your data environment. Search for data instantly thanks to our powerful search engine. Onboard to a new data infrastructure and access data in a breeze. Go beyond your traditional data catalog. Modern data teams now have numerous data sources, build one truth. With its delightful and automated documentation experience, Catalog makes it dead simple to trust data. Column-level, cross-system data lineage in minutes. Get a bird’s eye view of your data pipelines to build trust in your data. Troubleshoot data issues, perform impact analyses, comply with GDPR in one tool. Optimize performance, cost, compliance, and security for your data. Keep your data stack healthy with our automated infrastructure monitoring system.
    Starting Price: $699 per month
  • 22
    AWS Glue

    AWS Glue

    Amazon

    AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development. AWS Glue provides all the capabilities needed for data integration so that you can start analyzing your data and putting it to use in minutes instead of months. Data integration is the process of preparing and combining data for analytics, machine learning, and application development. It involves multiple tasks, such as discovering and extracting data from various sources; enriching, cleaning, normalizing, and combining data; and loading and organizing data in databases, data warehouses, and data lakes. These tasks are often handled by different types of users that each use different products. AWS Glue runs in a serverless environment. There is no infrastructure to manage, and AWS Glue provisions, configures, and scales the resources required to run your data integration jobs.
  • 23
    Tree Schema Data Catalog
    The essential tool for metadata management. Automatically populate your entire catalog in under 5 minutes! Data Discovery. Find the data you need anywhere within your data ecosystem from the database all the way down to the specific values for each field. Automatically document your data from existing data stores. First-class support for tabular and unstructured data. Automated data governance actions. Data Lineage. Explore your data lineage and understand where your data comes from and where it is going. View impact analysis of changes Find all up and downstream impacts. Visualize relationships and connections. API AccessNew. Manage your data lineage as code and keep your catalog up to date with the Tree Schema API. Integrate Data Lineage into CICD pipelines Capture values & descriptions within your code Analyze impact for breaking changes. Data Dictionary. Know the key terms and lingo that drive your business. Define the context and scope for keywords
    Starting Price: $99 per month
  • 24
    Y42

    Y42

    Datos-Intelligence GmbH

    Y42 is the first fully managed Modern DataOps Cloud. It is purpose-built to help companies easily design production-ready data pipelines on top of their Google BigQuery or Snowflake cloud data warehouse. Y42 provides native integration of best-of-breed open-source data tools, comprehensive data governance, and better collaboration for data teams. With Y42, organizations enjoy increased accessibility to data and can make data-driven decisions quickly and efficiently.
  • 25
    Informatica Enterprise Data Catalog
    Scan and index metadata, discover and profile data, and provide detailed lineage across tens of millions of data sets. Classify and organize data assets across any environment to maximize data value and reuse. Automatically scan across multi-cloud platforms, BI tools, ETL, and third-party metadata catalogs; and data types. Leverage AI-powered domain discovery, data similarity, business term associations, and recommendations. Track data movement, from high-level system views to granular column-level lineage, and get detailed impact analysis. Use the Data Asset Analytics dashboard to understand asset usage, enrichment, and collaboration. View data quality rules, scorecards, metric groups, and profiling stats in context. Tap into shared data knowledge with certifications, ratings and reviews, a Q&A platform, and change notifications. Our broad and deep lineup of enterprise-grade data management solutions sets Informatica apart from the crowd.
  • 26
    erwin Data Catalog

    erwin Data Catalog

    Quest Software

    erwin Data Catalog by Quest is metadata management software that helps organizations learn what data they have and where it’s located, including data at rest and in motion. It tells you the data and metadata available for a certain topic so those particular sources and assets can be found quickly for analysis and decision-making. erwin Data Catalog automates the processes involved in harvesting, integrating, activating and governing enterprise data according to business requirements. This automation results in greater accuracy and faster time to value for data governance and digital transformation efforts, including data warehouse, data lake, data vault and other Big Data deployments, cloud migrations, etc. Metadata management is key to sustainable data governance and any other organizational effort for which data is key to the outcome. erwin Data Catalog automates enterprise metadata management, data mapping, data cataloging, code generation, data profiling and data lineage.
  • 27
    Oracle Cloud Infrastructure Data Catalog
    Oracle Cloud Infrastructure (OCI) Data Catalog is a metadata management service that helps data professionals discover data and support data governance. Designed specifically to work well with the Oracle ecosystem, it provides an inventory of assets, a business glossary, and a common metastore for data lakes. OCI Data Catalog is fully managed by Oracle and runs with all the power and scale of Oracle Cloud Infrastructure. Benefit from all of the security, reliability, performance, and scale of Oracle Cloud while using OCI Data Catalog. Using REST APIs and SDKs, developers can integrate OCI Data Catalog’s capabilities in their custom applications. Using a trusted system for managing user identities and access privileges, administrators can control access to data catalog objects and capabilities to manage security requirements. Discover data assets across Oracle data stores on-premises and in the cloud to start gaining real value from data.
  • 28
    ThinkData Works

    ThinkData Works

    ThinkData Works

    Data is the backbone of effective decision-making. However, employees spend more time managing it than using it. ThinkData Works provides a robust catalog platform for discovering, managing, and sharing data from both internal and external sources. Enrichment solutions combine partner data with your existing datasets to produce uniquely valuable assets that can be shared across your entire organization. Unlock the value of your data investment by making data teams more efficient, improving project outcomes, replacing multiple existing tech solutions, and providing you with a competitive advantage.
  • 29
    MetaCenter

    MetaCenter

    Data Advantage Group

    MetaCenter enables business and technology teams to catalog and classify an organization's information assets. Users can self-service questions about their data assets and how data flows through the business and classify how it should be used. This enables organizations to lower costs while improving agility and reducing operational risks. Search-based semantic layer automates cross-referencing models. Faceted Views of specific data assets can be published to individual roles. Lower cost of ownership and higher levels of automation deliver superior ROI compared to competing solutions. Simple GUI driven customization enables rapid application customization. No programming or professional services are required.
  • 30
    Blindata

    Blindata

    Blindata

    Blindata covers all the functions of a Data Governance program: Business Glossary, Data Catalog & Data Lineage build an integrated and complete view on your Data. Data Classification module gives a semantic meaning to the data while the Data Quality, Issue Management & Data Stewardship modules improve the reliability and trust on data. Moreover, privacy compliance can leverage specific features: registry of processing activities, centralized privacy note management, consent registry with Blockchain integrated notarization. Blindata Agent can connect to different data sources, collecting metadata such data structures (Tables, Views, Fields, …), data quality metrics, reverse lineage, etc. Blindata has a modular and entirely API based architecture allowing systematic integration with the most critical business systems (DBMS, Active Directory, e-commerce, Data Platforms). Blindata is available as SaaS, can be installed “on Premise” or purchased on AWS Marketplace.
    Starting Price: $2000/year/user
  • Previous
  • You're on page 1
  • 2
  • Next