Compare the Top Data Discovery Software for Linux as of June 2025

What is Data Discovery Software for Linux?

Data discovery software is a type of software tool that allows users to quickly identify patterns, trends, and relationships in large datasets. It utilizes tools such as natural language processing and machine learning to quickly analyze data and uncover insights. Data discovery software can be used in a variety of areas such as healthcare, business intelligence, fraud detection, risk management, and more. Its purpose is to give its users quick access to the most relevant data so they can make informed decisions. Compare and read user reviews of the best Data Discovery software for Linux currently available using the table below. This list is updated regularly.

  • 1
    Semarchy xDM
    Use Semarchy unified data platform to experience xDM. Discover, govern, enrich, enlighten and manage data. You can easily transform data into insights with xDM and rapidly deliver data-rich applications with automated master data management. Its business-centric interfaces provide for rapid creation and adoption of data-rich applications, while automation rapidly generates applications to your specific requirements. Use the agile platform to quickly expand or evolve data applications.
    Leader badge
    View Software
    Visit Website
  • 2
    DashboardFox
    Dashboards, codeless reporting, interactive data visualizations, data level security, mobile access, scheduled reports, embedding, sharing via link, and more. DashboardFox is a dashboard and data visualization solution designed for business users with a no-subscription pricing model. Pay once and you own the software for life. DashboardFox is self-hosted, install on your own server, behind your firewall. Looking for Cloud BI? We offer managed hosting services, but you still retain ownership of your DashboardFox licenses and data. DashboardFox allows your users to drill-down and interact with live data visualizations via dashboards and reports. Business users can create new visualization in a codeless report builder without needing a technical pedigree. An alternative to Tableau, Sisense, Looker, Domo, Qlik, Crystal Reports, and others.
    Starting Price: $495 one-time payment
    View Software
    Visit Website
  • 3
    Omniscope Evo
    Visokio builds Omniscope Evo, complete and extensible BI software for data processing, analytics and reporting. A smart experience on any device. Start from any data in any shape, load, edit, blend, transform while visually exploring it, extract insights through ML algorithms, automate your data workflows, and publish interactive reports and dashboards to share your findings. Omniscope is not only an all-in-one BI tool with a responsive UX on all modern devices, but also a powerful and extensible platform: you can augment data workflows with Python / R scripts and enhance reports with any JS visualisation. Whether you’re a data manager, scientist or analyst, Omniscope is your complete solution: from data, through analytics to visualisation.
    Starting Price: $59/month/user
  • 4
    Card Recon

    Card Recon

    Ground Labs

    Card Recon by Ground Labs is the cardholder data discovery tool of choice for more than 300 PCI Qualified Security Assessors (QSAs) and PCI Forensic Investigators (PFI). Accurate and powerful, Card Recon is trusted by over 4,500 merchants across 80 countries as their preferred credit card data discovery tool. Ground Labs has two industry-leading credit card scanning solutions that can fit the needs of your small to medium business: Card Recon Server and Card Recon Desktop. Card Recon searches files, memory and even deleted locations on workstations and file servers (Card Recon Server only) while inspecting hundreds of file types to accurately detect credit card numbers issued by the 10 major payment card providers. Custom-built to meet PCI compliance, Card Recon’s out-of-the-box cardholder data detection capabilities scan for credit card numbers from the 10 major card brands and can identify 160+ combinations of primary account number (PAN) structures used around the world.
  • 5
    OvalEdge

    OvalEdge

    OvalEdge

    OvalEdge is a cost-effective data catalog designed for end-to-end data governance, privacy compliance, and fast, trustworthy analytics. OvalEdge crawls your organizations’ databases, BI platforms, ETL tools, and data lakes to create an easy-to-access, smart inventory of your data assets. Using OvalEdge, analysts can discover data and deliver powerful insights quickly. OvalEdge’s comprehensive functionality enables users to establish and improve data access, data literacy, and data quality.
    Starting Price: $1,300/month
  • 6
    NaturalText

    NaturalText

    NaturalText

    NaturalText A.I. helps you get more out of your data. Discover relationships, create collections, and unveil hidden insights in documents and other text-based data. NaturalText A.I. uses novel artificial intelligence technology to uncover hidden relationships in data. The software uses various state-of-the-art methods to understand context, analyze patterns, and reveal insights—all in a human-readable way. Reveal insights hidden in your data. Finding everything hidden in your text data is a difficult, if not impossible, task. With traditional search, you can only locate information related to a document. NaturalText A.I., on the other hand, uncovers new information within millions of documents, including scientific papers and patents. Use NaturalText A.I. to reveal insights in the data you are currently missing.
    Starting Price: $5000.00
  • 7
    IRI DarkShield

    IRI DarkShield

    IRI, The CoSort Company

    IRI DarkShield is a powerful data masking tool that can (simultaneously) find and anonymize Personally Identifiable Information (PII) "hidden" in semi-structured and unstructured files and database columns / collections. DarkShield jobs are configured, logged, and run from IRI Workbench or a restful RPC (web services) API to encrypt, redact, blur, etc., the PII it finds in: * NoSQL & RDBs * PDFs * Parquet * JSON, XML & CSV * Excel & Word * BMP, DICOM, GIF, JPG & TIFF DarkShield is one of 3 data masking products in the IRI Data Protector Suite, and comes with IRI Voracity data management platform subscriptions. DarkShield bridges the gap between structured and unstructured data masking, allowing users to secure data in a consistent manner across disparate silos and formats by using the same masking functions as FieldShield and CellShield EE. DarkShield also handles data in RDBs and flat-files, too, but there are more capabilities that FieldShield offers for those sources.
    Starting Price: $5000
  • 8
    Bold BI

    Bold BI

    Syncfusion

    Turn your customers into success stories with built-in intelligence featuring state of the art analytics, dashboards and more. Bold BI is a platform designed from the ground up to be embedded, there are no compromises, no shortcuts. Outdated requirements such as “bring all your data into our data store or cube” today add unnecessary complexity to your application. With some solutions, you must design a complex system for data integration and keep the embedded BI data store in sync with your data sources. With Bold BI, you connect directly to hundreds of data sources and any data warehouse of your choice. Deploy with single-tenant mode if you have a shared data model. If you have an isolated data model, use multi-tenant mode to maintain a single source for dashboards and publish across different tenants. Bold BI is designed to accommodate scenarios common and uncommon. Embed Bold BI directly from your application code using powerful JavaScript APIs.
    Starting Price: $495
  • 9
    SCIKIQ

    SCIKIQ

    DAAS Labs

    An AI-powered data management platform that enables true data democratization. Integrates & centralizes all data sources, facilitates collaboration, and empowers organizations for innovation, driven by Insights. SCIKIQ is a holistic business data platform that simplifies data complexities from business users through a no-code, drag-and-drop user interface which allows businesses to focus on driving value from data, thereby enabling them to grow, and make faster and smarter decisions with confidence. Use box integration, connect any data source, and ingest any structured and unstructured data. Build for business users, ease of use, a simple no-code platform, and use drag and drop to manage your data. Self-learning platform. Cloud agnostic, environment agnostic. Build on top of any data environment. SCIKIQ architecture is designed specifically to address the challenges facing the complex hybrid data landscape.
    Starting Price: $10,000 per year
  • 10
    Data Rover

    Data Rover

    Data Rover

    Data Rover is an Advanced User Data and Security Management for any Data-Driven Organisation. A single solution for Infrastructure and Security managers that allows data users to explore, manage, process, and protect their data effectively and efficiently, by simultaneously addressing the two primary needs related to the use of data: Cyber Security and Data Management. Data Rover plays a key role in business asset protection and corporate data management policy definition. Data Analytics Check for security flaws and eliminate issues. Simplify the management of permissions. File Auditor It gives you the proof that something was done. Right or Wrong it's not important - JUST the FACTS. Dark Data Makes work faster and safer by optimising the storage resources usage and reducing costs. Involve the users in data management so they can contribute in keeping the storage systems clean and efficient. Advanced Data Exchange Share business data in/out of the company SAFELY.
  • 11
    Knovos Discovery
    A Holistic eDiscovery Solution Knovos Discovery is eDiscovery made easy. Knovos’ end-to-end solution covers the entire litigation life cycle — from processing, early case assessment, review to production. Our powerful built-in analytics engine enables deep, multidimensional analysis for a 360-degree view of your data.
  • 12
    MinerEye DataTracker
    MinerEye’s DataTracker enables organizations to overcome the information governance and protection challenge. It automatically scans, indexes, analyzes, virtually labels and categorizes every piece of unstructured and dark data contained in the organization’s data repositories. With proprietary Interpretive AI™, machine learning, and computer vision, the solution locates relevant files out of the billions that are stored, accurately evaluates them, qualifies them by significance and purpose, and automatically sends alerts with next best action recommendations in cases of conflicts, duplications, or potential violations. This way, data protection is profoundly enhanced while risk and operational costs are reduced.
    Starting Price: $2000/1TB/month
  • 13
    Open Content Platform

    Open Content Platform

    Open Content Platform

    Open Content Platform (OCP) is an open-source IT discovery and integration platform. OCP handles discovery, integration, and modeling for solutions in the datacenter. The core framework is micro-service, enabling process-level horizontal scalability. It’s fully extensible, designed for developers, and leverages components in the open-source space, to greatly reduce product complexity. The dynamic discovery is fundamentally different from other ITSM vendors, and the data-driven modeling keeps Applications and Services up-to-date for downstream consumption (e.g. Service-aware monitoring).
    Starting Price: $0
  • 14
    Enov8

    Enov8

    Enov8

    End-to-end “Business Intelligence” for your IT organization. Promoting transparency, control, and productivity across environments, release and data. Promote scaled agility across your IT fabric. A complete environment and release picture supporting collaboration across teams and providing the insight that organizations require today to drive competitive innovation. Improve visibility of your complex IT fabric allowing better collaboration and decision making. Manage complex computer systems & the end-to-end IT fabric through a centralized portal. Measure test environment usage to reduce IT spend and increase project productivity. Eliminate chaotic and non-repeatable operations by establishing control via centralized runbooks and using automation on regular & time consuming tasks. Manage change and contention effectively whilst providing real time health status and powerful analytics to determine business impact.
    Starting Price: $8 per month
  • 15
    Protegrity

    Protegrity

    Protegrity

    Our platform allows businesses to use data—including its application in advanced analytics, machine learning, and AI—to do great things without worrying about putting customers, employees, or intellectual property at risk. The Protegrity Data Protection Platform doesn't just secure data—it simultaneously classifies and discovers data while protecting it. You can't protect what you don't know you have. Our platform first classifies data, allowing users to categorize the type of data that can mostly be in the public domain. With those classifications established, the platform then leverages machine learning algorithms to discover that type of data. Classification and discovery finds the data that needs to be protected. Whether encrypting, tokenizing, or applying privacy methods, the platform secures the data behind the many operational systems that drive the day-to-day functions of business, as well as the analytical systems behind decision-making.
  • 16
    Cyberquery

    Cyberquery

    Cyberscience Corporation

    Cyberscience is an international software organization which offers a Business Intelligence software suite named Cyberquery. Cyberquery is offered in both SaaS and traditional licensing models. Some of Cyberquery’s most valued features include intuitive UI, analytics with drills, data visualization, dashboards, XLS integration and automated content distribution. Unlike most vendors in the BI space, Cyberscience differentiates itself by offering live phone support in addition to email, with a support team averaging 15 years industry experience. The Cyberscience support team provides same day responses to issues, and they score very highly on customer satisfaction surveys.
  • 17
    BigID

    BigID

    BigID

    BigID is data visibility and control for all types of data, everywhere. Reimagine data management for privacy, security, and governance across your entire data landscape. With BigID, you can automatically discover and manage personal and sensitive data – and take action for privacy, protection, and perspective. BigID uses advanced machine learning and data intelligence to help enterprises better manage and protect their customer & sensitive data, meet data privacy and protection regulations, and leverage unmatched coverage for all data across all data stores. 2
  • 18
    AutoDiscovery

    AutoDiscovery

    Butler Scientifics

    AutoDiscovery is an intelligent automated exploratory data analysis software that helps biomed researchers unveiling complex relationships hidden in the data files of scientific experiments and clinical trials. AutoDiscovery automatically evaluates the proper statistical tests to assess the relationships between every combination of variables at every individual subset of your data. Cause-effect potential, false discovery rates, small-complex data, groups and treatments and traceability of results are common biomed research needs specifically covered by AutoDiscovery. AutoDiscovery is targeted to Principal Investigators with very little time for data analysis and limited statistical knowledge focused on productive, high impact research.
    Starting Price: €1.795 per year
  • 19
    Fasoo Data Radar
    Fasoo Data Radar (FDR) is a data discovery and classification solution that helps organizations locate, analyze, and manage sensitive unstructured data across on-premise servers, cloud storage, and endpoints. It scans and classifies files based on keywords, regex patterns, file formats, and other predefined policies, ensuring organizations maintain control over critical information. With real-time monitoring and centralized policy enforcement, FDR enhances data security by identifying risks, preventing unauthorized access, and assisting with compliance requirements such as GDPR, HIPAA, and CCPA. Its integration with enterprise security frameworks allows organizations to apply consistent data protection policies while improving operation workflows. By automating data classification and governance, FDR increases efficiency and enhances data visibility for security and compliance management.
  • 20
    IRI Voracity

    IRI Voracity

    IRI, The CoSort Company

    Voracity is the only high-performance, all-in-one data management platform accelerating AND consolidating the key activities of data discovery, integration, migration, governance, and analytics. Voracity helps you control your data in every stage of the lifecycle, and extract maximum value from it. Only in Voracity can you: 1) CLASSIFY, profile and diagram enterprise data sources 2) Speed or LEAVE legacy sort and ETL tools 3) MIGRATE data to modernize and WRANGLE data to analyze 4) FIND PII everywhere and consistently MASK it for referential integrity 5) Score re-ID risk and ANONYMIZE quasi-identifiers 6) Create and manage DB subsets or intelligently synthesize TEST data 7) Package, protect and provision BIG data 8) Validate, scrub, enrich and unify data to improve its QUALITY 9) Manage metadata and MASTER data. Use Voracity to comply with data privacy laws, de-muck and govern the data lake, improve the reliability of your analytics, and create safe, smart test data
  • 21
    DragonGlass

    DragonGlass

    OpenCrowd

    DragonGlass is a highly scalable Platform as a Service infrastructure that stores and processes (unmarshalling, indexing, signature verification, etc) data collected from the Hedera Hashgraph platform at a TPS similar to the Mainnet. For end users the platform supports a Google-like search to access accounts, transaction, smart contracts files, … almost everything produced on the Mainnet. For developers, the platform supports a wide range of REST based API’s to access all data. DragonGlass also contains a dApp Marketplace where dApp developers can list their product for users to view all details related to the dApp, including transaction history, balance history, and payment receipts.
  • 22
    SalesIntel

    SalesIntel

    SalesIntel

    Get the Best B2B Data Available with SalesIntel. SalesIntel is the top provider of accurate and affordable sales and marketing contact data with highest number of direct dials and human-verified contacts in the industry. SalesIntel is your resource for contact and business intelligence. Our combination of automation and researchers allows us to reach 95% data accuracy for all our published contact data while continuing to scale up our number of contacts. SalesIntel’s new method is to have every direct dial number, and email tested and re-verified every 90 days. With the most comprehensive contact & company data and our excellent customer service, SalesIntel is your partner in growth.
  • 23
    TiMi

    TiMi

    TIMi

    With TIMi, companies can capitalize on their corporate data to develop new ideas and make critical business decisions faster and easier than ever before. The heart of TIMi’s Integrated Platform. TIMi’s ultimate real-time AUTO-ML engine. 3D VR segmentation and visualization. Unlimited self service business Intelligence. TIMi is several orders of magnitude faster than any other solution to do the 2 most important analytical tasks: the handling of datasets (data cleaning, feature engineering, creation of KPIs) and predictive modeling. TIMi is an “ethical solution”: no “lock-in” situation, just excellence. We guarantee you a work in all serenity and without unexpected extra costs. Thanks to an original & unique software infrastructure, TIMi is optimized to offer you the greatest flexibility for the exploration phase and the highest reliability during the production phase. TIMi is the ultimate “playground” that allows your analysts to test the craziest ideas!
  • 24
    MINDely
    MIND is the first-ever data security platform that puts data loss prevention (DLP) and insider risk management (IRM) programs on autopilot, so you can automatically identify, detect, and prevent data leaks at machine speed. Continuously find your sensitive data in files spread across your IT environments whether at rest, in motion, or in use. MIND continuously exposes blindspots of sensitive data across your IT environments including SaaS, AI apps, endpoints, on-premise file shares, and emails. MIND monitors and analyzes billions of data security events in real time, enriches each incident with context, and remediates autonomously. MIND automatically blocks sensitive data in real-time from escaping your control, or collaborates with users to remediate risks and educate on your policies. MIND continuously exposes blindspots of sensitive data at rest, in motion, and in use by integrating with data sources across your IT workloads, e.g. SaaS, AI apps, on-premises, endpoints, and emails.
  • 25
    Nyxeia Information Governance Suite
    The Information Governance Suite is a set of products aimed at helping organizations to better discover, categorize, enhance, and govern their information assets regardless of the systems in which they are managed. Products in the suite include: - .discover, which connects to information systems to index and categorize unstructured and structured information assets - .policy, which allows organizations to create full lifecycle policies for information retention and disposal - .preserve, for digital asset preservation near the end of the asset lifecycle - .process, for automating content related actions like content categorization to help records teams deal with escalating workload The solution helps identify sensitive information that may reduce compliance with regulations like GDPR, as well as information that may be redundant, trivial, or obsolete.
  • 26
    Enterprise Recon

    Enterprise Recon

    Ground Labs

    With Enterprise Recon by Ground Labs, organizations can find and remediate sensitive information across the broadest range of structured and unstructured data, whether it’s stored on your servers, on your employees’ devices, or in the cloud. Enterprise Recon enables organizations worldwide to seamlessly discover all of their data and comply with GDPR, PCI DSS, CCPA, HIPAA, Australian Privacy and other data security standards that require the ability to locate and secure PII data as well as information on gender, ethnicity and health… or even non-PII financial data. Enterprise Recon is powered by GLASS™, Ground Labs' proprietary technology that enables the quickest and most accurate data discovery across the broadest set of platforms available. Enterprise Recon natively supports sensitive data discovery on Windows, macOS, Linux, FreeBSD, Solaris, HP-UX and IBM AIX using agent and agentless options. Additional remote options also enable almost any network data stored.
  • Previous
  • You're on page 1
  • Next