Experience the power of large language models like never before, unleashing the full potential of Natural Language Processing (NLP) with Spark NLP, the open source library that delivers scalable LLMs. The full code base is open under the Apache 2.0 license, including pre-trained models and pipelines. The only NLP library built natively on Apache Spark. The most widely used NLP library in the enterprise. Spark ML provides a set of machine learning applications that can be built using two main components, estimators and transformers. The estimators have a method that secures and trains a piece of data to such an application. The transformer is generally the result of a fitting process and applies changes to the target dataset. These components have been embedded to be applicable to Spark NLP. Pipelines are a mechanism for combining multiple estimators and transformers in a single workflow. They allow multiple chained transformations along a machine-learning task.

Features

  • Text Preprocessing
  • Parsing and Analysis
  • Sentiment and Classification
  • Classification and Question Answering
  • Machine Translation and Generation
  • Integration and Interoperability (ONNX, OpenVINO)
  • Pre-trained Models (36000+ in +200 languages)
  • Multi-lingual Support

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow Spark NLP

Spark NLP Web Site

Other Useful Business Software
Gen AI apps are built with MongoDB Atlas Icon
Gen AI apps are built with MongoDB Atlas

The database for AI-powered applications.

MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Spark NLP!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Scala

Related Categories

Scala Natural Language Processing (NLP) Tool

Registered

2024-08-12