Samudra Manthan uses C and MPI to find interesting n-grams (terms) in a large text corpus. It runs on the GigaWord corpus and reports the top m n-grams ranked by the TF*IDF measure.

License

GNU General Public License version 2.0 (GPLv2)

Additional Project Details

Operating Systems

Linux

Intended Audience

Science/Research

Programming Language

C

Related Categories

C Distributed Computing Software

Registered

2008-10-24