The Apache Software Foundation created Apache Hadoop, an open-source, Java-based framework for storing and analyzing large amounts of data.
MongoDB is an open-source NoSQL database that stores and processes large amounts of data without tables, ensuring high performance and scalability.
RainStor manages massive data and uses de-duplication to simplify storage by eliminating duplicate files, improving data organization.
Cassandra is a NoSQL database for real-time data analysis. It offers high scalability and performance with CQL interface.
Presto by Facebook: open-source SQL query engine for interactive analytics on massive data, without the need to move it to a separate system
RapidMiner is an open-source predictive analytics data mining app for data scientists and analysts. It supports model deployment and operation.
Elasticsearch is an open-source distributed analytics search engine for indexing, searching, and analyzing all types of data.
Apache Kafka: popular open-source event store and streaming tech for high-performance data pipelines, integration and analytics.
software that analyzes machine-generated data to offer metrics, diagnose problems, and gain insight into corporate processes.
KNIME is a free open-source platform for data analytics, integration, and reporting, designed to simplify data science processes.