FAQ

This is legacy Apache Ignite documentation.
The new documentation is hosted here: https://ignite.apache.org/docs/latest/

What is the difference between on-heap and off-heap memory storage?
Off-heap memory allows your cache to avoid lengthy JVM garbage collection (GC) pauses when working with large heap sizes by caching data outside the main Java heap space, but still in RAM.
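To make the idea of "outside the heap, but still in RAM" concrete, here is a minimal sketch using a plain JDK direct ByteBuffer. It is a toy illustration only and is not how Ignite implements its off-heap storage; the class name OffHeapSketch and its methods are hypothetical.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

/** Toy illustration only: not Ignite's actual off-heap implementation. */
final class OffHeapSketch {
    // A direct buffer is allocated outside the Java heap, so the stored bytes
    // are not scanned or moved by the garbage collector.
    private final ByteBuffer region;

    OffHeapSketch(int capacityBytes) {
        this.region = ByteBuffer.allocateDirect(capacityBytes);
    }

    /** Appends a length-prefixed value and returns the offset it was written at. */
    int put(String value) {
        byte[] bytes = value.getBytes(StandardCharsets.UTF_8);
        int offset = region.position();
        region.putInt(bytes.length);
        region.put(bytes);
        return offset;
    }

    /** Reads a value back by copying it from off-heap memory onto the heap. */
    String get(int offset) {
        int len = region.getInt(offset);          // absolute read, position unchanged
        byte[] bytes = new byte[len];
        ByteBuffer view = region.duplicate();     // shares content, independent position
        view.position(offset + Integer.BYTES);
        view.get(bytes);
        return new String(bytes, StandardCharsets.UTF_8);
    }

    boolean isOffHeap() {
        return region.isDirect();
    }

    public static void main(String[] args) {
        OffHeapSketch store = new OffHeapSketch(1024);
        int a = store.put("hello");
        int b = store.put("ignite");
        System.out.println(store.get(a) + " " + store.get(b) + " direct=" + store.isOffHeap());
    }
}
```

Note that every read copies bytes back onto the heap, which is the usual trade-off of off-heap storage: shorter GC pauses in exchange for a copy on access.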
Is Apache Ignite a key/value store?
Apache Ignite is a resilient in-memory distributed object store with compute capabilities. In its simplest form, yes, Apache Ignite can be used as a key/value store (cache), but it also exposes richer APIs for interacting with the data, such as fully ANSI-99 compliant SQL querying, text search, and transactions.
Does Apache Ignite support JSON documents?
Currently, Apache Ignite does not fully support JSON documents, but the Node.js client, which is currently in beta, will support them.
Can we use Apache Ignite with Apache Hive?
Yes, Apache Ignite Hadoop Accelerator provides a set of components allowing for in-memory Hadoop job execution and file system operations for any Hadoop distribution including Apache Hive.
Running Apache Hive over Ignited Hadoop
In PESSIMISTIC mode with transaction isolation, do you lock keys for reading and writing?
Yes. The main difference between the two concurrency modes is when locks are acquired: in PESSIMISTIC mode they are acquired at the time of access, while in OPTIMISTIC mode they are acquired during the commit phase.
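The difference in lock timing can be sketched with a small single-threaded model in plain Java. This is a hypothetical simplification, not Ignite's transaction API: the class LockTimingSketch and its methods are invented for illustration, a conflict returns false instead of blocking, and the pessimistic write is applied immediately rather than at commit.

```java
import java.util.HashMap;
import java.util.Map;

/** Toy model of lock timing; not Ignite's transaction implementation. */
final class LockTimingSketch {
    /** Locked keys mapped to the name of the transaction that owns the lock. */
    private final Map<String, String> lockOwner = new HashMap<>();
    private final Map<String, Integer> data = new HashMap<>();

    /** Tries to lock a key; returns false if another transaction already holds it. */
    boolean tryLock(String tx, String key) {
        String owner = lockOwner.putIfAbsent(key, tx);
        return owner == null || owner.equals(tx);
    }

    /** Releases every lock held by the given transaction. */
    void unlockAll(String tx) {
        lockOwner.values().removeIf(tx::equals);
    }

    /** PESSIMISTIC style: the lock is taken as soon as the key is touched. */
    boolean pessimisticWrite(String tx, String key, int value) {
        if (!tryLock(tx, key)) {
            return false;                 // conflict is visible at access time
        }
        data.put(key, value);
        return true;
    }

    /** OPTIMISTIC style: writes are buffered and locks are taken only at commit. */
    boolean optimisticCommit(String tx, Map<String, Integer> bufferedWrites) {
        for (String key : bufferedWrites.keySet()) {
            if (!tryLock(tx, key)) {      // conflict is visible only at commit time
                unlockAll(tx);
                return false;
            }
        }
        data.putAll(bufferedWrites);
        unlockAll(tx);
        return true;
    }

    Integer read(String key) {
        return data.get(key);
    }

    public static void main(String[] args) {
        LockTimingSketch store = new LockTimingSketch();
        store.pessimisticWrite("A", "k", 1);                    // A now holds the lock on "k"
        System.out.println("B at access: " + store.pessimisticWrite("B", "k", 2));

        Map<String, Integer> buffered = new HashMap<>();
        buffered.put("k", 3);                                   // no lock taken while buffering
        System.out.println("C at commit: " + store.optimisticCommit("C", buffered));
    }
}
```

In this model, transaction B learns about the conflict the moment it touches the key, while transaction C can buffer its write freely and only discovers the conflict when it tries to commit.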
Can I use Hibernate to access Apache Ignite?
Yes, Apache Ignite In-Memory Data Fabric can be used as a Hibernate second-level cache (or L2 cache), which can significantly speed up the persistence layer of your application.
Does Apache Ignite support JDBC?
Yes, Apache Ignite ships with a JDBC driver that allows you to retrieve distributed data from caches using standard SQL queries and the JDBC API.
Does Apache Ignite guarantee ordering of messages?
Yes, the sendOrdered(...) method can be used if you want to receive messages in the order they were sent. Its timeout parameter specifies how long a message stays in the queue waiting for messages that were supposed to be sent before it. If the timeout expires, all messages that have not yet arrived for the given topic on that node are ignored.
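The receive-side behaviour described above can be sketched as a small reorder buffer in plain Java. This is a hypothetical model, not Ignite's messaging code: the class OrderedInboxSketch, the explicit sequence numbers, and the onTimeout() hook (which stands in for a real timer) are assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeMap;

/** Toy reorder buffer; not Ignite's messaging implementation. */
final class OrderedInboxSketch {
    // Messages that arrived early, keyed by sequence number.
    private final TreeMap<Long, String> pending = new TreeMap<>();
    private final List<String> delivered = new ArrayList<>();
    private long nextSeq = 1;

    /** Accepts a message that may arrive out of order and delivers whatever is now in sequence. */
    void receive(long seq, String body) {
        if (seq < nextSeq) {
            return;                        // arrived after it was given up on: ignored
        }
        pending.put(seq, body);
        drain();
    }

    /** Stands in for the timeout firing: stop waiting for the gap before the oldest queued message. */
    void onTimeout() {
        if (pending.isEmpty()) {
            return;
        }
        nextSeq = pending.firstKey();      // missing earlier messages are skipped
        drain();
    }

    private void drain() {
        while (pending.containsKey(nextSeq)) {
            delivered.add(pending.remove(nextSeq));
            nextSeq++;
        }
    }

    List<String> delivered() {
        return delivered;
    }

    public static void main(String[] args) {
        OrderedInboxSketch inbox = new OrderedInboxSketch();
        inbox.receive(2, "b");             // queued, waiting for message 1
        inbox.receive(1, "a");             // delivers "a", then "b"
        inbox.receive(4, "d");             // queued, waiting for message 3
        inbox.onTimeout();                 // gives up on 3 and delivers "d"
        inbox.receive(3, "c");             // too late: ignored
        System.out.println(inbox.delivered());
    }
}
```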
Can I run Java and .NET closures? How does that work?
.NET nodes can execute both Java and .NET closures, whereas standard Java nodes can execute Java closures only. Starting ApacheIgnite.exe launches the CLR and the JVM together in the same process, using a script located under IGNITE_HOME/platforms/dotnet/bin; .NET closures are handed to the CLR for execution.
What is the cost of conversion between Java and .NET?
The only overhead is an additional array copy plus a JNI call. This overhead might degrade performance in local benchmarks but is negligible under real distributed loads.
How do closures get shipped around?
Every closure is an object of a particular class. When a closure is sent, it is serialized to a binary form, sent over the wire to a remote node, and deserialized there. The remote node must either have the closure's class on its classpath or have peerClassLoading enabled so that it can load the class from the sender side.
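The serialize, ship, deserialize round trip can be sketched with standard JDK serialization. This is an assumption made for illustration: Ignite uses its own marshaller rather than ObjectOutputStream, there is no real network here, and the names ClosureShippingSketch and WordCount are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.util.concurrent.Callable;

/** Toy round trip using JDK serialization; Ignite itself uses its own marshaller. */
final class ClosureShippingSketch {
    /** A closure is an object of a class; its captured state travels with it. */
    static final class WordCount implements Callable<Integer>, Serializable {
        private static final long serialVersionUID = 1L;
        private final String text;

        WordCount(String text) {
            this.text = text;
        }

        @Override
        public Integer call() {
            return text.trim().split("\\s+").length;
        }
    }

    /** "Sender" side: turn the closure into bytes. */
    static byte[] serialize(Serializable closure) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
            out.writeObject(closure);
        }
        return bytes.toByteArray();
    }

    /**
     * "Receiver" side: rebuild the closure from bytes. This throws
     * ClassNotFoundException if the closure's class cannot be loaded,
     * which is the situation peer class loading is meant to address.
     */
    @SuppressWarnings("unchecked")
    static <T> Callable<T> deserialize(byte[] wire) throws IOException, ClassNotFoundException {
        try (ObjectInputStream in = new ObjectInputStream(new ByteArrayInputStream(wire))) {
            return (Callable<T>) in.readObject();
        }
    }

    public static void main(String[] args) throws Exception {
        byte[] wire = serialize(new WordCount("count these four words"));
        Callable<Integer> remote = deserialize(wire);
        System.out.println(remote.call());   // prints 4
    }
}
```

Only the object's state is written to the byte array, not the class's bytecode, which is why the receiving side needs access to the class definition.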
Are SQL queries load balanced?
SQL queries are always broadcast to every node that holds data for the caches used in the query. The exceptions are local SQL queries (query.setLocal(true)), which are executed on the local node only, and queries for which the target node can be identified precisely.
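As a rough mental model of the two execution paths, the sketch below treats a cluster as a map from node name to that node's local rows. It is a hypothetical simplification in plain Java, not Ignite's query engine: the class QueryFanOutSketch, its methods, and the string rows are all invented for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.function.Predicate;

/** Toy model of broadcast versus local query execution; not Ignite's query engine. */
final class QueryFanOutSketch {
    /** Broadcast: every data node filters its own rows, and the partial results are merged. */
    static List<String> broadcast(Map<String, List<String>> rowsByNode, Predicate<String> where) {
        List<String> merged = new ArrayList<>();
        for (List<String> localRows : rowsByNode.values()) {
            for (String row : localRows) {
                if (where.test(row)) {
                    merged.add(row);
                }
            }
        }
        return merged;
    }

    /** Local: only the named node's rows are scanned, so rows held elsewhere are not returned. */
    static List<String> local(Map<String, List<String>> rowsByNode, String node, Predicate<String> where) {
        List<String> result = new ArrayList<>();
        for (String row : rowsByNode.getOrDefault(node, Collections.emptyList())) {
            if (where.test(row)) {
                result.add(row);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        Map<String, List<String>> cluster = new TreeMap<>();   // sorted for stable output
        cluster.put("node-1", Arrays.asList("apple", "avocado"));
        cluster.put("node-2", Arrays.asList("banana", "apricot"));

        Predicate<String> startsWithA = row -> row.startsWith("a");
        System.out.println(broadcast(cluster, startsWithA));
        System.out.println(local(cluster, "node-2", startsWithA));
    }
}
```

The local variant sees only part of the data set, so it returns a complete answer only when the relevant rows are known to live on that node.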
Can I control resource allocation by user? I.e. can I restrict User A to 50 nodes, but User B can run tasks on all 100?
Multi-tenancy exists only for caches. Caches can be created on a subset of nodes (see CacheConfiguration.setNodeFilter), and security allows you to grant permissions on a per-cache basis.
What is the future of IGFS?
IGFS was originally developed as a solution for Hadoop acceleration. In practice, however, IGFS provides inconsistent performance benefits, and the gains it does provide are insignificant for production deployments. It also requires notable integration effort. To get orders-of-magnitude performance gains, your RAM-based storage has to be tightly coupled with the APIs used by the applications. With IGFS, the storage is Ignite, while the APIs were developed separately by Hive, Impala, Pig, MapReduce, etc.
That's why for Hadoop offloading use cases and real-time analytics, it's best to deploy Ignite in one of its standard configurations: Ignite with native persistence enabled. And then use Ignite SQL, compute grid, or ML for the data located in Ignite, and use Hadoop frameworks for HDFS data sets. Consider Spark as a generic API that can be used to merge data stored in both databases.