Table of Contents
Open source is always a hot topic among developers including software development outsourcing companies, and for good reason. Not only does it offer an alternative to costly proprietary software, but it also allows developers to peek under the hood and see how things work.
In this blog post, we’ll take a look at 10 of the best open source graph databases. Each has its own unique features and benefits, so read on to find out which one is right for you!
1. Neo4j Community Edition
Neo4j Community Edition is an open source graph database, licensed and distributed under GPL v3. It is a high performance graph db that supports Linux, Mac and Windows. This database is a perfect choice for managing data relationships. It’s a great tool for powering business applications and data-driven projects.
If for example you are working on an application project that has access to a Neo4j database running with organizational devices or even personal devices, then it is free to use the Community Edition whether or not the application is closed source.
Please note that Neo4j has a commercial version or what is commonly referred to as the Enterprise Edition. This is understandable because at the end of the day the creators need to find a way to power these resources. And so the commercial edition provides the resources to continue providing the amazing product.
The good part is that for a limited number of users, the commercial edition is available at no cost. Startups with a staff count of less than 50 can contact Neo4j and request a free Startup License. Single users can also get a free license for development work.
ArangoDB offers a comprehensive, powerful, and intuitive graph database solution that is perfect for managing complex data landscapes. It’s a multi-model database that is easy to use, and scalable, making it the perfect choice for any business or organization in need of a robust graph database solution.
It has tons of great features that make it one of the best graph db available, including:
- A powerful and intuitive query language that makes it easy to find the information you need, fast.
- The ability to handle very large graphs without running into performance issues.
- The ability to scale up easily to meet increased demand.
- Supports graph, document and search.
- Optimum flexibility as all data models & access patterns that are supported can be combined in queries.
- User-friendly interface.
- A unique storage model that makes it perfect for storing and querying data in graphs.
- Can handle large graphs with ease, making it a great choice for data-intensive applications.
- Its scalability means that it can handle even the largest data sets with ease.
- It’s easy to use, making it a great choice for developers who want to get up and running quickly.
3. Apache TinkerPop
Apache TinkerPop is an open source graph database that is released under the Apache License. It provides a set of interfaces and implementations for a graph database that can be used by application developers to create, read, update, and delete data in a graph database.
The Apache Tinkerpop™ framework can be used to store both graphs and analytical data. It’s an easy way for businesses, such as those in the financial or travel industry, who want to access their information on any device at any time without having to worry about losing some critical piece of information.
This graph uses the Gremlin query language. The Gremlin language is a functional, data-flow text that enables users to concisely express complex traversals of a property graph.
TinkerPop offers official language drivers for Java, Groovy, Python, .NET, and NodeJS, plus additional language drivers from the community.
- TinkerPop is designed to be scalable, which means it can handle large amounts of data without issues.
- TinkerPop is fast and efficient, so you get results in no time.
- TinkerPop is flexible and customizable, so tailor it to your specific needs.
4. Nebula Graph
Nebula Graph is quickly becoming one of the most popular open source graph database thanks to its impressive features and ease of use. Whether you’re looking to store complex data or simply need a fast and reliable database, Nebula Graph is the perfect solution.
One of the most outstanding aspects about Nebula Graph is that it is properly built for the cloud. This means it offers some of the best in class features.
Here are the impressive features of the Nebula Graph:
- High performance: Nebula Graph uses the powerful storage engine RocksDB to provide fast read and write performance with high throughput. It also has a state-of-the art design that delivers highly concurrent access, allowing it to process graphs containing trillions of edges or vertices without issue!
- Horizontal scalability: Nebula Graph offers scalability thanks to the shared-nothing distributed architecture that it uses. This means the performance of the cluster will not be affected whenever new nodes or services are added.
- Compatible with openCypher: The nGQL query language of Nebula Graph is gradually being incorporated into openCypher. As a result, openCypher users can easily start using Nebula Graph with no learning curve.
- Stability: Nebula Graph offers unparalleled stability, always ready to be deployed in the most demanding production ecosystems.This has been validated by the long list of large companies that are already using it in real-world environments. Some of the companies that trust the Nebula Graph include Tencent, WeBank, and Meituan.
This graph is best suited for DevOps & DBA, data architects and scientists. It integrates very well with other open source big data frameworks like Spark GraphX, and Flink among others.
Dgraph is an open source graph database that is unrivaled in terms of speed, scalability, and features. It is designed to be horizontally scalable, so it can easily handle large amounts of data.
Dgraph also has a number of unique features that make it stand out from other graph databases. For example, Dgraph supports real-time search and analytics, so you can instantly find the information you need. Additionally, Dgraph is highly customizable, so you can tailor it to your specific needs.
- Supports Native GraphQL.
- Scales horizontally and vertically without affecting uptime.
- Supports Distributed ACID Transactions (Jepsen tested).
- Two Jepsen tests have been completed successfully.
- Provides an SQL migration tool.
- All database files are encrypted at rest using keys that are supplied by the user.
There’s a huge amount of documentation and support available, and the developer community is always working on new features and improvements.
Memgraph is a powerful open source graph database that can handle massive amounts of data. It’s perfect for companies that need to manage complex data relationships, and offers a lot of flexibility and scalability.
Developers can build cool data-driven stuff with Memgraph such as recommendation engines, e-commerce platforms or fraud detection systems. It offers a PageRank algorithm which developers can use to create social network features to measure the value of nodes based on incoming relationships and how valuable the source node is.
What makes Memgraph particularly interesting is that it’s designed to work with big data sets. That means businesses and organizations of all sizes can take advantage of its capabilities, and they don’t have to worry about licensing fees or restrictions.
Some of the popular use case for Memgraph include;
- Fraud Detection
- Social Network Analysis
- Route Computation
- Identity and Access Management
- Process Optimisation
- Recommendation Engines
7. OrientDB Community Edition
The OrientDB Community Edition is the open source version of OrientDB, a very powerful open source graph database. It’s written in Java and runs on Windows, Mac, and Linux. It has an index-free search algorithm, which makes finding information a breeze. Plus, it has a replication feature that ensures your data is always safe.
OrientDB also boasts an impressive performance, making it the perfect choice for high-volume applications. And if that’s not enough, it also has an intuitive user interface that makes it easy to get started.
- Offers both ACID and BASE guarantees, so you can be sure your data is safe and reliable
- Powerful search engine that makes finding specific information quick and easy.
- Supports clustering, so you can easily scale your database as your needs grow.
- A rich set of APIs that make it easy to integrate with your existing systems.
It’s important to note that OrientDB combines the power of graphs and document scalability to deliver high performance. It’s the first Multi-Model Open Source NoSQL DBMS that has been able to achieve this.
HypergraphDB is one of the most popular graph databases because it offers a lot of features and benefits that other graph databases don’t. Its storage mechanism is based on the directed hypergraphs mechanism, a high performance knowledge management formalism. This graph is most suited for knowledge management, AI and projects that are focussed on the semantic web. It can also be used for all sizes of Java projects as an object-oriented database.
HypergraphDB has a powerful query engine that lets you run complex queries in a fraction of the time it would take with other databases. The user interface is intuitive, making it easy to manage data.
Here are the standout features that elevate HypergraphDB to this list of the very best graph databases:
- Powerful data modeling and knowledge representation.
- Customizable indexing and storage management.
- An extendable, dynamic DB schema that accommodates custom typing.
- Java OO database.
- MVCC/STM: fully transactional and multi-threaded. Concurrent writes and reads don’t create blocking problems!
- Efficient data distribution thanks to a P2P framework.
The creators of Cayley say that this open source graph was inspired by the graph databases behind Freebase as well as the knowledge graph by Google. There you have it! And indeed the inspiration must be doing magic because they have built a rich graph with lots of easy to use features. It’s available under Apache 2.0 license, free to use for any kind of projects.
These are some of the characteristics that make Cayley an outstanding graph db:
- Excellent for linked data: Supports RDF databases, including data formats that are multiple linked such as NQuads and JSON-LD.
- Query languages: Supports multiple query languages including Gizmo (Gremlin dialect), MQL and GraphQL dialect
- Integration with existing databases: Cayley works perfectly on top of existing databases regardless of the data model that is used.
- Modularity: Everything in Cayley is modular, meaning you can extend it with whatever you want including new formats, query languages, or custom models.
- Community driven: This is perhaps the most valuable feature of this graph. A community is very important in open source databases and Cayley boasts an active community that shares a passion for linked data.
RedisGraph is a new addition to the Redis family of databases that offers powerful capabilities for storing and manipulating graph data. It is different from other graph databases in that it is built on top of Redis, which gives it some significant advantages. Chief among these is the speed; RedisGraph is incredibly fast, making it an excellent choice for real-time applications. The support for traversals is up there, which is a key requirement for graph databases.
Check out these powerful feature of RedisGraph:
- RedisGraph offers quick indexing and querying.
- The data structures are memory efficient and allow customization.
- Supports the openCypher query language, one of the most popular graph query languages for graphs.
- On-disk persistence, which means that your data will be safe even if there is a power outage or other similar problems.
- Tabular result sets:
RedisGraph is licensed under the Redis License Agreement. It’s a young project but still a competent alternative to many existing databases.
Graph databases are becoming more and more popular every day. As this technology grows, the community around it is evidently growing with it. This is good news because it means that newcomers can look forward to an active and supportive community.
And so if you’re looking for a graph database that has an active and helpful community, remember these top graph databases. All these options have great communities that will be happy to help you get started.
Is Neo4j the best graph database? ›
They store nodes and relationships instead of documents or tables. Traversing through nodes, joins, and relationships is a lot faster than assessing individual values. Here is a list of 9 open-source graph databases for different use cases.What is the most popular open-source software? ›
- Shotcut. ...
- LibreOffice. ...
- Mozilla Thunderbird. ...
- Mozilla Firefox. ...
- VLC Media Player. ...
- GIMP. ...
- Linux. ...
- WordPress. WordPress is probably the most popular blogging platform in the world, used by more than 60 million people and organizations.
MySQL is the primary database used by Facebook for storing all social data. They started with the InnoDB MySQL database engine and then wrote MyRocksDB, which was eventually used as the MySQL Database engine. Memcache sits in front of MySQL as a cache.Does Facebook use graph database? ›
Every time you visit LinkedIn and see first-, second-, or third-degree connections, you're getting results from the social networking site's professional network graph built on a graph database. Facebook, Instagram and Twitter all use graph databases and analytics to understand how users relate to each other and ...What is Neo4j weakness? ›
Additionally, Neo4j has scalability weaknesses related to scaling writes, hence if your application is expected to have very large write throughputs, then Neo4j is not for you.Which big companies use Neo4j? ›
Leading telcos like Verizon, Orange, Comcast, and AT&T rely on Neo4j to manage networks, control access, and enable customer 360.Why is Neo4j so popular? ›
Neo4j (an early open-sourced graph database, and still quite popular) began gaining popularity for certain types of mathematical graph processing. At the same time, hardware (by way of cloud computing) had gained enough speed to resolve many of the early performance challenges.Is Neo4j faster than SQL? ›
How much faster is a graph database, really?
|Depth||Execution Time – MySQL||Execution Time –Neo4j|
|5||Not Finished in 1 Hour||2.132|
What is the most popular database? ›
As of February 2023, the most popular database management system (DBMS) in the world was Oracle, with a ranking score of 1247.52; MySQL and Microsoft SQL server rounded out the top three.What is the most stable database? ›
MySQL is extremely easy to deploy and manage. It supports ACID (Atomicity, Consistency, Isolation, Durability), thereby making it the most reliable. This Relational Database Management System (RDBMS) offers fast-loading utilities with different memory cache to maintain and administer Servers.What is the most popular kind of database today? ›
Today, MySQL is one of the most popular and widely used SQL databases. It is also one of the most used databases in Web Applications. Some of the world's largest Web-Scale applications (e.g., Facebook, Uber) uses MySQL.What is an example of a graph database? ›
With the Graph Database model, Digital Asset Management becomes intuitive. Graph Database Example: Netflix uses Graph Database for its Digital Asset Management because it is a perfect way to track which movies (assets) each viewer has already watched, and which movies they are allowed to watch (access management).Which is the example of open source database? ›
Open source databases in the cloud
Amazon relational database. Microsoft Azure SQL database. Oracle database. IBM Db2 on cloud.
Google Cloud has long been a leading open source contributor and partner to organizations that are focused on fully managed open source databases and community-driven innovation.What is the largest open source? ›
Open Source Projects and GitHub
The largest of platforms that provide the space for such projects is GitHub. With hundreds of thousands of projects and millions of devs belonging to all tiers of expertise, it is the most innovative community on the internet.
- Autodesk SketchBook – Best Graphics & Drawing Software.
- Spotify – Best Entertainment and Media Software.
- PhotoDirector 10 Essential – Best Photo Editor Software.
- Dropbox – Best Storage Software.
- Revo Uninstaller – Best Utilities Software.
- Mozilla Firefox.
- VLC media player.
- Apache web server.
Amazon RDS supports 6 familiar engines, including 3 open source databases: MySQL, PostgreSQL, and MariaDB.
What database is used by Netflix? ›
The biggest CockroachDB cluster at Netflix is a 60-node, single-region cluster with 26.5 terabytes of data.Which database is Google using? ›
Databases Used By Google
If you just need a quick answer, Google uses BigTable, Spanner, Google Cloud SQL, MySQL, Dremel, Millwheel, Firestore, Memorystore Firebase, Cloud Dataflow, BigQuery & many more. It has a polyglot persistence architecture. If you want to stick around for details here we go.
Netflix adopted JanusGraph + cassandra + elasticsearch as their graph database infrastructure. The use of the graph database is in their digital asset management. Entities such as Assets, Movie, Display Sets and so on are vertex, and all relations are edges.Does Google Maps use graph database? ›
We're using a graph database like DataStax Graph to store the graph for our design.What graph database does Instagram use? ›
The API is built on the Facebook Graph API.Can Neo4j handle big data? ›
Graph databases like Neo4j are ideal for modeling complex relationships--and they move through big data at lightspeed.What language does Neo4j use? ›
Cypher is Neo4j's graph query language that lets you retrieve data from the graph. It is like SQL for graphs, and was inspired by SQL so it lets you focus on what data you want out of the graph (not how to go get it).How many nodes can Neo4j handle? ›
|Number of total users||100-200 (end users, most likely accessing via front end applications)|
|Number of visits (read/queries) per day per user||5|
|Number of Nodes||50-75 MM|
|Number of Relationships||100 – 150 MM|
“Neo4j helps us to understand our online shoppers' behavior and the relationship between our customers and products, providing a perfect tool for real-time product recommendations....Does Amazon use Neo4j? ›
In the public subnets, Neo4j deployed to Amazon Elastic Compute Cloud (Amazon EC2) instances in an Auto Scaling group. You can deploy a single instance or an autonomous cluster of 3–10 instances (three shown). *
Does Google use Neo4j? ›
Developers and data scientists on Google Cloud Platform (GCP) can accelerate graph adoption to build today's intelligent applications and machine learning (ML) workflows with Neo4j – the most deployed and trusted graph data platform that's available as a fully managed cloud service or self-hosted.What problem does Neo4j solve? ›
Neo4j is an internet-scale, native graph database that leverages connected data to help companies build intelligent applications that meet today's evolving challenges including machine learning and artificial intelligence, fraud detection, real- time recommendations and master data.Who owns Neo4j? ›
The Neo4j, Inc. board of directors consists of Michael Treskow (Eight Roads), Emanuel Lang (Greenbridge), Christian Jepsen (Sunstone), Denise Persson (CMO of Snowflake), David Klein (One Peak), Nathalie Kornhoff-Brüls (Eurazeo), Patrick Pichette (Inovia Capital and former CFO of Google), and Emil Eifrem (CEO of Neo4j).Do you have to pay for Neo4j? ›
Neo4j Community Edition is fully open source, licensed and distributed under GPL v3. Neo4j offers a number of commercial licensing options, including free licenses for development, startups, academic-educational uses and of course, evaluation.How long does it take to learn Neo4j? ›
Neo4j Fundamentals — a course aimed at beginners, whether technical or non-technical, who want to learn what the fuss is all about. (30–60 mins)How much memory does Neo4j need? ›
1GB is a good starting point for a server that is dedicated to running Neo4j. However, there are cases where the amount reserved for the OS is significantly larger than 1GB, such as servers with exceptionally large RAM.What is Neo4j best for? ›
Neo4j is a graph database. A graph database, instead of having rows and columns has nodes edges and properties. It is more suitable for certain big data and analytics applications than row and column databases or free-form JSON document databases for many use cases. A graph database is used to represent relationships.Which database is best for millions of records? ›
Oracle has provided high-quality database solutions since the 1970s. The most recent version of Oracle Database was designed to integrate with cloud-based systems, and it allows you to manage massive databases with billions of records. Oracle offers SQL and NoSQL database solutions.
- Alibaba Cloud.
- Amazon Web Services (AWS)
- Google Cloud Platform (GCP)
RonDB provides the best low-latency, high throughput, and high availability database available today. Stockholm, Sweden, Feb. 24, 2021 (GLOBE NEWSWIRE) -- Logical Clocks announces the early availability of RonDB, the world's fastest key-value store with SQL capabilities offered now as a managed service on the cloud.
What database do most websites use? ›
- The Oracle. Oracle is the most widely used commercial relational database management system, built-in assembly languages such as C, C++, and Java. ...
- MySQL. ...
- MS SQL Server. ...
- PostgreSQL. ...
- MongoDB. ...
- IBM DB2. ...
- Redis. ...
- Database Oasis. Database Oasis is a popular solution that allows you to create and custom databases in minutes. ...
- Amazon SimpleDB. Another super popular solid but simple database software solution. ...
- Apache OpenOffice Base. ...
- Basemaster. ...
- Knack. ...
- Sesame Database Manager. ...
- Restdb.io. ...
Neo4j Community Edition is fully open source, licensed and distributed under GPL v3. Neo4j offers a number of commercial licensing options, including free licenses for development, startups, academic-educational uses and of course, evaluation.What is the most widely used open source database? ›
Most popular open source database management systems worldwide 2022. As of January 2022, the most popular open source database management system (DBMS) in the world was MySQL, with a ranking score of 1206.Does Microsoft have a graph database? ›
Microsoft Graph Data Connect provides a new way for you to interact with the data that's available through Microsoft Graph APIs. Data Connect provides a unique set of tools that streamline the building of intelligent applications, all within the Microsoft cloud.Is GraphDB free? ›
GraphDB-Free is free to use. SE and EE are licensed per CPU-Core used. Perpetual and annual subscription models are available. As of April 2021, Graph DB is ranked as the 4th most -popular RDF store and 6th most-popular Graph DBMS system.How much does Neo4j license cost? ›
|MFR Part #||Product Name / Description||Annual Subscription Price|
|NT 010||Neo4j Enterprise Bundle Base||$ 189,188|
|NT 011||Additional Production Capacity (per Core)||$ 6,609|
|NT 012||Disaster Recovery (per Core)||$ 3,305|
|NT 013||Additional Test Instances (per Instance)||$ 4,957|
MySQL, Oracle, PostgreSQL, Microsoft SQL Server, MongoDB, Redis, Elasticsearch, Cassandra, MariaDB, IBM Db2.What are the 4 types of database? ›
The other three include:
- hierarchical database systems.
- network database systems.
- object-oriented database systems.
As of February 2023, the most popular database management system (DBMS) in the world was Oracle, with a ranking score of 1247.52; MySQL and Microsoft SQL server rounded out the top three.
Does Amazon have a graph database? ›
Amazon Neptune is a purpose-built, high-performance graph database engine optimized for storing billions of relationships and querying the graph with milliseconds latency.Does Google offer a graph database? ›
Google Cloud and Neo4j offer scalable, intelligent tools for making the most of graph data. Neo4j Graph Data Science and Google Cloud Vertex AI make building AI models on top of graph data fast and easy.What are graph databases not good for? ›
Graph databases don't allow modeling nested relations, which would require having a binary edge point to a binary edge. The only possible way to achieve this would be through reification, transforming an edge into a node so that another edge can point to it.Which database is completely free? ›
InterBase, Microsoft SQL, MY SQL, PostgreSQL, and MongoDB are some of the best free database software available.Is graph database NoSQL? ›
A graph database is a type of NoSQL database that is designed to handle data with complex relationships and interconnections. In a graph database, data is stored as nodes and edges, where nodes represent entities and edges represent the relationships between those entities.