Elena-Simona Apostol, Adrian-Cosmin Cojocaru and Ciprian-Octavian Truică. Large-Scale Graphs Community Detection using Spark GraphFrames, The 23rd International Symposium on Parallel and Distributed Computing (ISPDC), July 2024.
Requirements:
- Apache Spark environment
- Python (the run command uses python3.7)
Packages needed:
- graphframes
- pyspark.sql
- itertools
- networkx
- pandas
- time
- heapq
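
Of the packages above, itertools, time and heapq ship with the Python standard library, so only graphframes, pyspark, networkx and pandas need to be installed. A minimal sketch for checking the list before running the script could look like the following; the helper name `missing_packages` and the `REQUIRED` list are illustrative and not part of this repository.

```python
import importlib.util

# Package names as listed in this README.
REQUIRED = ["graphframes", "pyspark.sql", "itertools", "networkx",
            "pandas", "time", "heapq"]


def missing_packages(names):
    """Return the names that the import system cannot locate."""
    missing = []
    for name in names:
        try:
            found = importlib.util.find_spec(name) is not None
        except (ImportError, ValueError):
            # Raised, for example, when the parent package of a dotted name
            # (pyspark for pyspark.sql) is not installed.
            found = False
        if not found:
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are importable.")
```

This only checks that the modules can be found; it does not verify that the GraphFrames JAR is available to the Spark session.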
Dataset for testing: authors_graph.csv
Run: python3.7 communitydetection_graphframes.py.py