Skip to content
View SemyonSinchenko's full-sized avatar
👷‍♂️
I may be slow to respond.
👷‍♂️
I may be slow to respond.

Organizations

@apache

Block or report SemyonSinchenko

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
SemyonSinchenko/README.md

Sem Sinchenko

Data Egnineer, Open Source Software enthusiast, Apache Software Foundation committer.

I'm developing in Python, Scala/Java and some Rust. Mostly my activities are related to the Apache Spark / PySpark ecosystem and Data Engineering tools.

I'm a maintainer at the following projects:

  • GraphFrames -- scalabale graph algorithms on top of Apache Spark DataFrames.
  • Apache GraphAr (incubating) -- universal "open-table" format for storing Property Graphs.
  • graphframes-rs -- vertex-centric graph algorithms on top of Apache Datafusion.
  • spark-fast-tests -- Apache Spark testing helpers and assertions (Scala).
  • chispa -- Apache Spark testing helpers and assertions (Python).
  • falsa -- CLI tool for generating datasets of the H2O benchmark. Wriiten in Rust.

And other various projects.

Wakatime weekly stats:

Scala             11 hrs 37 mins  █████████████████████░░░░   84.57 %
YAML              1 hr 7 mins     ██░░░░░░░░░░░░░░░░░░░░░░░   08.19 %
Markdown          13 mins         ▒░░░░░░░░░░░░░░░░░░░░░░░░   01.67 %
Protocol Buffer   12 mins         ▒░░░░░░░░░░░░░░░░░░░░░░░░   01.56 %
properties        8 mins          ▒░░░░░░░░░░░░░░░░░░░░░░░░   01.02 %

About any open source activities and / or collaborations you can reach me using [email protected].

About any other activities and / or collaborations you can reach me using my private email [email protected].

Pinned Loading

  1. apache/incubator-graphar apache/incubator-graphar Public

    An open source, standard data file format for graph data storage and retrieval.

    C++ 306 76

  2. graphframes/graphframes graphframes/graphframes Public

    GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs

    Scala 1.1k 255

  3. graphframes-rs graphframes-rs Public

    GraphFrames but in DataFusion

    Rust 7 1

  4. flake8-pyspark-with-column flake8-pyspark-with-column Public

    A flake8 plugin that detects of usage withColumn in a loop or inside reduce

    Python 28 1

  5. mrpowers-io/falsa mrpowers-io/falsa Public

    Python 8 2

  6. MrPowers/chispa MrPowers/chispa Public

    PySpark test helper methods with beautiful error messages

    Python 722 74