Apache Arrow

Krisztián Szűcs

Database Engineer
Data Engineer
Software Engineer
Apache Arrow defines a language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware like CPUs and GPUs. The Arrow memory format also supports zero-copy reads for lightning-fast data access without serialization overhead.
Some of my contributions to the project:
• Implementing the entire CI/CD system for Apache Arrow over all the implementations, several architectures and platforms producing packages with more than a 100 million downloads per month.
• Generic Maintenance of the Apache Arrow project as a PMC member.
• Developing Apache Arrow Python and C++ implementations.
• Contributing to various Arrow subprojects, like the now top-level Apache DataFusion
• Developing and optimizing Python, NumPy, Pandas to/from Arrow conversions paths in PyArrow.
Partner With Krisztián
View Services

More Projects by Krisztián