Python API for Apache Spark - A unified analytics engine for large-scale data processing
URL: Visit APIs.json URL
- big data, distributed computing, data processing, machine learning, streaming, python
- Created: 2024
- Modified: 2024
Core Spark functionality including RDDs, SparkContext, and basic operations
Human URL: https://spark.apache.org/docs/latest/api/python/reference/pyspark.html
- rdd, spark-context, core
Structured data processing with DataFrame and SQL operations
Human URL: https://spark.apache.org/docs/latest/sql-programming-guide.html
- dataframe, sql, structured-data
Real-time stream processing capabilities
Human URL: https://spark.apache.org/docs/latest/streaming-programming-guide.html
- streaming, real-time, dstream
Machine learning library with scalable algorithms
Human URL: https://spark.apache.org/docs/latest/ml-guide.html
- machine-learning, mllib, algorithms
DataFrame-based machine learning API
Human URL: https://spark.apache.org/docs/latest/ml-pipeline.html
- machine-learning, pipeline, dataframe
Graph processing and analysis capabilities
Human URL: https://graphframes.github.io/graphframes/docs/_site/index.html
- graph, graphframes, network-analysis
- Website
- GitHub
- Installation Guide
- Quick Start
- Downloads
- Community
- Mailing Lists
- Issue Tracker
- Release Notes
- Security
FN: Apache Software Foundation
Email: dev@spark.apache.org