Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
-
Updated
Apr 18, 2024 - HTML
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
Reference Architectures for Datalakes on AWS
Using machine learning models to predict if patients have chronic kidney disease based on a few features. The results of the models are also interpreted to make it more understandable to health practitioners.
This course will teach students to use popular tools for sourcing data, transforming it, building and optimizing models, communicating these as visual stories, and deploying them in production.
Course materials for CDS 101: Introduction to Computational and Data Sciences, offered at George Mason University
A website to help users view, verify and modify data for preprocessing and apply various classical ML algorrithms
XABN (XML Abbreviated Notation) - Combines a simplified format for representing XML data with a cross platform object notation covering a comprehensive range of data structures and types. XABN allows for the direct exchange of objects between applications including XML generation and conversion.
Implementation of a traditional classifier of argumentative components (claims and premises), trained with features/metadata previously extracted from manually annotated argumentative sentences from the citizen proposals available in the Decide Madrid platform.
Introduction to Diffusion Real-Time Event Stream through a simple application using Diffusion Cloud and Apache Kafka. A simple projects illustrating real-time replication and fan-out of foreign exchange (fx) event streams from Kafka cluster A to Kafka cluster B, through Diffusion Cloud instance via the use of our Kafka Adapter.
A Jupyter notebook documentation of an ETL (extract -> transform -> load) data pipeline
Add a description, image, and links to the data-transformation topic page so that developers can more easily learn about it.
To associate your repository with the data-transformation topic, visit your repo's landing page and select "manage topics."