Skip to content

Conversation

@bimanmandal
Copy link
Collaborator

No description provided.

@bimanmandal bimanmandal merged commit d4de06d into feature/scala-code/ch01 Dec 29, 2021
@bimanmandal bimanmandal deleted the feature/generate-sh-files branch December 29, 2021 06:24
deepakmca05 added a commit that referenced this pull request Dec 29, 2021
* Scala Code for data algorithms with spark-ch01

* Refactored the package name

* With spark contextclose

* Feature/scala code/ch01 biman (#3)

* fold by key

* added comments for fold by key

* added Average by Group By Key

* added Average by ReduceByKey

* dataframe action describe

* dataframe add column

* dataframe drop column

* dataframe filter rows

* dataframe join

* dataframe join inner

* dataframe join left

* dataframe join right

* dataframe sql

* rdd map transformations

* rdd filter transformations

* rdd flatMap transformations

* rdd transformations groupByKey

* rdd transformations Join

* rdd transformations Cartesian

* rdd transformations Combine By Key

* rdd transformations MapPartitions

* rdd transformations Sort By

* rdd action take ordered

* Feature/scala code/ch01 deepak (#4)

* fold by key

* added comments for fold by key

* added Average by Group By Key

* added Average by ReduceByKey

* dataframe action describe

* dataframe add column

* dataframe drop column

* dataframe filter rows

* dataframe join

* dataframe join inner

* dataframe join left

* Refactroed code + RDD Transformation

* INDENTATION CORRECT

* ReduceByKey

* ReduceByKey1

Co-authored-by: bimanmandal <biman29@gmail.com>

* added gradle shorthands

* Feature/generate sh files (#6)

* added gradle shorthands

* added the shell script

* added the shell script

* updated the shell commands

Co-authored-by: Biman Mandal <43241698+bimanmandal@users.noreply.github.com>
Co-authored-by: bimanmandal <biman29@gmail.com>
bimanmandal added a commit that referenced this pull request Jan 7, 2022
* added kmers for FASTA and FASTQ formats

* DNA Based count in scala

* Indentation fix

* improved documentation

* ch02

* Feature/scala code/ch01 (#5)

* Scala Code for data algorithms with spark-ch01

* Refactored the package name

* With spark contextclose

* Feature/scala code/ch01 biman (#3)

* fold by key

* added comments for fold by key

* added Average by Group By Key

* added Average by ReduceByKey

* dataframe action describe

* dataframe add column

* dataframe drop column

* dataframe filter rows

* dataframe join

* dataframe join inner

* dataframe join left

* dataframe join right

* dataframe sql

* rdd map transformations

* rdd filter transformations

* rdd flatMap transformations

* rdd transformations groupByKey

* rdd transformations Join

* rdd transformations Cartesian

* rdd transformations Combine By Key

* rdd transformations MapPartitions

* rdd transformations Sort By

* rdd action take ordered

* Feature/scala code/ch01 deepak (#4)

* fold by key

* added comments for fold by key

* added Average by Group By Key

* added Average by ReduceByKey

* dataframe action describe

* dataframe add column

* dataframe drop column

* dataframe filter rows

* dataframe join

* dataframe join inner

* dataframe join left

* Refactroed code + RDD Transformation

* INDENTATION CORRECT

* ReduceByKey

* ReduceByKey1

Co-authored-by: bimanmandal <biman29@gmail.com>

* added gradle shorthands

* Feature/generate sh files (#6)

* added gradle shorthands

* added the shell script

* added the shell script

* updated the shell commands

Co-authored-by: Biman Mandal <43241698+bimanmandal@users.noreply.github.com>
Co-authored-by: bimanmandal <biman29@gmail.com>

* updated README.md

* updated README.md

* ch02-changes

* ch02-changes

* Feature/scala code/ch01 missing class gradle (#7)

* gradle wrapper fix

* gradle wrapper fix

* gradle wrapper fix

* gradle wrapper fix

* added bonus chapter correlation

* added bonus chapter correlation

* updated docs

* updated docs

* updated docs

* updated docs

* updated docs

* updated docs

* updated docs

* updated docs

* updated docs

* updated docs

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* DNABaseCountFastq

* added the code changes for chapter 2

* added the run_spark_applications_scripts

* added the conditions for 1GB data

* added the readme file

Co-authored-by: Mahmoud Parsian <mahmoud.parsian@yahoo.com>
Co-authored-by: deepakmca05 <deepakmca05@gmail.com>
deepakmca05 added a commit that referenced this pull request Jan 7, 2022
* added kmers for FASTA and FASTQ formats

* DNA Based count in scala

* Indentation fix

* improved documentation

* ch02

* Feature/scala code/ch01 (#5)

* Scala Code for data algorithms with spark-ch01

* Refactored the package name

* With spark contextclose

* Feature/scala code/ch01 biman (#3)

* fold by key

* added comments for fold by key

* added Average by Group By Key

* added Average by ReduceByKey

* dataframe action describe

* dataframe add column

* dataframe drop column

* dataframe filter rows

* dataframe join

* dataframe join inner

* dataframe join left

* dataframe join right

* dataframe sql

* rdd map transformations

* rdd filter transformations

* rdd flatMap transformations

* rdd transformations groupByKey

* rdd transformations Join

* rdd transformations Cartesian

* rdd transformations Combine By Key

* rdd transformations MapPartitions

* rdd transformations Sort By

* rdd action take ordered

* Feature/scala code/ch01 deepak (#4)

* fold by key

* added comments for fold by key

* added Average by Group By Key

* added Average by ReduceByKey

* dataframe action describe

* dataframe add column

* dataframe drop column

* dataframe filter rows

* dataframe join

* dataframe join inner

* dataframe join left

* Refactroed code + RDD Transformation

* INDENTATION CORRECT

* ReduceByKey

* ReduceByKey1

Co-authored-by: bimanmandal <biman29@gmail.com>

* added gradle shorthands

* Feature/generate sh files (#6)

* added gradle shorthands

* added the shell script

* added the shell script

* updated the shell commands

Co-authored-by: Biman Mandal <43241698+bimanmandal@users.noreply.github.com>
Co-authored-by: bimanmandal <biman29@gmail.com>

* updated README.md

* updated README.md

* ch02-changes

* ch02-changes

* Feature/scala code/ch01 missing class gradle (#7)

* gradle wrapper fix

* gradle wrapper fix

* gradle wrapper fix

* gradle wrapper fix

* added bonus chapter correlation

* added bonus chapter correlation

* updated docs

* updated docs

* updated docs

* updated docs

* updated docs

* updated docs

* updated docs

* updated docs

* updated docs

* updated docs

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* improved documentation

* DNABaseCountFastq

* added the code changes for chapter 2

* added the run_spark_applications_scripts

* added the conditions for 1GB data

* added the readme file

Co-authored-by: Mahmoud Parsian <mahmoud.parsian@yahoo.com>
Co-authored-by: deepakmca05 <deepakmca05@gmail.com>

Co-authored-by: Biman Mandal <43241698+bimanmandal@users.noreply.github.com>
Co-authored-by: Mahmoud Parsian <mahmoud.parsian@yahoo.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants