
Issue search results · repo:AlexIoannides/pyspark-example-project language:Python

18 results

Hi, it seems like etl_config.json is not accessible when running in YARN client mode. Could you please help me investigate this issue?
  • Rustem
  • 1
  • Opened on Dec 15, 2021
  • #30

ERROR: test_transform_data (tests.test_etl_job.SparkETLTests) Test data transformer. Traceback (most recent call last): File "/home/brix/pyspark-workloads/pyspark-example-project/tests/test_etl_job.py" ...
  • marouenes
  • Opened on Aug 5, 2021
  • #28

Please add the LICENSE file to the repo so that one is sure of how to use it in closed-source codebases. See here
  • ajknzhol
  • 1
  • Opened on Mar 31, 2021
  • #27

Could you add functionality to pass job-level parameters? Pass via a parameters file, maybe?
  • sou-joshi
  • Opened on Mar 22, 2021
  • #25

When I run the code with the following command: $ spark-submit --master local[*] jobs/reconciliation.py I get the error ModuleNotFoundError: No module named 'dependencies'. It's because jobs and dependencies ...
  • mohit-manna
  • Opened on Jan 28, 2021
  • #24

File "/home/ashish/Downloads/pyspark-example-project-master/jobs/etl_job.py", line 57, in main data_transformed = transform_data(data, config['steps_per_floor']) TypeError: 'NoneType' object is not subscriptable ...
  • averma111
  • Opened on Oct 3, 2020
  • #23

https://github.com/AlexIoannides/pyspark-example-project/blob/13d6fb2f5fb45135499dbd1bc3f1bdac5b8451db/tests/test_etl_job.py#L64 You should use data_transformed, not expected_data, for the actual transformation ...
  • minhsphuc12
  • 1
  • Opened on Sep 30, 2020
  • #22

First of all, thanks for the great work! I am new to Spark and this repo has really helped me get started. I am trying to get my ETL job running on AWS EMR in cluster mode, but hit an issue ...
  • junjchen
  • 1
  • Opened on Sep 25, 2020
  • #21

If they are not class methods, then the method would be invoked for every test and a session would be created for each of those tests. `class PySparkTest(unittest.TestCase): @classmethod def suppress_py4j_logging(cls): ...`
enhancement
good first issue
  • amrishan
  • 1
  • Opened on Sep 16, 2020
  • #20

When I run `from pyspark import SparkFiles`, I get the error AttributeError: module 'logging' has no attribute 'Handler'. Python version: 3.8.5; Spark version: 3.0.0; pyspark version: 3.0.1. Anyone know how ...
  • kychanbp
  • Opened on Sep 11, 2020
  • #19