Unable to install to install Optimus #391
Comments
Hi! Thanks for the interest. That post is a little outdated. You can run: pip install optimuspyspark and then check the examples from the master branch. You have to do a different import now. Please let me know if this works for you! :)
Hi FavioVazquez, thanks for reaching out. I'm sure I tried your suggestion and it didn't work out ... going to try it again now and let you know.
OK, I'm not getting the following error from the terminal when I issue the following command after installing with pip install optimuspyspark from https://github.com/ironmussa/Optimus:
from optimus import Optimus
packt@ubuntu-c:
Sorry, I meant to say I'm NOW getting the following error from the terminal when I issue the following command after installing with pip install optimuspyspark from https://github.com/ironmussa/Optimus:
from optimus import Optimus
packt@ubuntu-c:/.local/lib/python3.5/site-packages/optimus$ from optimus import Optimus
Any thoughts?
I tried to install on another Ubuntu machine with the command sudo pip install optimuspyspark and I got the following error:
Any help will be greatly appreciated.
Hi, I managed to get this up and running. However, when I go through the example shown here and run the command df.cols.replace("product","taaaccoo","taco") there isn't any change to the table. Any reasons why?
hi @cpatte7372
Is this what you expect?
Hi ironmussa,
This worked perfectly, thanks.
I just have a few learning questions:
1. Can you explain what exactly the following function is meant to achieve?
def func(value, arg):
    return "this was a number"
2. Is there a place where I can find all the switches, e.g. remove_special_chars, remove_accents, years_between?
Thanks
Carlton
On 10 November 2018 at 23:56, Argenis Leon wrote:
hi @cpatte7372
It should work. I just executed:
df = op.load.url("https://raw.githubusercontent.com/ironmussa/Optimus/master/examples/data/foo.csv")
df.table()
<https://user-images.githubusercontent.com/37144/48307388-5353b680-e511-11e8-95bc-eec2aaf6bf09.png>
def func(value, arg):
    return "this was a number"

df\
    .rows.sort("product","desc")\
    .cols.lower(["firstName","lastName"])\
    .cols.date_transform("birth", "yyyy/MM/dd", "dd-MM-YYYY")\
    .cols.years_between("birth", "yyyy/MM/dd")\
    .cols.remove_accents("lastName")\
    .cols.remove_special_chars("lastName")\
    .cols.replace("product","taaaccoo","taco")\
    .cols.replace("product",["piza","pizzza"],"pizza")\
    .rows.drop(df["id"]<7)\
    .cols.drop("dummyCol")\
    .cols.rename(str.lower)\
    .cols.apply_by_dtypes("product",func,"string", data_type="integer")\
    .cols.trim("*")\
    .table()
<https://user-images.githubusercontent.com/37144/48307404-91e97100-e511-11e8-91b5-971813dd4fc5.png>
Is this what you expect?
Hi argenisleon, any idea where I can find more cleansing examples? Can you let me know where I can find a list of switches as shown below? Thanks
@cpatte7372
It is a function applied to all the integers in the 'product' column. More specifically, it transforms all integers into the string "this was a number".
You can also find a lot of examples and use cases in the examples folder. Hope this helps.
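To make the explanation above concrete, here is a minimal plain-Python sketch of the idea: apply func only to the values in a column that look like integers and leave everything else untouched. It does not use Spark or Optimus and is not how Optimus implements apply_by_dtypes. The helpers looks_like_integer and apply_to_integers and the sample products list are hypothetical, and treating digit-only strings as integers is an assumption made for illustration.

```python
import re


def func(value, arg):
    # Same function as in the thread: ignores its inputs and returns a fixed string.
    return "this was a number"


def looks_like_integer(value):
    """Hypothetical helper: True for ints and for strings made only of digits (optional sign)."""
    if isinstance(value, bool):  # bool is a subclass of int, so exclude it explicitly
        return False
    if isinstance(value, int):
        return True
    if isinstance(value, str):
        return re.fullmatch(r"[+-]?\d+", value.strip()) is not None
    return False


def apply_to_integers(values, func, arg=None):
    """Hypothetical stand-in for the idea behind apply_by_dtypes(..., data_type="integer")."""
    return [func(v, arg) if looks_like_integer(v) else v for v in values]


# Made-up sample data, not the contents of foo.csv.
products = ["pizza", "taco", 110790, "42", None]
print(apply_to_integers(products, func))
# ['pizza', 'taco', 'this was a number', 'this was a number', None]
```

The non-integer entries pass through unchanged, which matches the description that only the integers in the 'product' column are rewritten.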
Thanks for the information, that's great. I'm trying to run the same commands from my Databricks workspace notebook (as you probably know, Databricks is built on Spark and uses the same principle as Jupyter Notebook). I issue the following commands:
However, there isn't any output. Do you have any idea why? Cheers
@FavioVazquez can you please look at this?
Hi @cpatte7372! The table is a method of the df, so you have to run
Please let me know if this helps :)
@cpatte7372 Did you solve the problem?
@argenisleon this fixed the problem. Thank you
Hello all,
I'm really excited about what I've read about Optimus. Unfortunately, I'm coming across a number of issues getting it up and running.
I have followed the instructions here:
https://medium.com/hi-optimus/how-to-install-jupyter-notebook-4-4-0-and-optimus-on-ubuntu-18-04-92ff5ef30ea4
However, whenever I try to run the following script in my notebook I get the following error:
import optimus as op
ModuleNotFoundError Traceback (most recent call last)
in ()
----> 1 import optimus as op
ModuleNotFoundError: No module named 'optimus'
I'm running Python 3.6.
I feel that this is a simple error that I have made, but I'm at a loss.
Any help will be greatly appreciated.
Cheers
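A ModuleNotFoundError like the one above can happen when pip installs a package for a different Python interpreter than the one the notebook kernel runs. That is only one possible cause and is not confirmed in this thread, although the thread mentions both Python 3.6 and a python3.5 site-packages path. The sketch below uses only the standard library to report which interpreter is running and whether it can find the optimus module; the helper name describe_module is hypothetical.

```python
import importlib.util
import sys


def describe_module(name):
    """Report whether the top-level module `name` is importable by this interpreter."""
    spec = importlib.util.find_spec(name)
    if spec is None:
        return f"{name!r} is NOT importable by {sys.executable}"
    return f"{name!r} found at {spec.origin} for {sys.executable}"


if __name__ == "__main__":
    # Show which interpreter the notebook kernel (or shell) is actually using.
    print("Python", sys.version.split()[0], "at", sys.executable)
    # 'optimus' is the import name used in this thread.
    print(describe_module("optimus"))
```

Run it in a notebook cell. If the module is reported as not importable, installing with that same interpreter (the path printed above, followed by -m pip install optimuspyspark) targets the environment the notebook actually uses.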