Apache Drill
Lowy Shin edited this page Mar 13, 2020
·
9 revisions
Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage, Files(CSV, TSV, Text), RDBMS.
It can be join cross data platform on apache drill.
Official : https://drill.apache.org/
- Download Drill
- Run drill-embedded
./drill-embedded
- Run browser
http://localhost:8047
-
日本語 : https://talklowy-jp.blogspot.com/2020/03/apache-drilldata-scientist.html#more
-
Install JDK
- https://www.oracle.com/technetwork/java/javase/downloads/jdk8-downloads-2133151.html
- Set JAVA_HOME to Windows system environment
-
Download Drill
- http://apache.mirrors.hoobly.com/drill/drill-1.17.0/apache-drill-1.17.0.tar.gz
- Unzip
tar -xzf apache-drill-1.17.0.tar.gz
- Run create working directory on cmd prompt
mkdir "%userprofile%\drill"
mkdir "%userprofile%\drill\udf"
mkdir "%userprofile%\drill\udf\registry"
mkdir "%userprofile%\drill\udf\tmp"
mkdir "%userprofile%\drill\udf\staging"
takeown /R /F "%userprofile%\drill"
- Run drill-embedded
- Double click
drill-embedded.bat
- Double click
- Run browser
http://localhost:8047
- Download MySQL JDBC Connector
- Add connection string to Apache drill
{
"type": "jdbc",
"driver": "com.mysql.jdbc.Driver",
"url": "jdbc:mysql://my.server.net:3306",
"username": "myroot",
"password": "mypwd",
"enabled": true
}
- Download PostrgreSQL JDBC connector
- Add Connection string on Apache Drill
{
"type": "jdbc",
"driver": "org.postgresql.Driver",
"url": "jdbc:postgresql://psql.server.net:5432",
"username": "psroot",
"password": "pspwd",
"enabled": true
}
- Modify
http://localhost:8047/storage/dfs
after execute apache drill- You may add file type or working directory information
- Get file form WebUI
curl -X POST -H "Content-Type: application/json" -d '{"queryType":"SQL", "query": "select * from mysvr.mydb.`tuser` limit 10"}' http://localhost:8047/query.json >>tuser.181012.json
- Select JSON file (using dfs)
select * from dfs.`/Users/lowyshin/Downloads/mydb.tuser.181012.json` where cast(`rows.role` as varchar)='Admin' order by uid desc;
- If fields not fixec(etc log or text file)
select f from (select FLATTEN(`columns`) as f from dfs.`/Users/lowyshin/worker.tsv` t) t2 where t2.f like '%expires%' limit 10;
- Create table
Create table tRst as select * from dfs.`/Users/lowyshin/worker.tsv`
- When you can not see data then execute on drill prompt
alter system set `store.json.all_text_mode` = false
- Apache Arrow(PyArrow)を使って簡単かつ高速にParquetファイルに変換する
- online CSV to JSON converter
- KB wiki is multi-lingual writing. If you want translate to your language, use google translate!
- Go to giip service Page : http://giipweb.littleworld.net
- Documentation : https://github.com/LowyShin/giip/wiki
- Sample automation scripts : https://github.com/LowyShin/giip/tree/gh-pages/giipscripts
See more : https://github.com/LowyShin/giip/wiki
- Token exchanges : https://tokenjar.io/GIIP
- Token exchanges manual : https://www.slideshare.net/LowyShin/giipentokenjario-giip-token-trade-manual-20190416-141149519
- GIIP Token Etherscan : https://etherscan.io/token/0x33be026eff080859eb9dfff6029232b094732c52
If you want get GIIP, contact us any time!
- LowyWorkEnv : https://github.com/LowyShin/Lowyworkenv/wiki
- CLoud RDBMS比較
- MachineLearning
- SQL Server
- ORACLE
- MySQL
- mariadh
- Redis with MySQL
- Mongodb
- Apache-Drill
- A5MK2-MultiDBMSTool
- BI
- Referrals
- Juliaの自動微分パッケージ Zygote の紹介
- color code table
- Canva - free Online Design tool
- vscode
- git(github)
- GoogleSheet
- UiPath(RPA)
- WinAutomation(official)
- Excel
- VNC
- VPN
- Note Tool
- Blog
- google news alert
- Online PDF to JPG
- LoadTest " putty
- Shell/bash
- crontab
- PowerShell
- Windows(DOS) batch
- Wscript/wsf
- AWS
- Azure
- File Sync(rsync, scp...)
- KnownPort(wikipedia)
- CentOS(Linux)
- Windows10
- Android