Usage

PySQL

PySQL is a Python library for interacting with SQL Server databases. It provides a simplified interface for common database operations.

## Installation

pip install pySqlServer

pySQL requires the following packages:

pandas
numpy
sqlalchemy==1.4.32
pyodbc

pip install pandas numpy sqlalchemy==1.4.32 pyodbc

Usage

Import PySQL and Table_analyzer:

from pySqlServer import PySQL, Table_analyzer

Table_analyzer usage:

A helper class for analyzing Pandas DataFrames to determine optimal SQL datatypes.

df = pd.read_csv('test.csv')
TA = Table_analyzer()
dtype_dict = TA.analyze(df,texts_buffer=0.2)

for texts_buffer > 1 : N in nvarchar(N) sets to len_max_length + texts_buffer

for 0 < texts_buffer < 1 : N in nvarchar(N) sets to len_max_length + texts_buffer*len_max_length (extra percentage)

PySQL usage:

Create a connection

pysql = PySQL()
pysql.create_connection(server='host_ip', database='mydb', username='myuser', password='mypassword')

your user must have 'db_datareader', 'db_datawriter', 'db_ddlAdmin' permissions to module works perfectly

at first use (for every tables) you must call create_dtypes()

dtype_dict = {'col1':sql.sqltypes.INTEGER , 'col2':sql.sqltypes.NVARCHAR(100)} # it's suggested to use Table_analyzer to calculate optimal dtype_dict
# dtype_dict = TA.analyze(df,texts_buffer=0.2) 
pysql.create_dtypes(dtype_dict=dtype_dict, table_name='Test_table', schema='Test_schema')

at all next usages you must call load_dtypes and next you can use to_sql method to send data

pysql.load_dtypes(table_name='Test_table', schema='Test_schema')    # created before
pysql.to_sql(df,'Test_table', schema='Test_schema', if_exists='append', text_cutter=True, date_normalizer=True)

you can use primary_key='column_name' to set tables primary_key

in next usages it's not allowed to use this

'text_cutter' trys to cut new text if those length was taller than column capacity

'date_normalizer' trys to make date format colums suitable for sql server

and there is some read data methods in order to read data from your database (returns pandas dataframe)

pysql.tables_list(schema=None)
pysql.read_sql_table(table_name, schema=None)
pysql.read_sql_query(query='SELECT * FROM TABLE_NAME')

logger is a common method to use ( it logs datetime and process_id and actor_user in order to make your actions trackable )

every to_sql calls can join with logs with 'process_id' column , so you can find who and when it's started to store data and how long takes it process

pysql.logger('create_connection', 'success', 'connected')

auto log system logs all your method calls like bellow sample:

auto_log sample:

function	state	log	connection_user	process_id	datetime
create_connection	success	connected	pysql_user	-1	2023-08-12 16:04:22.000
read_sql_table	start	success	pysql_user	0	2023-08-12 16:04:22.000
read_sql_table	end	success	pysql_user	0	2023-08-12 16:04:23.000
create_dtypes	start	success	pysql_user	1	2023-08-12 16:04:23.000
create_dtypes	end	success	pysql_user	1	2023-08-12 16:04:23.000
load_dtypes	start	success	pysql_user	2	2023-08-12 16:04:24.000
load_dtypes	end	success	pysql_user	2	2023-08-12 16:04:24.000
to_sql	start	success	pysql_user	3	2023-08-12 16:04:24.000
to_sql	end	success	pysql_user	3	2023-08-12 16:04:32.000

data to_sql sample:

user_id	id	created_at	lang	favorite_count	quote_count	reply_count	retweet_count	views_count	bookmark_count
*****17323920629770	1647162******634625	2023-04-15 08:57:34.000	en	2	0	0	0	177	1
*****0417	1647153******613570	2023-04-15 08:23:18.000	en	1	0	1	0	12	0
*****49585152565249	1682090******974593	2023-07-20 18:08:18.000	en	642	83	65	213	1749178	8
*****87375859568642	1647152******957248	2023-04-15 08:18:00.000	ar	22	8	0	0	7	0
*****0013028323329	1647127******033537	2023-04-15 06:40:00.000	en	2	0	1	0	84	2

dtypes table sample:

table	schema	dtypes_str	proccess_id
T1	Twitter	{'user_id': 'types.BIGINT()', ... , 'bookmark_count': 'types.INTEGER()'}	1
T2	Twitter	{'user_profile_banner_url': 'types.NVARCHAR(88)',..., 'user_description_urls': 'types.NVARCHAR(443)'}	8
T3	Twitter	{'in_reply_to_status_id_str': 'types.BIGINT()',..., 'is_quote': 'types.BOOLEAN()'}	11

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
readme		readme
LICENSE		LICENSE
PySQL.py		PySQL.py
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PySQL

Usage

Table_analyzer usage:

PySQL usage:

About

Uh oh!

Releases

Packages

Languages

License

sajad-git/PySQL

Folders and files

Latest commit

History

Repository files navigation

PySQL

Usage

Table_analyzer usage:

PySQL usage:

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages