Howard Wetsman MD hwetsman

A retired addiction psychiatrist retraining in data science. I'm excited by the prospect of helping individuals have better lives using their own data.

1 follower · 0 following

View GitHub Profile

Recently created

Least recently created

Recently updated

Least recently updated

hwetsman / Numpy.md

Created January 4, 2022 00:34

np.random.choice(array,n,replace=True,p)

hwetsman / Pandas.md

Last active January 18, 2022 23:22

Pandas

Data Inspection

df.info() - gives column by col review of dtypes and number of non-null entries

df.sample(n) - returns a random sample of n entries from the df. Good for looking for quality problems. Default is n=1.

df.head(n) - returns the first n rows of the dataframe

df.tail(n) - returns the last n rows of the dataframe

hwetsman / Python_Cheatsheet.md

Last active January 9, 2022 19:07

A listing of python syntax

datetime

create datetime object

dt1 = datetime.strptime('20091031','%Y%m%d')

get 'date' object

dt1.date()

get 'time' object

dt1.time()

hwetsman / readme.md

Last active July 15, 2022 19:59 — forked from AlexMercedCoder/readme.md

Data Terms/Concepts Cheatsheet

Data Analytics/Science Terms and Concepts Cheatsheet

Structured Data

Data is organized to meet a schema. Think tables which organize data into rows and columns.

Unstructured Data

Data is unorganized and lacks a schema. Imagine collections of html documents including text and images not organized in any consistent way.