pandas创建和文件读取笔记(一)
结构:

代码:
import pandas as pd
import numpy as np
#pandas依赖处理Excel的xlrd模块,所以我们需要提前安装这个,安装命令是:pip install xlrd
#在更新为2.0.1之后,如果不指定engine='openpyxl’的话默认xlrd只能读取xls文件,安装pip install openpyxl
data_series=pd.Series([1,2,3,4]) #列表生成series
print(data_series)
##可用包含等长列表(数组/Series)的字典生成DataFrame
data_pd=pd.DataFrame({"id":[1,2,3,4],"name":["gree","huitao","luss","penfei"]})
print(data_pd)
#分别查看值和索引
print(data_series.values)
print(data_series.index)
print(data_pd.values)
print(data_pd.index)
#读取其他数据文件
pd_excel=pd.read_excel("huitao.xlsx",sheet_name="Sheet1",engine="openpyxl")#现在就ok了
p_head=pd_excel.head()#默认显示前5行数据
print(p_head)
#读取cvs
df_csv = pd.read_csv('huitao.csv') #读入csv文件
print(df_csv)
#读取json数据
df_json=pd.read_json("huitao.json")
print(df_json)
#输出数据文件
data_pd .to_csv('save_file.csv')
data_pd .to_excel('save_file.xlsx')
data_pd .to_json('save_file.json')
测试结果:
F:\开发工具\pythonProject\tools\venv\Scripts\python.exe F:/开发工具/pythonProject/tools/python的sklear学习/pandastest.py
0 1
1 2
2 3
3 4
dtype: int64
id name
0 1 gree
1 2 huitao
2 3 luss
3 4 penfei
[1 2 3 4]
RangeIndex(start=0, stop=4, step=1)
[[1 'gree']
[2 'huitao']
[3 'luss']
[4 'penfei']]
RangeIndex(start=0, stop=4, step=1)
id name adrress
0 1 huitao beijing
1 2 qiwei shanghai
2 3 weiping shanxi
Empty DataFrame
Columns: [谢谢你给我的爱,让我度过那个春天]
Index: []
id name address
0 1 huitao 陕西省清涧县
1 2 huitao 陕西省梓州
Process finished with exit code 0