pandas練習（三）------ 數據分組

時間 2019-12-11

標籤 pandas 練習數據分組简体版

原文原文鏈接

探索酒類消費數據

相關數據見（github）html

步驟1 - 導入pandas庫

import pandas as pd

步驟2 - 數據集

path3 = "./data/drinks.csv"      # drinks.csv

步驟3 將數據框命名爲drinks

drinks = pd.read_csv(path3) drinks.head()

輸出：python

步驟4 哪一個大陸(continent)平均消耗的啤酒(beer)更多？

beeravg = drinks.groupby('continent').beer_servings.mean() beeravg.sort_values(ascending=False)

輸出：git

步驟5 打印出每一個大陸(continent)的紅酒消耗(wine_servings)的描述性統計值

drinks.groupby('continent').wine_servings.describe()

輸出：github

步驟6 打印出每一個大陸每種酒類別的消耗平均值

drinks.groupby('continent').mean()

輸出：ui

步驟7 打印出每一個大陸每種酒類別的消耗中位數

drinks.groupby('continent').median()

輸出：spa

步驟8 打印出每一個大陸對spirit飲品消耗的平均值，最大值和最小值

drinks.groupby('continent').spirit_servings.agg(['mean', 'min', 'max'])

輸出：3d

參考連接：

一、http://pandas.pydata.org/pandas-docs/stable/cookbook.html#cookbookcode

二、https://www.analyticsvidhya.com/blog/2016/01/12-pandas-techniques-python-data-manipulation/htm

三、https://github.com/guipsamora/pandas_exercisesblog

相關文章

相關標籤/搜索

數據分層、數據分組、數據集

每日一句

每一个你不满意的现在，都有一个你没有努力的曾经。

最新文章

本站公眾號

歡迎關注本站公眾號,獲取更多信息

相關文章

>>更多相關文章<<