Pivoting a pandas DataFrame into the correct format: `DataError: No numeric types to aggregate`

2023-09-07 17:14:34 Author: 起步向前走

Here is a pandas DataFrame I would like to manipulate:

import pandas as pd

data = {"grouping": ["item1", "item1", "item1", "item2", "item2", "item2", "item2", ...],
        "labels": ["A", "B", "C", "A", "B", "C", "D", ...],
        "count": [5, 1, 8, 3, 731, 189, 9, ...]}

df = pd.DataFrame(data)

print(df)
>>>   grouping            labels       count
0        item1             A            5
1        item1             B            1
2        item1             C            8
3        item2             A            3
4        item2             B          731
5        item2             C          189
6        item2             D            9
7        ...               ...         ....

I would like to "unfold" this DataFrame into the following format:

grouping    A    B    C    D
item1       5    1    8    3
item2       3    731  189  9
....        ........

How would one do this? I would think that this would work:

pd.pivot_table(df, index=["grouping", "labels"])

But I get the following error:

DataError: No numeric types to aggregate

Recommended answer

There are four idiomatic pandas ways to do this.

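The four approaches are pivot, set_index, pivot_table, and groupby. pivot and set_index need no aggregation, but they require that there are no duplicates across the grouping columns.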

pivot

df.pivot(index='grouping', columns='labels', values='count')

set_index

df.set_index(['grouping', 'labels'])['count'].unstack()

pivot_table

df.pivot_table('count', 'grouping', 'labels')

groupby

df.groupby(['grouping', 'labels'])['count'].sum().unstack()

All yield:

labels      A      B      C    D
grouping                        
item1     5.0    1.0    8.0  NaN
item2     3.0  731.0  189.0  9.0
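
As a quick check of the claim that all four approaches yield the same table, here is a minimal sketch. It assumes only the seven rows shown in the question (the elided rows are left out) and uses keyword arguments throughout; the `results` dictionary is a name introduced here for illustration.

```python
import pandas as pd

# Only the seven rows visible in the question; the elided "..." rows are omitted.
df = pd.DataFrame({
    "grouping": ["item1", "item1", "item1", "item2", "item2", "item2", "item2"],
    "labels": ["A", "B", "C", "A", "B", "C", "D"],
    "count": [5, 1, 8, 3, 731, 189, 9],
})

# One entry per approach from the answer (dictionary name is illustrative).
results = {
    "pivot": df.pivot(index="grouping", columns="labels", values="count"),
    "set_index": df.set_index(["grouping", "labels"])["count"].unstack(),
    "pivot_table": df.pivot_table(values="count", index="grouping", columns="labels"),
    "groupby": df.groupby(["grouping", "labels"])["count"].sum().unstack(),
}

# Compare every table with the pivot result; dtypes are deliberately not compared.
reference = results["pivot"]
for name, table in results.items():
    pd.testing.assert_frame_equal(table, reference, check_dtype=False)
    print(name, "matches")
```

The missing (item1, D) combination shows up as NaN in each table.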

Timing (figure not recovered)

With the groupby, set_index, or pivot_table approach, you can easily fill in missing values with fill_value=0:

df.pivot_table('count', 'grouping', 'labels', fill_value=0)

df.groupby(['grouping', 'labels'])['count'].sum().unstack(fill_value=0)

df.set_index(['grouping', 'labels'])['count'].unstack(fill_value=0)

All yield:

labels    A    B    C  D
grouping                
item1     5    1    8  0
item2     3  731  189  9
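
The fill_value variants can be tried end to end with the sketch below, again assuming just the seven rows shown in the question. The last line is an addition not found in the answer: DataFrame.pivot has no fill_value parameter, so the gaps are filled afterwards with fillna(0), which is one possible substitute rather than the author's method.

```python
import pandas as pd

# Only the seven rows visible in the question; the elided "..." rows are omitted.
df = pd.DataFrame({
    "grouping": ["item1", "item1", "item1", "item2", "item2", "item2", "item2"],
    "labels": ["A", "B", "C", "A", "B", "C", "D"],
    "count": [5, 1, 8, 3, 731, 189, 9],
})

# The three approaches from the answer that accept fill_value directly.
filled_pivot_table = df.pivot_table(
    values="count", index="grouping", columns="labels", fill_value=0
)
filled_groupby = df.groupby(["grouping", "labels"])["count"].sum().unstack(fill_value=0)
filled_set_index = df.set_index(["grouping", "labels"])["count"].unstack(fill_value=0)

# Assumption (not in the answer): pivot takes no fill_value, so fill afterwards.
filled_pivot = df.pivot(index="grouping", columns="labels", values="count").fillna(0)

print(filled_groupby)
```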

Additional thoughts on groupby
Because we don't require any aggregation, if we want to use groupby we can minimize the impact of the implicit aggregation by using a less impactful aggregator:
df.groupby(['grouping', 'labels'])['count'].max().unstack()

or
df.groupby(['grouping', 'labels'])['count'].first().unstack()
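
The choice of aggregator only matters when a (grouping, labels) pair occurs more than once. The sketch below uses a small hypothetical DataFrame, `dup`, that is not from the question and in which (item1, A) appears twice, to show how sum, max, and first differ and that pivot refuses such input rather than aggregating it.

```python
import pandas as pd

# Hypothetical data, not from the question: the (item1, A) pair appears twice.
dup = pd.DataFrame({
    "grouping": ["item1", "item1", "item2"],
    "labels": ["A", "A", "A"],
    "count": [5, 2, 3],
})

grouped = dup.groupby(["grouping", "labels"])["count"]
by_sum = grouped.sum().unstack()      # duplicates are added together
by_max = grouped.max().unstack()      # the largest duplicate is kept
by_first = grouped.first().unstack()  # the first duplicate is kept

print(by_sum.loc["item1", "A"], by_max.loc["item1", "A"], by_first.loc["item1", "A"])

# pivot does not aggregate: duplicated pairs make it raise instead.
try:
    dup.pivot(index="grouping", columns="labels", values="count")
    pivot_failed = False
except ValueError as exc:
    pivot_failed = True
    print("pivot raised ValueError:", exc)
```

With data like the question's, where every pair is unique, all three aggregators return the original values unchanged.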

Timing groupby (figure not recovered)



                
                