Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

python - MongoDB: Split a multi-series into multiple columns

I have loaded a set of tweets into a MongoDB database and calculated the number of tweets per user per month.

The result is a list like the one below:

_id  User   Month  NbTweet
1    User1  1      10
2    User1  2      20
3    User2  1      15
4    User2  2      25
5    User3  1      12
6    User3  2      22


1 Reply

This should do it: index the DataFrame by Month and User, then unstack User into columns.

>>> import pandas as pd
>>> df = pd.DataFrame({'id': [1, 2, 3, 4, 5, 6],
...                    'User': ['User1', 'User1', 'User2', 'User2', 'User3', 'User3'],
...                    'Month': [1, 2, 1, 2, 1, 2],
...                    'NbTweet': [10, 20, 15, 25, 12, 22]})

>>> df
   id   User  Month  NbTweet
0   1  User1      1       10
1   2  User1      2       20
2   3  User2      1       15
3   4  User2      2       25
4   5  User3      1       12
5   6  User3      2       22

>>> df.set_index(['Month','User']).unstack()['NbTweet']
User   User1  User2  User3
Month
1         10     15     12
2         20     25     22
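
The same reshape can probably also be written with DataFrame.pivot, which names the row, column, and value fields explicitly. This is only a sketch on the sample data from this answer; the variable names wide and wide_agg are made up for illustration, and summing duplicates in the pivot_table variant is an assumption about what you would want if a (Month, User) pair appeared more than once.

```python
import pandas as pd

# Same sample data as in the answer above.
df = pd.DataFrame({
    'id': [1, 2, 3, 4, 5, 6],
    'User': ['User1', 'User1', 'User2', 'User2', 'User3', 'User3'],
    'Month': [1, 2, 1, 2, 1, 2],
    'NbTweet': [10, 20, 15, 25, 12, 22],
})

# One row per Month, one column per User, NbTweet as the cell values.
wide = df.pivot(index='Month', columns='User', values='NbTweet')

# pivot raises a ValueError when a (Month, User) pair occurs more than once.
# pivot_table aggregates such duplicates instead (sum is an assumption here).
wide_agg = df.pivot_table(index='Month', columns='User',
                          values='NbTweet', aggfunc='sum')

print(wide)
```

Unlike the set_index/unstack version, pivot ignores the id column, so no ['NbTweet'] selection is needed afterwards.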
