Extracting and parsing dates in a pandas DataFrame
I want to convert notes containing messy dates into a sorted date series in pandas.
0 03/25/93 Total time of visit (in minutes):\n
1 6/18/85 Primary Care Doctor:\n
2 sshe plans to move as of 7/8/71 In-Home Servic...
3 7 on 9/27/75 Audit C Score Current:\n
4 2/6/96 sleep studyPain Treatment Pain Level (N...
5 .Per 7/06/79 Movement D/O note:\n
6 4, 5/18/78 Patient's thoughts about current su...
7 10/24/89 CPT Code: 90801 - Psychiatric Diagnos...
8 3/7/86 SOS-10 Total Score:\n
9 (4/10/71)Score-1Audit C Score Current:\n
10 (5/11/85) Crt-1.96, BUN-26; AST/ALT-16/22; WBC...
11 4/09/75 SOS-10 Total Score:\n
12 8/01/98 Communication with referring physician...
13 1/26/72 Communication with referring physician...
14 5/24/1990 CPT Code: 90792: With medical servic...
15 1/25/2011 CPT Code: 90792: With medical servic...
The dates come in several formats, such as 04/20/2009; 04/20/09; 4/20/09; 4/3/09. I want to convert all of them to mm/dd/yyyy in a new column.
So far I have tried:
df2['date']= df2['text'].str.extractall(r'(\d{1,2}[/-]\d{1,2}[/-]\d{2,})')
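A note on the line above: `str.extractall` returns a DataFrame with one row per match, indexed by (original row, match number), so its result does not line up with `df2`'s index for a direct column assignment. A minimal sketch of one alternative, assuming only the first date in each row is wanted, uses `str.extract` instead. The sample rows are copied from the data above; the tightened year pattern is my assumption, not part of the original attempt.

```python
import pandas as pd

# Small sample taken from the rows shown above; the 'text' column name
# follows the snippet in the question.
df2 = pd.DataFrame({'text': [
    '03/25/93 Total time of visit (in minutes):',
    'sshe plans to move as of 7/8/71 In-Home Servic...',
    '5/24/1990 CPT Code: 90792: With medical servic...',
]})

# Month/day/year with a 4- or 2-digit year. The (?!\d) lookahead keeps a
# longer run of digits from being accepted as a year.
pattern = r'(\d{1,2}[/-]\d{1,2}[/-](?:\d{4}|\d{2}))(?!\d)'

# With a single capture group and expand=False, str.extract returns a Series
# holding the first match per row, aligned with df2.index.
df2['date'] = df2['text'].str.extract(pattern, expand=False)
print(df2['date'])
```

Rows without a match get NaN, which makes it easy to see which rows still need a fallback pattern.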
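Also, I don't know how to extract dates that appear only as MM/YY or YYYY without interfering with the rows already matched by the code above. Keep in mind that when the day or month is missing, I default to the first day of the month and to January, respectively.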
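One way to keep the looser formats from interfering with full dates is to try the patterns in order, most specific first, and stop at the first hit. The sketch below is only an illustration under stated assumptions: the helper names `parse_date` and `full_year` are hypothetical, two-digit years are assumed to fall in the 1900s (as in the sample rows), bare years are assumed to start with 19 or 20, and the MM/YY and YYYY example strings are invented for demonstration.

```python
import re
import pandas as pd

# Patterns are tried in this order so a full date is never re-read as
# MM/YY or as a bare year.
FULL = re.compile(r'(?<!\d)(\d{1,2})[/-](\d{1,2})[/-](\d{4}|\d{2})(?!\d)')
MONTH_YEAR = re.compile(r'(?<![\d/-])(\d{1,2})[/-](\d{4}|\d{2})(?![\d/-])')
YEAR_ONLY = re.compile(r'(?<!\d)((?:19|20)\d{2})(?!\d)')

def full_year(y):
    # Assumption: two-digit years belong to the 1900s.
    return int(y) + 1900 if len(y) == 2 else int(y)

def parse_date(text):
    """Return a Timestamp for the first date found in text, or NaT."""
    m = FULL.search(text)
    if m:
        month, day, year = int(m.group(1)), int(m.group(2)), full_year(m.group(3))
    else:
        m = MONTH_YEAR.search(text)
        if m:
            # Missing day defaults to the 1st.
            month, day, year = int(m.group(1)), 1, full_year(m.group(2))
        else:
            m = YEAR_ONLY.search(text)
            if not m:
                return pd.NaT
            # Missing month and day default to January 1st.
            month, day, year = 1, 1, int(m.group(1))
    try:
        return pd.Timestamp(year=year, month=month, day=day)
    except ValueError:
        # Impossible values such as month 16 are treated as "no date".
        return pd.NaT

df = pd.DataFrame({'text': [
    '5/24/1990 CPT Code: 90792: With medical servic...',
    '.Per 7/06/79 Movement D/O note:',
    'symptoms since 6/98',   # invented MM/YY example
    'diagnosed in 2004',     # invented YYYY example
]})
df['date'] = pd.to_datetime(df['text'].apply(parse_date))
df = df.sort_values('date')
df['formatted'] = df['date'].dt.strftime('%m/%d/%Y')
print(df[['formatted', 'text']])
```

Keeping a real datetime column for sorting and a separate string column for the mm/dd/yyyy display avoids sorting the formatted strings lexically, which would order them by month rather than by date.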