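Reshaping at multiple levels
I have a dataset that should be reshaped to wide format.
The data are currently long: each observation identifies one "area" for a given person in a given school. Because the data have two levels, the usual reshape code runs into problems. First, the data should be reshaped so that each observation uniquely identifies a person and a school (with multiple areas). Second, we should end up with one observation per person (containing multiple schools and multiple areas).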
Here is an example of what the data look like now:
* Example generated by -dataex-. To install: ssc install dataex
clear
input str4 id str2 school_code str1 area
"a111" "1x" "a"
"a111" "1x" "b"
"a111" "1x" "c"
"a111" "1y" "a"
"a111" "1y" "b"
"a111" "1y" "c"
"x222" "1z" "d"
"x222" "1z" "e"
"x222" "1z" "f"
"x222" "1k" "g"
"x222" "1k" "h"
"x222" "1k" "i"
end
Here is a tentative example of what I would like the dataset to look like:
* Example generated by -dataex-. To install: ssc install dataex
clear
input str4 id str2(school_code_1 school_code_2) str1(school1_area1 school1_area2 school1_area3 school2_area1 school2_area2 school2_area3)
"a111" "1x" "1y" "a" "b" "c" "a" "b" "c"
"x222" "1z" "1k" "d" "e" "f" "g" "h" "i"
end
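For reference, the two-step reshape described above might be sketched as follows. This is only a sketch under stated assumptions, not a tested solution: it assumes the long example data are in memory, that schools and areas should be numbered in the order they appear, and that no school has more than three areas (as in the example). The helper variables obs, first, area_n and school_n are names made up for illustration.

```stata
* Remember the original row order so the numbering follows order of appearance
gen long obs = _n

* Number the areas within each person-school pair (1, 2, 3)
bysort id school_code (obs): gen area_n = _n

* Number the schools within each person by order of first appearance
bysort id school_code (obs): gen long first = obs[1]
bysort id (first obs): gen school_n = sum(school_code != school_code[_n-1])
drop obs first

* Step 1: one observation per person and school
reshape wide area, i(id school_n) j(area_n)

* Rename so that the school number can be placed inside the wide names
rename area* school_area*
rename school_code school_code_

* Step 2: one observation per person; @ marks where school_n is inserted
* (assumes three areas per school, as in the example)
reshape wide school_code_ school@_area1 school@_area2 school@_area3, i(id) j(school_n)
order id school_code_*
```

With a varying number of areas per school, the list of stubs in the second reshape would need to be built from the variables that the first reshape actually creates.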