Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 1104 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 43.3 KiB |
Average record size in memory | 40.1 B |
Variable types
Text | 3 |
---|---|
Categorical | 2 |
Dataset
Description | 키,명칭,행정시,행정구,행정동 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-13045/S/1/datasetView.do |
Reproduction
Analysis started | 2023-12-11 07:34:53.822531 |
---|---|
Analysis finished | 2023-12-11 07:34:54.631657 |
Duration | 0.81 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
키
Text
UNIQUE
 
Distinct | 1104 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 8.8 KiB |
Length
Max length | 12 |
---|---|
Median length | 12 |
Mean length | 12 |
Min length | 12 |
Characters and Unicode
Total characters | 13248 |
---|---|
Distinct characters | 16 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 1104 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | BE_IW16-0511 |
---|---|
2nd row | BE_IW16-1049 |
3rd row | BE_IW16-1050 |
4th row | BE_IW16-1051 |
5th row | BE_IW16-1052 |
Value | Count | Frequency (%) |
be_iw16-0511 | 1 | 0.1% |
be_iw16-0968 | 1 | 0.1% |
be_iw16-0963 | 1 | 0.1% |
be_iw16-0964 | 1 | 0.1% |
be_iw16-0965 | 1 | 0.1% |
be_iw16-0966 | 1 | 0.1% |
be_iw16-0967 | 1 | 0.1% |
be_iw16-0960 | 1 | 0.1% |
be_iw16-0970 | 1 | 0.1% |
be_iw16-0959 | 1 | 0.1% |
Other values (1094) | 1094 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 1535 | |
6 | 1424 | |
0 | 1422 | |
B | 1104 | |
E | 1104 | |
_ | 1104 | |
I | 1104 | |
W | 1104 | |
- | 1104 | |
2 | 321 | 2.4% |
Other values (6) | 1922 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 6624 | |
Uppercase Letter | 4416 | |
Connector Punctuation | 1104 | 8.3% |
Dash Punctuation | 1104 | 8.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 1535 | |
6 | 1424 | |
0 | 1422 | |
2 | 321 | 4.8% |
4 | 321 | 4.8% |
3 | 321 | 4.8% |
5 | 320 | 4.8% |
9 | 320 | 4.8% |
8 | 320 | 4.8% |
7 | 320 | 4.8% |
Uppercase Letter
Value | Count | Frequency (%) |
B | 1104 | |
E | 1104 | |
I | 1104 | |
W | 1104 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1104 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1104 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 8832 | |
Latin | 4416 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 1535 | |
6 | 1424 | |
0 | 1422 | |
_ | 1104 | |
- | 1104 | |
2 | 321 | 3.6% |
4 | 321 | 3.6% |
3 | 321 | 3.6% |
5 | 320 | 3.6% |
9 | 320 | 3.6% |
Other values (2) | 640 |
Latin
Value | Count | Frequency (%) |
B | 1104 | |
E | 1104 | |
I | 1104 | |
W | 1104 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 13248 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 1535 | |
6 | 1424 | |
0 | 1422 | |
B | 1104 | |
E | 1104 | |
_ | 1104 | |
I | 1104 | |
W | 1104 | |
- | 1104 | |
2 | 321 | 2.4% |
Other values (6) | 1922 |
명칭
Text
Distinct | 706 |
---|---|
Distinct (%) | 63.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 8.8 KiB |
Value | Count | Frequency (%) |
31 | 2.8% | |
村 | 20 | 1.8% |
全州餐 | 19 | 1.7% |
家 | 11 | 1.0% |
村小屋 | 11 | 1.0% |
餐 | 10 | 0.9% |
南原泥 | 9 | 0.8% |
麻浦排骨 | 8 | 0.7% |
老村子 | 8 | 0.7% |
柳?家 | 7 | 0.6% |
Other values (667) | 974 |
Most occurring characters
Value | Count | Frequency (%) |
? | 1159 | |
家 | 111 | 2.7% |
屋 | 93 | 2.2% |
餐 | 87 | 2.1% |
村 | 86 | 2.1% |
山 | 67 | 1.6% |
店 | 54 | 1.3% |
大 | 46 | 1.1% |
牛 | 36 | 0.9% |
原 | 36 | 0.9% |
Other values (511) | 2360 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 2780 | |
Other Punctuation | 1159 | |
Lowercase Letter | 107 | 2.6% |
Uppercase Letter | 33 | 0.8% |
Close Punctuation | 20 | 0.5% |
Open Punctuation | 20 | 0.5% |
Space Separator | 10 | 0.2% |
Dash Punctuation | 3 | 0.1% |
Decimal Number | 3 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
家 | 111 | 4.0% |
屋 | 93 | 3.3% |
餐 | 87 | 3.1% |
村 | 86 | 3.1% |
山 | 67 | 2.4% |
店 | 54 | 1.9% |
大 | 46 | 1.7% |
牛 | 36 | 1.3% |
原 | 36 | 1.3% |
南 | 36 | 1.3% |
Other values (468) | 2128 |
Lowercase Letter
Value | Count | Frequency (%) |
a | 19 | |
i | 17 | |
r | 9 | |
n | 8 | |
o | 8 | |
m | 6 | 5.6% |
g | 6 | 5.6% |
s | 6 | 5.6% |
u | 5 | 4.7% |
y | 4 | 3.7% |
Other values (8) | 19 |
Uppercase Letter
Value | Count | Frequency (%) |
G | 6 | |
M | 5 | |
K | 3 | |
D | 3 | |
Y | 3 | |
P | 2 | 6.1% |
L | 2 | 6.1% |
B | 2 | 6.1% |
A | 2 | 6.1% |
R | 1 | 3.0% |
Other values (4) | 4 |
Decimal Number
Value | Count | Frequency (%) |
6 | 1 | |
1 | 1 | |
9 | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 17 | |
) | 3 | 15.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 17 | |
( | 3 | 15.0% |
Space Separator
Value | Count | Frequency (%) |
7 | ||
3 |
Other Punctuation
Value | Count | Frequency (%) |
? | 1159 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Han | 2767 | |
Common | 1215 | |
Latin | 140 | 3.4% |
Hangul | 13 | 0.3% |
Most frequent character per script
Han
Value | Count | Frequency (%) |
家 | 111 | 4.0% |
屋 | 93 | 3.4% |
餐 | 87 | 3.1% |
村 | 86 | 3.1% |
山 | 67 | 2.4% |
店 | 54 | 2.0% |
大 | 46 | 1.7% |
牛 | 36 | 1.3% |
原 | 36 | 1.3% |
南 | 36 | 1.3% |
Other values (463) | 2115 |
Latin
Value | Count | Frequency (%) |
a | 19 | 13.6% |
i | 17 | 12.1% |
r | 9 | 6.4% |
n | 8 | 5.7% |
o | 8 | 5.7% |
m | 6 | 4.3% |
g | 6 | 4.3% |
s | 6 | 4.3% |
G | 6 | 4.3% |
M | 5 | 3.6% |
Other values (22) | 50 |
Common
Value | Count | Frequency (%) |
? | 1159 | |
) | 17 | 1.4% |
( | 17 | 1.4% |
7 | 0.6% | |
) | 3 | 0.2% |
( | 3 | 0.2% |
- | 3 | 0.2% |
3 | 0.2% | |
6 | 1 | 0.1% |
1 | 1 | 0.1% |
Hangul
Value | Count | Frequency (%) |
쎱 | 9 | |
슲 | 1 | 7.7% |
싦 | 1 | 7.7% |
쑁 | 1 | 7.7% |
쒦 | 1 | 7.7% |
Most occurring blocks
Value | Count | Frequency (%) |
CJK | 2762 | |
ASCII | 1346 | |
Hangul | 13 | 0.3% |
None | 9 | 0.2% |
CJK Compat Ideographs | 5 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
? | 1159 | |
a | 19 | 1.4% |
i | 17 | 1.3% |
) | 17 | 1.3% |
( | 17 | 1.3% |
r | 9 | 0.7% |
n | 8 | 0.6% |
o | 8 | 0.6% |
7 | 0.5% | |
m | 6 | 0.4% |
Other values (30) | 79 | 5.9% |
CJK
Value | Count | Frequency (%) |
家 | 111 | 4.0% |
屋 | 93 | 3.4% |
餐 | 87 | 3.1% |
村 | 86 | 3.1% |
山 | 67 | 2.4% |
店 | 54 | 2.0% |
大 | 46 | 1.7% |
牛 | 36 | 1.3% |
原 | 36 | 1.3% |
南 | 36 | 1.3% |
Other values (459) | 2110 |
Hangul
Value | Count | Frequency (%) |
쎱 | 9 | |
슲 | 1 | 7.7% |
싦 | 1 | 7.7% |
쑁 | 1 | 7.7% |
쒦 | 1 | 7.7% |
None
Value | Count | Frequency (%) |
) | 3 | |
( | 3 | |
3 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
金 | 2 | |
寧 | 1 | |
老 | 1 | |
李 | 1 |
행정시
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 8.8 KiB |
首?特?市 |
---|
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 首?特?市 |
---|---|
2nd row | 首?特?市 |
3rd row | 首?特?市 |
4th row | 首?特?市 |
5th row | 首?特?市 |
Common Values
Value | Count | Frequency (%) |
首?特?市 | 1104 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
首?特?市 | 1104 |
행정구
Categorical
Distinct | 25 |
---|---|
Distinct (%) | 2.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 8.8 KiB |
江南? | |
---|---|
瑞草? | |
?路? | |
江北? | |
中? | 58 |
Other values (20) |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.0452899 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 中浪? |
---|---|
2nd row | 江?? |
3rd row | 瑞草? |
4th row | ?路? |
5th row | 九老? |
Common Values
Value | Count | Frequency (%) |
江南? | 153 | 13.9% |
瑞草? | 93 | 8.4% |
?路? | 82 | 7.4% |
江北? | 67 | 6.1% |
中? | 58 | 5.3% |
松坡? | 57 | 5.2% |
麻浦? | 49 | 4.4% |
永登浦? | 45 | 4.1% |
冠岳? | 43 | 3.9% |
城北? | 42 | 3.8% |
Other values (15) | 415 |
Length
Value | Count | Frequency (%) |
江南 | 153 | 13.9% |
瑞草 | 93 | 8.4% |
路 | 82 | 7.4% |
江北 | 67 | 6.1% |
中 | 58 | 5.3% |
松坡 | 57 | 5.2% |
麻浦 | 49 | 4.4% |
永登浦 | 45 | 4.1% |
冠岳 | 43 | 3.9% |
城北 | 42 | 3.8% |
Other values (15) | 415 |
행정동
Text
Distinct | 300 |
---|---|
Distinct (%) | 27.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 8.8 KiB |
Value | Count | Frequency (%) |
牛耳洞 | 31 | 2.8% |
三1洞 | 25 | 2.3% |
路1.2.3.4街洞 | 25 | 2.3% |
洞 | 23 | 2.1% |
瑞草3洞 | 20 | 1.8% |
淸潭洞 | 20 | 1.8% |
2洞 | 20 | 1.8% |
谷洞 | 17 | 1.5% |
明洞 | 16 | 1.4% |
1洞 | 14 | 1.3% |
Other values (287) | 893 |
Most occurring characters
Value | Count | Frequency (%) |
洞 | 1104 | |
? | 566 | 13.6% |
1 | 254 | 6.1% |
2 | 213 | 5.1% |
3 | 101 | 2.4% |
. | 85 | 2.0% |
4 | 74 | 1.8% |
新 | 71 | 1.7% |
三 | 58 | 1.4% |
谷 | 51 | 1.2% |
Other values (171) | 1596 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 2843 | |
Decimal Number | 679 | 16.3% |
Other Punctuation | 651 | 15.6% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
洞 | 1104 | |
新 | 71 | 2.5% |
三 | 58 | 2.0% |
谷 | 51 | 1.8% |
路 | 41 | 1.4% |
瑞 | 40 | 1.4% |
街 | 39 | 1.4% |
草 | 39 | 1.4% |
大 | 36 | 1.3% |
方 | 32 | 1.1% |
Other values (161) | 1332 |
Decimal Number
Value | Count | Frequency (%) |
1 | 254 | |
2 | 213 | |
3 | 101 | 14.9% |
4 | 74 | 10.9% |
5 | 18 | 2.7% |
6 | 13 | 1.9% |
7 | 4 | 0.6% |
8 | 2 | 0.3% |
Other Punctuation
Value | Count | Frequency (%) |
? | 566 | |
. | 85 | 13.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Han | 2826 | |
Common | 1330 | |
Hangul | 17 | 0.4% |
Most frequent character per script
Han
Value | Count | Frequency (%) |
洞 | 1104 | |
新 | 71 | 2.5% |
三 | 58 | 2.1% |
谷 | 51 | 1.8% |
路 | 41 | 1.5% |
瑞 | 40 | 1.4% |
街 | 39 | 1.4% |
草 | 39 | 1.4% |
大 | 36 | 1.3% |
方 | 32 | 1.1% |
Other values (158) | 1315 |
Common
Value | Count | Frequency (%) |
? | 566 | |
1 | 254 | |
2 | 213 | 16.0% |
3 | 101 | 7.6% |
. | 85 | 6.4% |
4 | 74 | 5.6% |
5 | 18 | 1.4% |
6 | 13 | 1.0% |
7 | 4 | 0.3% |
8 | 2 | 0.2% |
Hangul
Value | Count | Frequency (%) |
쒧 | 11 | |
쑿 | 5 | |
씉 | 1 | 5.9% |
Most occurring blocks
Value | Count | Frequency (%) |
CJK | 2826 | |
ASCII | 1330 | |
Hangul | 17 | 0.4% |
Most frequent character per block
CJK
Value | Count | Frequency (%) |
洞 | 1104 | |
新 | 71 | 2.5% |
三 | 58 | 2.1% |
谷 | 51 | 1.8% |
路 | 41 | 1.5% |
瑞 | 40 | 1.4% |
街 | 39 | 1.4% |
草 | 39 | 1.4% |
大 | 36 | 1.3% |
方 | 32 | 1.1% |
Other values (158) | 1315 |
ASCII
Value | Count | Frequency (%) |
? | 566 | |
1 | 254 | |
2 | 213 | 16.0% |
3 | 101 | 7.6% |
. | 85 | 6.4% |
4 | 74 | 5.6% |
5 | 18 | 1.4% |
6 | 13 | 1.0% |
7 | 4 | 0.3% |
8 | 2 | 0.2% |
Hangul
Value | Count | Frequency (%) |
쒧 | 11 | |
쑿 | 5 | |
씉 | 1 | 5.9% |
키 | 명칭 | 행정시 | 행정구 | 행정동 | |
---|---|---|---|---|---|
0 | BE_IW16-0511 | 松林亭 | 首?特?市 | 中浪? | 面牧4洞 |
1 | BE_IW16-1049 | ?村 | 首?特?市 | 江?? | 高?1洞 |
2 | BE_IW16-1050 | ?村 | 首?特?市 | 瑞草? | 瑞草3洞 |
3 | BE_IW16-1051 | ?代花? | 首?特?市 | ?路? | 嘉?洞 |
4 | BE_IW16-1052 | ?代花? | 首?特?市 | 九老? | 九老1洞 |
5 | BE_IW16-1053 | ?代水? | 首?特?市 | 中? | 中林洞 |
6 | BE_IW16-1054 | 兄弟 | 首?特?市 | 城?? | ?水1街2洞 |
7 | BE_IW16-1055 | 兄弟?排骨 | 首?特?市 | 江?? | 千?2洞 |
8 | BE_IW16-1056 | 兄弟餐? | 首?特?市 | ?大?? | 踏十里2洞 |
9 | BE_IW16-1057 | 兄弟餐? | 首?特?市 | 江北? | 松川洞 |
키 | 명칭 | 행정시 | 행정구 | 행정동 | |
---|---|---|---|---|---|
1094 | BE_IW16-0501 | ?家 | 首?特?市 | 城北? | 城北洞 |
1095 | BE_IW16-0502 | ??餐? | 首?特?市 | 江北? | 仁水洞 |
1096 | BE_IW16-0503 | 松林巷 | 首?特?市 | ?雀? | 大方洞 |
1097 | BE_IW16-0504 | 松田木炭排骨 | 首?特?市 | 江北? | 松川洞 |
1098 | BE_IW16-0505 | 松香 | 首?特?市 | 永登浦? | ?坪2洞 |
1099 | BE_IW16-0506 | 松香 | 首?特?市 | ?山? | 元?路1洞 |
1100 | BE_IW16-0507 | 高杆旅? | 首?特?市 | ?路? | ?化洞 |
1101 | BE_IW16-0508 | 松潭泥?? | 首?特?市 | 麻浦? | 延南洞 |
1102 | BE_IW16-0509 | ??餐? | 首?特?市 | ?大?? | ?凉里洞 |
1103 | BE_IW16-0510 | 松林?店 | 首?特?市 | ?津? | 紫?3洞 |