Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 286 |
Missing cells | 294 |
Missing cells (%) | 20.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 11.3 KiB |
Average record size in memory | 40.5 B |
Variable types
Text | 4 |
---|---|
Categorical | 1 |
Dataset
Description | 키,명칭,행정 시,행정 구,행정 동 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-13021/S/1/datasetView.do |
Reproduction
Analysis started | 2023-12-11 10:13:45.220326 |
---|---|
Analysis finished | 2023-12-11 10:13:45.764708 |
Duration | 0.54 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
키
Text
UNIQUE
 
Distinct | 286 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.4 KiB |
Length
Max length | 12 |
---|---|
Median length | 12 |
Mean length | 12 |
Min length | 12 |
Characters and Unicode
Total characters | 3432 |
---|---|
Distinct characters | 16 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 286 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | BE_IW04-0232 |
---|---|
2nd row | BE_IW04-0233 |
3rd row | BE_IW04-0234 |
4th row | BE_IW04-0235 |
5th row | BE_IW04-0236 |
Value | Count | Frequency (%) |
be_iw04-0232 | 1 | 0.3% |
be_iw04-0133 | 1 | 0.3% |
be_iw04-0139 | 1 | 0.3% |
be_iw04-0138 | 1 | 0.3% |
be_iw04-0137 | 1 | 0.3% |
be_iw04-0136 | 1 | 0.3% |
be_iw04-0135 | 1 | 0.3% |
be_iw04-0143 | 1 | 0.3% |
be_iw04-0132 | 1 | 0.3% |
be_iw04-0141 | 1 | 0.3% |
Other values (276) | 276 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 728 | |
4 | 345 | |
B | 286 | 8.3% |
E | 286 | 8.3% |
_ | 286 | 8.3% |
I | 286 | 8.3% |
W | 286 | 8.3% |
- | 286 | 8.3% |
1 | 159 | 4.6% |
2 | 146 | 4.3% |
Other values (6) | 338 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1716 | |
Uppercase Letter | 1144 | |
Connector Punctuation | 286 | 8.3% |
Dash Punctuation | 286 | 8.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 728 | |
4 | 345 | |
1 | 159 | 9.3% |
2 | 146 | 8.5% |
3 | 59 | 3.4% |
5 | 59 | 3.4% |
6 | 59 | 3.4% |
7 | 58 | 3.4% |
8 | 55 | 3.2% |
9 | 48 | 2.8% |
Uppercase Letter
Value | Count | Frequency (%) |
B | 286 | |
E | 286 | |
I | 286 | |
W | 286 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 286 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 286 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 2288 | |
Latin | 1144 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 728 | |
4 | 345 | |
_ | 286 | 12.5% |
- | 286 | 12.5% |
1 | 159 | 6.9% |
2 | 146 | 6.4% |
3 | 59 | 2.6% |
5 | 59 | 2.6% |
6 | 59 | 2.6% |
7 | 58 | 2.5% |
Other values (2) | 103 | 4.5% |
Latin
Value | Count | Frequency (%) |
B | 286 | |
E | 286 | |
I | 286 | |
W | 286 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 3432 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 728 | |
4 | 345 | |
B | 286 | 8.3% |
E | 286 | 8.3% |
_ | 286 | 8.3% |
I | 286 | 8.3% |
W | 286 | 8.3% |
- | 286 | 8.3% |
1 | 159 | 4.6% |
2 | 146 | 4.3% |
Other values (6) | 338 |
명칭
Text
Distinct | 207 |
---|---|
Distinct (%) | 72.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.4 KiB |
Length
Max length | 46 |
---|---|
Median length | 36 |
Mean length | 25.545455 |
Min length | 7 |
Characters and Unicode
Total characters | 7306 |
---|---|
Distinct characters | 65 |
Distinct categories | 8 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 168 ? |
---|---|
Unique (%) | 58.7% |
Sample
1st row | Sword in the Moon Filming site |
---|---|
2nd row | Sword in the Moon Filming site |
3rd row | Cheonghaejin Port |
4th row | Chowon Photo Studio |
5th row | Filming site |
Value | Count | Frequency (%) |
site | 118 | 9.9% |
filming | 117 | 9.9% |
set | 105 | 8.9% |
for | 103 | 8.7% |
flim | 65 | 5.5% |
films | 33 | 2.8% |
the | 29 | 2.4% |
of | 22 | 1.9% |
studio | 11 | 0.9% |
park | 11 | 0.9% |
Other values (302) | 572 |
Most occurring characters
Value | Count | Frequency (%) |
906 | 12.4% | |
i | 674 | 9.2% |
e | 662 | 9.1% |
n | 455 | 6.2% |
o | 446 | 6.1% |
m | 364 | 5.0% |
s | 360 | 4.9% |
l | 355 | 4.9% |
t | 339 | 4.6% |
a | 335 | 4.6% |
Other values (55) | 2410 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 5575 | |
Space Separator | 906 | 12.4% |
Uppercase Letter | 749 | 10.3% |
Other Punctuation | 25 | 0.3% |
Decimal Number | 22 | 0.3% |
Close Punctuation | 11 | 0.2% |
Open Punctuation | 11 | 0.2% |
Dash Punctuation | 7 | 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
i | 674 | |
e | 662 | |
n | 455 | 8.2% |
o | 446 | 8.0% |
m | 364 | 6.5% |
s | 360 | 6.5% |
l | 355 | 6.4% |
t | 339 | 6.1% |
a | 335 | 6.0% |
g | 321 | 5.8% |
Other values (16) | 1264 |
Uppercase Letter
Value | Count | Frequency (%) |
F | 141 | |
S | 118 | |
G | 54 | 7.2% |
T | 48 | 6.4% |
B | 46 | 6.1% |
M | 39 | 5.2% |
D | 37 | 4.9% |
C | 30 | 4.0% |
H | 28 | 3.7% |
O | 28 | 3.7% |
Other values (13) | 180 |
Decimal Number
Value | Count | Frequency (%) |
1 | 8 | |
2 | 4 | |
0 | 3 | 13.6% |
3 | 3 | 13.6% |
5 | 2 | 9.1% |
9 | 1 | 4.5% |
4 | 1 | 4.5% |
Other Punctuation
Value | Count | Frequency (%) |
' | 23 | |
. | 1 | 4.0% |
& | 1 | 4.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 7 | |
] | 4 |
Open Punctuation
Value | Count | Frequency (%) |
( | 7 | |
[ | 4 |
Space Separator
Value | Count | Frequency (%) |
906 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 7 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 6324 | |
Common | 982 | 13.4% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
i | 674 | 10.7% |
e | 662 | 10.5% |
n | 455 | 7.2% |
o | 446 | 7.1% |
m | 364 | 5.8% |
s | 360 | 5.7% |
l | 355 | 5.6% |
t | 339 | 5.4% |
a | 335 | 5.3% |
g | 321 | 5.1% |
Other values (39) | 2013 |
Common
Value | Count | Frequency (%) |
906 | ||
' | 23 | 2.3% |
1 | 8 | 0.8% |
) | 7 | 0.7% |
- | 7 | 0.7% |
( | 7 | 0.7% |
2 | 4 | 0.4% |
] | 4 | 0.4% |
[ | 4 | 0.4% |
0 | 3 | 0.3% |
Other values (6) | 9 | 0.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 7306 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
906 | 12.4% | |
i | 674 | 9.2% |
e | 662 | 9.1% |
n | 455 | 6.2% |
o | 446 | 6.1% |
m | 364 | 5.0% |
s | 360 | 4.9% |
l | 355 | 4.9% |
t | 339 | 4.6% |
a | 335 | 4.6% |
Other values (55) | 2410 |
행정 시
Categorical
Distinct | 12 |
---|---|
Distinct (%) | 4.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.4 KiB |
<NA> | |
---|---|
Gyeonggi-do | |
Gyeongsangnam-do | |
Gyeongsangbuk-do | |
Jeollanam-do | |
Other values (7) |
Length
Max length | 17 |
---|---|
Median length | 4 |
Mean length | 8.1013986 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Jeollanam-do |
---|---|
2nd row | <NA> |
3rd row | Jeollanam-do |
4th row | <NA> |
5th row | Gyeonggi-do |
Common Values
Value | Count | Frequency (%) |
<NA> | 147 | |
Gyeonggi-do | 20 | 7.0% |
Gyeongsangnam-do | 19 | 6.6% |
Gyeongsangbuk-do | 18 | 6.3% |
Jeollanam-do | 17 | 5.9% |
Jeju-do | 16 | 5.6% |
Gangwon-do | 12 | 4.2% |
Jeollabuk-do | 10 | 3.5% |
Chungcheongbuk-do | 10 | 3.5% |
Chungcheongnam-do | 8 | 2.8% |
Other values (2) | 9 | 3.1% |
Length
Value | Count | Frequency (%) |
na | 147 | |
gyeonggi-do | 20 | 7.0% |
gyeongsangnam-do | 19 | 6.6% |
gyeongsangbuk-do | 18 | 6.3% |
jeollanam-do | 17 | 5.9% |
jeju-do | 16 | 5.6% |
gangwon-do | 12 | 4.2% |
jeollabuk-do | 10 | 3.5% |
chungcheongbuk-do | 10 | 3.5% |
chungcheongnam-do | 8 | 2.8% |
Other values (2) | 9 | 3.1% |
행정 구
Text
MISSING
 
Distinct | 63 |
---|---|
Distinct (%) | 45.3% |
Missing | 147 |
Missing (%) | 51.4% |
Memory size | 2.4 KiB |
Length
Max length | 20 |
---|---|
Median length | 18 |
Mean length | 10.841727 |
Min length | 7 |
Characters and Unicode
Total characters | 1507 |
---|---|
Distinct characters | 38 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 32 ? |
---|---|
Unique (%) | 23.0% |
Sample
1st row | Damyang-gun |
---|---|
2nd row | Wando-gun |
3rd row | Yangju-si |
4th row | Namwon-si |
5th row | Hadong-gun |
Value | Count | Frequency (%) |
namyangju-si | 9 | 6.2% |
jeju-si | 9 | 6.2% |
seogwipo-si | 7 | 4.8% |
sancheon-gun | 7 | 4.8% |
wando-gun | 6 | 4.1% |
jecheon-si | 6 | 4.1% |
pyeongchang-gun | 5 | 3.4% |
mungyeong-si | 4 | 2.7% |
buyeo-gun | 4 | 2.7% |
wonmi-gu | 3 | 2.1% |
Other values (57) | 86 |
Most occurring characters
Value | Count | Frequency (%) |
n | 225 | |
g | 170 | |
- | 146 | |
u | 130 | |
o | 106 | 7.0% |
e | 98 | 6.5% |
a | 98 | 6.5% |
i | 88 | 5.8% |
s | 81 | 5.4% |
h | 46 | 3.1% |
Other values (28) | 319 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 1208 | |
Dash Punctuation | 146 | 9.7% |
Uppercase Letter | 146 | 9.7% |
Space Separator | 7 | 0.5% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
n | 225 | |
g | 170 | |
u | 130 | |
o | 106 | |
e | 98 | |
a | 98 | |
i | 88 | 7.3% |
s | 81 | 6.7% |
h | 46 | 3.8% |
c | 33 | 2.7% |
Other values (9) | 133 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 22 | |
J | 22 | |
N | 14 | |
B | 14 | |
G | 11 | |
Y | 11 | |
W | 9 | |
H | 8 | 5.5% |
P | 7 | 4.8% |
C | 6 | 4.1% |
Other values (7) | 22 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 146 |
Space Separator
Value | Count | Frequency (%) |
7 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 1354 | |
Common | 153 | 10.2% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
n | 225 | |
g | 170 | |
u | 130 | |
o | 106 | 7.8% |
e | 98 | 7.2% |
a | 98 | 7.2% |
i | 88 | 6.5% |
s | 81 | 6.0% |
h | 46 | 3.4% |
c | 33 | 2.4% |
Other values (26) | 279 |
Common
Value | Count | Frequency (%) |
- | 146 | |
7 | 4.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1507 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
n | 225 | |
g | 170 | |
- | 146 | |
u | 130 | |
o | 106 | 7.0% |
e | 98 | 6.5% |
a | 98 | 6.5% |
i | 88 | 5.8% |
s | 81 | 5.4% |
h | 46 | 3.1% |
Other values (28) | 319 |
행정 동
Text
MISSING
 
Distinct | 91 |
---|---|
Distinct (%) | 65.5% |
Missing | 147 |
Missing (%) | 51.4% |
Memory size | 2.4 KiB |
Length
Max length | 19 |
---|---|
Median length | 17 |
Mean length | 11.935252 |
Min length | 7 |
Characters and Unicode
Total characters | 1659 |
---|---|
Distinct characters | 42 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 64 ? |
---|---|
Unique (%) | 46.0% |
Sample
1st row | Geumseong-myeon |
---|---|
2nd row | Wando-eup |
3rd row | Yangju2-dong |
4th row | Noam-dong |
5th row | Jingyo-myeon |
Value | Count | Frequency (%) |
joan-myeon | 8 | 5.8% |
chahwang-myeon | 6 | 4.3% |
geumseong-myeon | 4 | 2.9% |
daegwallyeong-myeon | 4 | 2.9% |
gujwa-eup | 4 | 2.9% |
chunghwa-myeon | 3 | 2.2% |
mungyeong-eup | 3 | 2.2% |
sang3-dong | 3 | 2.2% |
wando-eup | 3 | 2.2% |
bukdo-myeon | 3 | 2.2% |
Other values (81) | 98 |
Most occurring characters
Value | Count | Frequency (%) |
n | 243 | |
o | 194 | |
e | 187 | |
- | 139 | |
g | 112 | 6.8% |
y | 112 | 6.8% |
m | 108 | 6.5% |
a | 101 | 6.1% |
u | 95 | 5.7% |
p | 38 | 2.3% |
Other values (32) | 330 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 1374 | |
Dash Punctuation | 139 | 8.4% |
Uppercase Letter | 139 | 8.4% |
Decimal Number | 7 | 0.4% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
n | 243 | |
o | 194 | |
e | 187 | |
g | 112 | |
y | 112 | |
m | 108 | |
a | 101 | |
u | 95 | 6.9% |
p | 38 | 2.8% |
h | 32 | 2.3% |
Other values (11) | 152 |
Uppercase Letter
Value | Count | Frequency (%) |
G | 18 | |
J | 17 | |
S | 16 | |
C | 15 | |
B | 15 | |
Y | 11 | |
D | 8 | |
H | 7 | 5.0% |
N | 7 | 5.0% |
W | 6 | 4.3% |
Other values (7) | 19 |
Decimal Number
Value | Count | Frequency (%) |
1 | 3 | |
3 | 3 | |
2 | 1 | 14.3% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 139 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 1513 | |
Common | 146 | 8.8% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
n | 243 | |
o | 194 | |
e | 187 | |
g | 112 | 7.4% |
y | 112 | 7.4% |
m | 108 | 7.1% |
a | 101 | 6.7% |
u | 95 | 6.3% |
p | 38 | 2.5% |
h | 32 | 2.1% |
Other values (28) | 291 |
Common
Value | Count | Frequency (%) |
- | 139 | |
1 | 3 | 2.1% |
3 | 3 | 2.1% |
2 | 1 | 0.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1659 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
n | 243 | |
o | 194 | |
e | 187 | |
- | 139 | |
g | 112 | 6.8% |
y | 112 | 6.8% |
m | 108 | 6.5% |
a | 101 | 6.1% |
u | 95 | 5.7% |
p | 38 | 2.3% |
Other values (32) | 330 |
행정 시 | 행정 구 | 행정 동 | |
---|---|---|---|
행정 시 | 1.000 | 1.000 | 0.999 |
행정 구 | 1.000 | 1.000 | 1.000 |
행정 동 | 0.999 | 1.000 | 1.000 |
키 | 명칭 | 행정 시 | 행정 구 | 행정 동 | |
---|---|---|---|---|---|
0 | BE_IW04-0232 | Sword in the Moon Filming site | Jeollanam-do | Damyang-gun | Geumseong-myeon |
1 | BE_IW04-0233 | Sword in the Moon Filming site | <NA> | <NA> | <NA> |
2 | BE_IW04-0234 | Cheonghaejin Port | Jeollanam-do | Wando-gun | Wando-eup |
3 | BE_IW04-0235 | Chowon Photo Studio | <NA> | <NA> | <NA> |
4 | BE_IW04-0236 | Filming site | Gyeonggi-do | Yangju-si | Yangju2-dong |
5 | BE_IW04-0237 | Chunhyangdyeon Filming site | <NA> | <NA> | <NA> |
6 | BE_IW04-0238 | Chunhyangdyeon Filming site | Jeollabuk-do | Namwon-si | Noam-dong |
7 | BE_IW04-0239 | Chwihwaseon Filming site | Gyeongsangnam-do | Hadong-gun | Jingyo-myeon |
8 | BE_IW04-0240 | Chwihwaseon Filming site | Jeollanam-do | Suncheon-si | Seungju-eup |
9 | BE_IW04-0241 | Friend Filming site | Busan | Saha-gu | Hadan1-dong |
키 | 명칭 | 행정 시 | 행정 구 | 행정 동 | |
---|---|---|---|---|---|
276 | BE_IW04-0222 | Jukseong Filming site | <NA> | <NA> | <NA> |
277 | BE_IW04-0223 | Jurassic studio | <NA> | <NA> | <NA> |
278 | BE_IW04-0224 | Stairway to Heaven Filming site | Incheon | Jung-gu | Yongyu-dong |
279 | BE_IW04-0225 | Heaven's Soldiers | Gyeongsangnam-do | Sancheon-gun | Chahwang-myeon |
280 | BE_IW04-0226 | Beyond the Years Filming site | Jeollanam-do | Jangheung-gun | Hoejin-myeon |
281 | BE_IW04-0227 | Beyond the Years Filming site | <NA> | <NA> | <NA> |
282 | BE_IW04-0228 | Cheongseokkol Filming site | <NA> | <NA> | <NA> |
283 | BE_IW04-0229 | Springtime Filming site | Gyeongsangnam-do | Hadong-gun | Agyang-myeon |
284 | BE_IW04-0230 | Springtime Filming site | <NA> | <NA> | <NA> |
285 | BE_IW04-0231 | Springtime Filming site | <NA> | <NA> | <NA> |