Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 1494 |
Missing cells | 1 |
Missing cells (%) | < 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 71.6 KiB |
Average record size in memory | 49.1 B |
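The derived figures in this table follow from the raw counts. The sketch below recomputes them in plain Python, using only the values reported above; the variable names are illustrative, and the memory total is the already-rounded 71.6 KiB, so the result is approximate.

```python
# Figures copied from the "Dataset statistics" table above.
n_variables = 6
n_observations = 1494
missing_cells = 1
total_size_kib = 71.6  # reported value, already rounded

# Every observation has one cell per variable.
total_cells = n_variables * n_observations

# Share of missing cells, in percent (the report shows this as "< 0.1%").
missing_pct = 100 * missing_cells / total_cells

# Average bytes per record: total bytes divided by the number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.3f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```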
Variable types
Text | 3 |
---|---|
Numeric | 1 |
Categorical | 2 |
Dataset
Description | 키값 (key), 등록번호 (registration number), 상호 (business name), 행정시 (city), 행정구 (district), 행정동 (neighborhood) |
---|---|
Author | Seoul Metropolitan Government (서울특별시) |
URL | https://data.seoul.go.kr/dataList/OA-13040/S/1/datasetView.do |
Reproduction
Analysis started | 2024-04-06 11:50:52.545190 |
---|---|
Analysis finished | 2024-04-06 11:50:54.858602 |
Duration | 2.31 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
키값
Text
UNIQUE
Distinct | 1494 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.8 KiB |
Length
Max length | 14 |
---|---|
Median length | 14 |
Mean length | 14 |
Min length | 14 |
Characters and Unicode
Total characters | 20916 |
---|---|
Distinct characters | 18 |
Distinct categories | 5 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 1494 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | BE_LiST21-0595 |
---|---|
2nd row | BE_LiST21-0596 |
3rd row | BE_LiST21-0597 |
4th row | BE_LiST21-0598 |
5th row | BE_LiST21-0599 |
Value | Count | Frequency (%) |
be_list21-0595 | 1 | 0.1% |
be_list21-0403 | 1 | 0.1% |
be_list21-0412 | 1 | 0.1% |
be_list21-0411 | 1 | 0.1% |
be_list21-0410 | 1 | 0.1% |
be_list21-0409 | 1 | 0.1% |
be_list21-0408 | 1 | 0.1% |
be_list21-0407 | 1 | 0.1% |
be_list21-0406 | 1 | 0.1% |
be_list21-0405 | 1 | 0.1% |
Other values (1484) | 1484 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 2489 | 11.9% |
2 | 1994 | 9.5% |
0 | 1496 | 7.2% |
B | 1494 | 7.1% |
T | 1494 | 7.1% |
E | 1494 | 7.1% |
- | 1494 | 7.1% |
S | 1494 | 7.1% |
i | 1494 | 7.1% |
L | 1494 | 7.1% |
Other values (8) | 4479 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 8964 | 42.9% |
Uppercase Letter | 7470 | 35.7% |
Dash Punctuation | 1494 | 7.1% |
Lowercase Letter | 1494 | 7.1% |
Connector Punctuation | 1494 | 7.1% |
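The category counts above can be reproduced from a single key. The sketch below classifies each character with Python's standard `unicodedata.category` and scales by the row count; it assumes every key follows the same `BE_LiST21-NNNN` pattern as the sample rows, which the uniform length of 14 suggests but does not prove.

```python
import unicodedata
from collections import Counter

sample_key = "BE_LiST21-0595"  # first sample row of 키값
n_rows = 1494                  # number of observations in the report

# Count Unicode general categories within one key:
# Lu = uppercase letter, Ll = lowercase letter, Nd = decimal number,
# Pc = connector punctuation, Pd = dash punctuation.
per_key = Counter(unicodedata.category(ch) for ch in sample_key)

# Assumption: all keys share this layout, so column totals scale linearly.
column_totals = {cat: count * n_rows for cat, count in per_key.items()}

for cat, total in sorted(column_totals.items()):
    print(cat, total)
```

Under that assumption the totals match the table: 8964 decimal numbers, 7470 uppercase letters, and 1494 each for the remaining three categories.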
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 2489 | |
2 | 1994 | |
0 | 1496 | |
3 | 500 | 5.6% |
4 | 495 | 5.5% |
5 | 399 | 4.5% |
6 | 399 | 4.5% |
8 | 399 | 4.5% |
7 | 399 | 4.5% |
9 | 394 | 4.4% |
Uppercase Letter
Value | Count | Frequency (%) |
B | 1494 | |
T | 1494 | |
E | 1494 | |
S | 1494 | |
L | 1494 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1494 |
Lowercase Letter
Value | Count | Frequency (%) |
i | 1494 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1494 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 11952 | |
Latin | 8964 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 2489 | |
2 | 1994 | |
0 | 1496 | |
- | 1494 | |
_ | 1494 | |
3 | 500 | 4.2% |
4 | 495 | 4.1% |
5 | 399 | 3.3% |
6 | 399 | 3.3% |
8 | 399 | 3.3% |
Other values (2) | 793 | 6.6% |
Latin
Value | Count | Frequency (%) |
B | 1494 | |
T | 1494 | |
E | 1494 | |
S | 1494 | |
i | 1494 | |
L | 1494 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 20916 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 2489 | 11.9% |
2 | 1994 | 9.5% |
0 | 1496 | 7.2% |
B | 1494 | 7.1% |
T | 1494 | 7.1% |
E | 1494 | 7.1% |
- | 1494 | 7.1% |
S | 1494 | 7.1% |
i | 1494 | 7.1% |
L | 1494 | 7.1% |
Other values (8) | 4479 |
등록번호
Real number (ℝ)
UNIQUE
Distinct | 1494 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2337.7544 |
Minimum | 1 |
---|---|
Maximum | 9998 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 13.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 184.3 |
Q1 | 1364 |
median | 2577.5 |
Q3 | 3369.75 |
95-th percentile | 3988.7 |
Maximum | 9998 |
Range | 9997 |
Interquartile range (IQR) | 2005.75 |
Descriptive statistics
Standard deviation | 1237.5948 |
---|---|
Coefficient of variation (CV) | 0.52939472 |
Kurtosis | -0.19297006 |
Mean | 2337.7544 |
Median Absolute Deviation (MAD) | 947 |
Skewness | -0.22293602 |
Sum | 3492605 |
Variance | 1531640.9 |
Monotonicity | Not monotonic |
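Several of the statistics above are simple functions of the others. As a consistency check, the sketch below recomputes the mean, coefficient of variation, interquartile range, range, and variance from the reported values; small differences are expected because the inputs are rounded.

```python
# Values copied from the 등록번호 statistics above.
n = 1494
total = 3492605          # Sum
std = 1237.5948          # Standard deviation (rounded in the report)
q1, q3 = 1364, 3369.75   # Q1 and Q3
minimum, maximum = 1, 9998

mean = total / n                  # arithmetic mean
cv = std / mean                   # coefficient of variation
iqr = q3 - q1                     # interquartile range
value_range = maximum - minimum   # range
variance = std ** 2               # variance is the squared standard deviation

print(f"mean={mean:.4f} cv={cv:.8f} iqr={iqr} range={value_range} variance={variance:.1f}")
```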
Value | Count | Frequency (%) |
1884 | 1 | 0.1% |
1160 | 1 | 0.1% |
3258 | 1 | 0.1% |
3256 | 1 | 0.1% |
1202 | 1 | 0.1% |
1197 | 1 | 0.1% |
1195 | 1 | 0.1% |
1186 | 1 | 0.1% |
1175 | 1 | 0.1% |
1173 | 1 | 0.1% |
Other values (1484) | 1484 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
4 | 1 | |
16 | 1 | |
20 | 1 | |
21 | 1 | |
22 | 1 | |
24 | 1 | |
25 | 1 | |
27 | 1 |
Value | Count | Frequency (%) |
9998 | 1 | |
4138 | 1 | |
4137 | 1 | |
4136 | 1 | |
4135 | 1 | |
4133 | 1 | |
4132 | 1 | |
4127 | 1 | |
4126 | 1 | |
4125 | 1 |
상호
Text
Distinct | 1415 |
---|---|
Distinct (%) | 94.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.8 KiB |
Length
Max length | 75 |
---|---|
Median length | 59 |
Mean length | 24.661312 |
Min length | 6 |
Characters and Unicode
Total characters | 36844 |
---|---|
Distinct characters | 70 |
Distinct categories | 8 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 1358 |
---|---|
Unique (%) | 90.9% |
Sample
1st row | Seoul Lee Geon Dental Clinic |
---|---|
2nd row | Mokhuri Oriental Medicine Hospital |
3rd row | My D Dermatology Clinic |
4th row | Ever M Dental Clinic |
5th row | Seoul Mirae Hospital |
Value | Count | Frequency (%) |
clinic | 957 | 18.0% |
surgery | 358 | 6.7% |
plastic | 324 | 6.1% |
dental | 286 | 5.4% |
medicine | 180 | 3.4% |
oriental | 154 | 2.9% |
hospital | 151 | 2.8% |
dermatology | 109 | 2.0% |
seoul | 79 | 1.5% |
gangnam | 62 | 1.2% |
Other values (1301) | 2666 |
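The table above counts lowercased words rather than whole names. A minimal sketch of that kind of count, applied to the five sample rows only, is shown below; splitting on whitespace is an assumption, since the report does not state which tokenizer ydata-profiling uses.

```python
from collections import Counter

# The five sample rows of 상호 shown above.
sample_names = [
    "Seoul Lee Geon Dental Clinic",
    "Mokhuri Oriental Medicine Hospital",
    "My D Dermatology Clinic",
    "Ever M Dental Clinic",
    "Seoul Mirae Hospital",
]

# Lowercase every whitespace-separated token and count occurrences.
words = Counter(
    word.lower()
    for name in sample_names
    for word in name.split()
)

for word, count in words.most_common(4):
    print(word, count)
```

Even on five rows, "clinic" is the most frequent token, consistent with its 18.0% share in the full column.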
Most occurring characters
Value | Count | Frequency (%) |
(space) | 3954 | 10.7% |
i | 3753 | 10.2% |
n | 3100 | 8.4% |
e | 2855 | 7.7% |
l | 2572 | 7.0% |
a | 2342 | 6.4% |
c | 1714 | 4.7% |
r | 1609 | 4.4% |
o | 1578 | 4.3% |
t | 1483 | 4.0% |
Other values (60) | 11884 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 27091 | |
Uppercase Letter | 5494 | 14.9% |
Space Separator | 3954 | 10.7% |
Other Punctuation | 153 | 0.4% |
Dash Punctuation | 96 | 0.3% |
Decimal Number | 44 | 0.1% |
Open Punctuation | 6 | < 0.1% |
Close Punctuation | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
i | 3753 | |
n | 3100 | |
e | 2855 | |
l | 2572 | |
a | 2342 | |
c | 1714 | 6.3% |
r | 1609 | 5.9% |
o | 1578 | 5.8% |
t | 1483 | 5.5% |
g | 1150 | 4.2% |
Other values (16) | 4935 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 1170 | |
S | 724 | |
D | 530 | |
M | 448 | 8.2% |
P | 428 | 7.8% |
H | 286 | 5.2% |
O | 227 | 4.1% |
G | 200 | 3.6% |
Y | 149 | 2.7% |
B | 130 | 2.4% |
Other values (15) | 1202 |
Decimal Number
Value | Count | Frequency (%) |
6 | 9 | |
3 | 8 | |
1 | 6 | |
2 | 5 | |
5 | 5 | |
7 | 4 | |
8 | 3 | 6.8% |
0 | 2 | 4.5% |
9 | 2 | 4.5% |
Other Punctuation
Value | Count | Frequency (%) |
' | 53 | |
& | 49 | |
. | 32 | |
, | 11 | 7.2% |
? | 7 | 4.6% |
: | 1 | 0.7% |
Space Separator
Value | Count | Frequency (%) |
(space) | 3954 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 96 |
Open Punctuation
Value | Count | Frequency (%) |
( | 6 |
Close Punctuation
Value | Count | Frequency (%) |
) | 6 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 32585 | |
Common | 4259 | 11.6% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
i | 3753 | 11.5% |
n | 3100 | 9.5% |
e | 2855 | 8.8% |
l | 2572 | 7.9% |
a | 2342 | 7.2% |
c | 1714 | 5.3% |
r | 1609 | 4.9% |
o | 1578 | 4.8% |
t | 1483 | 4.6% |
C | 1170 | 3.6% |
Other values (41) | 10409 |
Common
Value | Count | Frequency (%) |
(space) | 3954 | 92.8% |
- | 96 | 2.3% |
' | 53 | 1.2% |
& | 49 | 1.2% |
. | 32 | 0.8% |
, | 11 | 0.3% |
6 | 9 | 0.2% |
3 | 8 | 0.2% |
? | 7 | 0.2% |
( | 6 | 0.1% |
Other values (9) | 34 | 0.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 36844 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 3954 | 10.7% |
i | 3753 | 10.2% |
n | 3100 | 8.4% |
e | 2855 | 7.7% |
l | 2572 | 7.0% |
a | 2342 | 6.4% |
c | 1714 | 4.7% |
r | 1609 | 4.4% |
o | 1578 | 4.3% |
t | 1483 | 4.0% |
Other values (60) | 11884 |
행정시
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 3 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.8 KiB |
Seoul | 1491 |
---|---|
Gyeonggi-do | 2 |
<NA> | 1 |
Length
Max length | 11 |
---|---|
Median length | 5 |
Mean length | 5.0073628 |
Min length | 4 |
Unique
Unique | 1 |
---|---|
Unique (%) | 0.1% |
Sample
1st row | Seoul |
---|---|
2nd row | Seoul |
3rd row | Seoul |
4th row | Seoul |
5th row | Seoul |
Common Values
Value | Count | Frequency (%) |
Seoul | 1491 | 99.8% |
Gyeonggi-do | 2 | 0.1% |
<NA> | 1 | 0.1% |
Common Values (Plot)
Value | Count | Frequency (%) |
seoul | 1491 | 99.8% |
gyeonggi-do | 2 | 0.1% |
na | 1 | 0.1% |
행정구
Categorical
HIGH CORRELATION
Distinct | 28 |
---|---|
Distinct (%) | 1.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.8 KiB |
Gangnam-gu | 740 |
---|---|
Seocho-gu | 193 |
Jung-gu | 94 |
Yeongdeungpo-gu | 46 |
Songpa-gu | 45 |
Other values (23) | 376 |
Length
Max length | 22 |
---|---|
Median length | 10 |
Mean length | 9.8440428 |
Min length | 4 |
Unique
Unique | 3 |
---|---|
Unique (%) | 0.2% |
Sample
1st row | Seocho-gu |
---|---|
2nd row | Gangnam-gu |
3rd row | Gangdong-gu |
4th row | Gangnam-gu |
5th row | Gangnam-gu |
Common Values
Value | Count | Frequency (%) |
Gangnam-gu | 740 | 49.5% |
Seocho-gu | 193 | 12.9% |
Jung-gu | 94 | 6.3% |
Yeongdeungpo-gu | 46 | 3.1% |
Songpa-gu | 45 | 3.0% |
Gangseo-gu | 40 | 2.7% |
Mapo-gu | 33 | 2.2% |
Dongdaemun-gu | 28 | 1.9% |
Jongno-gu | 24 | 1.6% |
Gwanak-gu | 22 | 1.5% |
Other values (18) | 229 | 15.3% |
Length
Value | Count | Frequency (%) |
gangnam-gu | 740 | 49.5% |
seocho-gu | 193 | 12.9% |
jung-gu | 94 | 6.3% |
yeongdeungpo-gu | 46 | 3.1% |
songpa-gu | 45 | 3.0% |
gangseo-gu | 40 | 2.7% |
mapo-gu | 33 | 2.2% |
dongdaemun-gu | 28 | 1.9% |
jongno-gu | 24 | 1.6% |
gwanak-gu | 22 | 1.5% |
Other values (19) | 231 | 15.4% |
행정동
Text
Distinct | 246 |
---|---|
Distinct (%) | 16.5% |
Missing | 1 |
Missing (%) | 0.1% |
Memory size | 11.8 KiB |
Length
Max length | 20 |
---|---|
Median length | 18 |
Mean length | 12.606162 |
Min length | 8 |
Characters and Unicode
Total characters | 18821 |
---|---|
Distinct characters | 49 |
Distinct categories | 5 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 118 |
---|---|
Unique (%) | 7.9% |
Sample
1st row | Seocho4-dong |
---|---|
2nd row | Dogok1-dong |
3rd row | Seongnae2-dong |
4th row | Nonhyeon1-dong |
5th row | Samseong1-dong |
Value | Count | Frequency (%) |
apgujeong-dong | 140 | 9.4% |
yeoksam1-dong | 137 | 9.2% |
sinsa-dong | 123 | 8.2% |
nonhyeon1-dong | 91 | 6.1% |
cheongdam-dong | 90 | 6.0% |
seocho4-dong | 75 | 5.0% |
myeong-dong | 56 | 3.8% |
nonhyeon2-dong | 56 | 3.8% |
jamwon-dong | 28 | 1.9% |
samseong1-dong | 21 | 1.4% |
Other values (236) | 676 |
Most occurring characters
Value | Count | Frequency (%) |
o | 2990 | |
n | 2850 | |
g | 2356 | |
d | 1625 | 8.6% |
- | 1493 | 7.9% |
e | 1040 | 5.5% |
a | 894 | 4.8% |
h | 498 | 2.6% |
S | 414 | 2.2% |
1 | 405 | 2.2% |
Other values (39) | 4256 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 14923 | |
Dash Punctuation | 1493 | 7.9% |
Uppercase Letter | 1493 | 7.9% |
Decimal Number | 854 | 4.5% |
Other Punctuation | 58 | 0.3% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
o | 2990 | |
n | 2850 | |
g | 2356 | |
d | 1625 | |
e | 1040 | 7.0% |
a | 894 | 6.0% |
h | 498 | 3.3% |
s | 397 | 2.7% |
m | 397 | 2.7% |
i | 305 | 2.0% |
Other values (11) | 1571 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 414 | |
Y | 199 | |
N | 158 | 10.6% |
A | 149 | 10.0% |
C | 108 | 7.2% |
J | 101 | 6.8% |
D | 85 | 5.7% |
M | 83 | 5.6% |
H | 57 | 3.8% |
G | 56 | 3.8% |
Other values (7) | 83 | 5.6% |
Decimal Number
Value | Count | Frequency (%) |
1 | 405 | |
2 | 202 | |
4 | 121 | 14.2% |
3 | 77 | 9.0% |
6 | 22 | 2.6% |
5 | 15 | 1.8% |
7 | 10 | 1.2% |
0 | 1 | 0.1% |
8 | 1 | 0.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1493 |
Other Punctuation
Value | Count | Frequency (%) |
. | 58 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 16416 | |
Common | 2405 | 12.8% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
o | 2990 | |
n | 2850 | |
g | 2356 | |
d | 1625 | |
e | 1040 | 6.3% |
a | 894 | 5.4% |
h | 498 | 3.0% |
S | 414 | 2.5% |
s | 397 | 2.4% |
m | 397 | 2.4% |
Other values (28) | 2955 |
Common
Value | Count | Frequency (%) |
- | 1493 | |
1 | 405 | 16.8% |
2 | 202 | 8.4% |
4 | 121 | 5.0% |
3 | 77 | 3.2% |
. | 58 | 2.4% |
6 | 22 | 0.9% |
5 | 15 | 0.6% |
7 | 10 | 0.4% |
0 | 1 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 18821 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
o | 2990 | |
n | 2850 | |
g | 2356 | |
d | 1625 | 8.6% |
- | 1493 | 7.9% |
e | 1040 | 5.5% |
a | 894 | 4.8% |
h | 498 | 2.6% |
S | 414 | 2.2% |
1 | 405 | 2.2% |
Other values (39) | 4256 |
Variable | 등록번호 | 행정시 | 행정구 |
---|---|---|---|
등록번호 | 1.000 | 0.000 | 0.177 |
행정시 | 0.000 | 1.000 | 1.000 |
행정구 | 0.177 | 1.000 | 1.000 |
Variable | 행정구 | 행정시 |
---|---|---|
행정구 | 1.000 | 0.992 |
행정시 | 0.992 | 1.000 |
Variable | 등록번호 | 행정시 | 행정구 |
---|---|---|---|
등록번호 | 1.000 | 0.000 | 0.078 |
행정시 | 0.000 | 1.000 | 0.992 |
행정구 | 0.078 | 0.992 | 1.000 |
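The report does not say which coefficient each matrix uses. One common measure of association between two categorical columns such as 행정시 and 행정구 is Cramér's V; the sketch below is an uncorrected, pure-Python version run on hypothetical toy data, not the method or data behind the matrices above. The function name `cramers_v` and the toy labels are illustrative.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equally long sequences of labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))   # observed count per (x, y) pair
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    chi2 = 0.0
    for x, row_total in row_totals.items():
        for y, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else float("nan")

# Hypothetical toy data: each district belongs to exactly one city,
# so the district fully determines the city.
cities = ["Seoul"] * 6 + ["Gyeonggi-do"] * 2
districts = ["Gangnam-gu"] * 3 + ["Seocho-gu"] * 3 + ["Seongnam-si"] * 2
v = cramers_v(cities, districts)
print(round(v, 3))
```

When one column determines the other, as a district determines its city, the coefficient approaches 1, which is in line with the HIGH CORRELATION alerts on 행정시 and 행정구.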
Index | 키값 | 등록번호 | 상호 | 행정시 | 행정구 | 행정동 |
---|---|---|---|---|---|---|
0 | BE_LiST21-0595 | 1884 | Seoul Lee Geon Dental Clinic | Seoul | Seocho-gu | Seocho4-dong |
1 | BE_LiST21-0596 | 1901 | Mokhuri Oriental Medicine Hospital | Seoul | Gangnam-gu | Dogok1-dong |
2 | BE_LiST21-0597 | 1902 | My D Dermatology Clinic | Seoul | Gangdong-gu | Seongnae2-dong |
3 | BE_LiST21-0598 | 1904 | Ever M Dental Clinic | Seoul | Gangnam-gu | Nonhyeon1-dong |
4 | BE_LiST21-0599 | 1908 | Seoul Mirae Hospital | Seoul | Gangnam-gu | Samseong1-dong |
5 | BE_LiST21-0600 | 1910 | Lee Beom-geun Dental Clinic | Seoul | Jung-gu | Myeong-dong |
6 | BE_LiST21-0601 | 1915 | Geon Rehabilitation Clinic | Seoul | Seocho-gu | Seocho2-dong |
7 | BE_LiST21-0602 | 1920 | Cham Teunteun Hospital | Seoul | Guro-gu | Guro3-dong |
8 | BE_LiST21-0603 | 1921 | UD Dental Clinic | Seoul | Seongbuk-gu | Dongseon-dong |
9 | BE_LiST21-0604 | 1922 | Yeoreobun Hospital | Seoul | Gangnam-gu | Nonhyeon1-dong |
Index | 키값 | 등록번호 | 상호 | 행정시 | 행정구 | 행정동 |
---|---|---|---|---|---|---|
1484 | BE_LiST21-1485 | 4121 | S-Top Plastic Surgery | Seoul | Gangseo-gu | Hwagok3-dong |
1485 | BE_LiST21-1486 | 4127 | Seoul Surgical Hospital | Seoul | Songpa-gu | Garakbon-dong |
1486 | BE_LiST21-1487 | 4123 | Sebarun Hospital | Seoul | Seocho-gu | Seocho3-dong |
1487 | BE_LiST21-1488 | 4124 | Bareuda Yu Oriental Medicine Clinic | Seoul | Jongno-gu | Jongno1.2.3.4ga-dong |
1488 | BE_LiST21-1489 | 4125 | CY ENT Center | Seoul | Gangnam-gu | Yeoksam1-dong |
1489 | BE_LiST21-1490 | 4132 | System Plastic Surgery | Seoul | Gangnam-gu | Cheongdam-dong |
1490 | BE_LiST21-1491 | 4136 | Cheongdam Best Internal Medicine Clinic | Seoul | Gangnam-gu | Cheongdam-dong |
1491 | BE_LiST21-1492 | 4133 | Caheum Pain Clinic | Seoul | Mapo-gu | Dohwa-dong |
1492 | BE_LiST21-1493 | 4137 | TS Plastic Surgery | Seoul | Gangnam-gu | Nonhyeon1-dong |
1493 | BE_LiST21-1494 | 4138 | WidWinDermatology Clinic | Seoul | Gangnam-gu | Apgujeong-dong |