Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 1494 |
Missing cells | 1 |
Missing cells (%) | < 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 71.6 KiB |
Average record size in memory | 49.1 B |
Variable types
Text | 3 |
---|---|
Numeric | 1 |
Categorical | 2 |
Dataset
Description | 키값,등록번호,상호,행정시,행정구,행정동 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-13041/S/1/datasetView.do |
Reproduction
Analysis started | 2024-04-19 06:17:14.987129 |
---|---|
Analysis finished | 2024-04-19 06:17:15.739689 |
Duration | 0.75 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
키값
Text
UNIQUE
 
Distinct | 1494 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.8 KiB |
Length
Max length | 14 |
---|---|
Median length | 14 |
Mean length | 14 |
Min length | 14 |
Characters and Unicode
Total characters | 20916 |
---|---|
Distinct characters | 18 |
Distinct categories | 5 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 1494 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | BE_LiST21-0936 |
---|---|
2nd row | BE_LiST21-0937 |
3rd row | BE_LiST21-0938 |
4th row | BE_LiST21-0939 |
5th row | BE_LiST21-0940 |
Value | Count | Frequency (%) |
be_list21-0936 | 1 | 0.1% |
be_list21-0673 | 1 | 0.1% |
be_list21-0682 | 1 | 0.1% |
be_list21-0681 | 1 | 0.1% |
be_list21-0680 | 1 | 0.1% |
be_list21-0679 | 1 | 0.1% |
be_list21-0678 | 1 | 0.1% |
be_list21-0677 | 1 | 0.1% |
be_list21-0676 | 1 | 0.1% |
be_list21-0675 | 1 | 0.1% |
Other values (1484) | 1484 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 2489 | |
2 | 1994 | |
0 | 1496 | 7.2% |
B | 1494 | 7.1% |
T | 1494 | 7.1% |
E | 1494 | 7.1% |
- | 1494 | 7.1% |
S | 1494 | 7.1% |
i | 1494 | 7.1% |
L | 1494 | 7.1% |
Other values (8) | 4479 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 8964 | |
Uppercase Letter | 7470 | |
Dash Punctuation | 1494 | 7.1% |
Lowercase Letter | 1494 | 7.1% |
Connector Punctuation | 1494 | 7.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 2489 | |
2 | 1994 | |
0 | 1496 | |
3 | 500 | 5.6% |
4 | 495 | 5.5% |
6 | 399 | 4.5% |
7 | 399 | 4.5% |
5 | 399 | 4.5% |
8 | 399 | 4.5% |
9 | 394 | 4.4% |
Uppercase Letter
Value | Count | Frequency (%) |
B | 1494 | |
T | 1494 | |
E | 1494 | |
S | 1494 | |
L | 1494 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1494 |
Lowercase Letter
Value | Count | Frequency (%) |
i | 1494 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1494 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 11952 | |
Latin | 8964 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 2489 | |
2 | 1994 | |
0 | 1496 | |
- | 1494 | |
_ | 1494 | |
3 | 500 | 4.2% |
4 | 495 | 4.1% |
6 | 399 | 3.3% |
7 | 399 | 3.3% |
5 | 399 | 3.3% |
Other values (2) | 793 | 6.6% |
Latin
Value | Count | Frequency (%) |
B | 1494 | |
T | 1494 | |
E | 1494 | |
S | 1494 | |
i | 1494 | |
L | 1494 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 20916 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 2489 | |
2 | 1994 | |
0 | 1496 | 7.2% |
B | 1494 | 7.1% |
T | 1494 | 7.1% |
E | 1494 | 7.1% |
- | 1494 | 7.1% |
S | 1494 | 7.1% |
i | 1494 | 7.1% |
L | 1494 | 7.1% |
Other values (8) | 4479 |
등록번호
Real number (ℝ)
UNIQUE
 
Distinct | 1494 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2337.7544 |
Minimum | 1 |
---|---|
Maximum | 9998 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 13.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 184.3 |
Q1 | 1364 |
median | 2577.5 |
Q3 | 3369.75 |
95-th percentile | 3988.7 |
Maximum | 9998 |
Range | 9997 |
Interquartile range (IQR) | 2005.75 |
Descriptive statistics
Standard deviation | 1237.5948 |
---|---|
Coefficient of variation (CV) | 0.52939472 |
Kurtosis | -0.19297006 |
Mean | 2337.7544 |
Median Absolute Deviation (MAD) | 947 |
Skewness | -0.22293602 |
Sum | 3492605 |
Variance | 1531640.9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2688 | 1 | 0.1% |
2783 | 1 | 0.1% |
2349 | 1 | 0.1% |
2312 | 1 | 0.1% |
2330 | 1 | 0.1% |
2326 | 1 | 0.1% |
2315 | 1 | 0.1% |
2298 | 1 | 0.1% |
2279 | 1 | 0.1% |
2281 | 1 | 0.1% |
Other values (1484) | 1484 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
4 | 1 | |
16 | 1 | |
20 | 1 | |
21 | 1 | |
22 | 1 | |
24 | 1 | |
25 | 1 | |
27 | 1 |
Value | Count | Frequency (%) |
9998 | 1 | |
4138 | 1 | |
4137 | 1 | |
4136 | 1 | |
4135 | 1 | |
4133 | 1 | |
4132 | 1 | |
4127 | 1 | |
4126 | 1 | |
4125 | 1 |
상호
Text
Distinct | 1384 |
---|---|
Distinct (%) | 92.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.8 KiB |
Value | Count | Frequency (%) |
院 | 18 | 1.2% |
ud牙科?院 | 12 | 0.8% |
整形外科?院 | 11 | 0.7% |
牙科?院 | 9 | 0.6% |
拉???院 | 5 | 0.3% |
熙春??院 | 4 | 0.3% |
美皮?科?院 | 4 | 0.3% |
微笑?牙科?院 | 4 | 0.3% |
挺挺?院 | 4 | 0.3% |
the | 4 | 0.3% |
Other values (1369) | 1443 |
Most occurring characters
Value | Count | Frequency (%) |
? | 3028 | |
院 | 1284 | 11.2% |
科 | 915 | 8.0% |
外 | 349 | 3.0% |
整 | 332 | 2.9% |
形 | 332 | 2.9% |
牙 | 291 | 2.5% |
皮 | 125 | 1.1% |
美 | 108 | 0.9% |
e | 88 | 0.8% |
Other values (573) | 4623 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 6796 | |
Other Punctuation | 3060 | |
Uppercase Letter | 971 | 8.5% |
Lowercase Letter | 503 | 4.4% |
Decimal Number | 59 | 0.5% |
Close Punctuation | 26 | 0.2% |
Space Separator | 25 | 0.2% |
Open Punctuation | 24 | 0.2% |
Dash Punctuation | 6 | 0.1% |
Math Symbol | 5 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
院 | 1284 | |
科 | 915 | 13.5% |
外 | 349 | 5.1% |
整 | 332 | 4.9% |
形 | 332 | 4.9% |
牙 | 291 | 4.3% |
皮 | 125 | 1.8% |
美 | 108 | 1.6% |
世 | 84 | 1.2% |
首 | 84 | 1.2% |
Other values (499) | 2892 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 81 | 8.3% |
E | 76 | 7.8% |
I | 66 | 6.8% |
N | 63 | 6.5% |
A | 60 | 6.2% |
U | 56 | 5.8% |
M | 50 | 5.1% |
D | 50 | 5.1% |
L | 48 | 4.9% |
B | 40 | 4.1% |
Other values (16) | 381 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 88 | |
a | 46 | |
i | 46 | |
n | 40 | 8.0% |
r | 35 | 7.0% |
l | 35 | 7.0% |
o | 31 | 6.2% |
s | 28 | 5.6% |
h | 22 | 4.4% |
u | 22 | 4.4% |
Other values (15) | 110 |
Decimal Number
Value | Count | Frequency (%) |
1 | 11 | |
5 | 8 | |
6 | 8 | |
3 | 8 | |
8 | 6 | |
2 | 6 | |
4 | 4 | 6.8% |
0 | 3 | 5.1% |
7 | 3 | 5.1% |
9 | 2 | 3.4% |
Other Punctuation
Value | Count | Frequency (%) |
? | 3028 | |
& | 20 | 0.7% |
' | 6 | 0.2% |
. | 3 | 0.1% |
: | 1 | < 0.1% |
, | 1 | < 0.1% |
, | 1 | < 0.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 25 | |
) | 1 | 3.8% |
Space Separator
Value | Count | Frequency (%) |
25 |
Open Punctuation
Value | Count | Frequency (%) |
( | 24 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 6 |
Math Symbol
Value | Count | Frequency (%) |
+ | 5 |
Most occurring scripts
Value | Count | Frequency (%) |
Han | 6788 | |
Common | 3205 | |
Latin | 1474 | 12.8% |
Hangul | 8 | 0.1% |
Most frequent character per script
Han
Value | Count | Frequency (%) |
院 | 1284 | |
科 | 915 | 13.5% |
外 | 349 | 5.1% |
整 | 332 | 4.9% |
形 | 332 | 4.9% |
牙 | 291 | 4.3% |
皮 | 125 | 1.8% |
美 | 108 | 1.6% |
世 | 84 | 1.2% |
首 | 84 | 1.2% |
Other values (494) | 2884 |
Latin
Value | Count | Frequency (%) |
e | 88 | 6.0% |
S | 81 | 5.5% |
E | 76 | 5.2% |
I | 66 | 4.5% |
N | 63 | 4.3% |
A | 60 | 4.1% |
U | 56 | 3.8% |
M | 50 | 3.4% |
D | 50 | 3.4% |
L | 48 | 3.3% |
Other values (41) | 836 |
Common
Value | Count | Frequency (%) |
? | 3028 | |
25 | 0.8% | |
) | 25 | 0.8% |
( | 24 | 0.7% |
& | 20 | 0.6% |
1 | 11 | 0.3% |
5 | 8 | 0.2% |
6 | 8 | 0.2% |
3 | 8 | 0.2% |
8 | 6 | 0.2% |
Other values (13) | 42 | 1.3% |
Hangul
Value | Count | Frequency (%) |
슾 | 2 | |
쒧 | 2 | |
쎖 | 2 | |
앖 | 1 | |
쎱 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
CJK | 6788 | |
ASCII | 4629 | |
None | 50 | 0.4% |
Hangul | 8 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
? | 3028 | |
e | 88 | 1.9% |
S | 81 | 1.7% |
E | 76 | 1.6% |
I | 66 | 1.4% |
N | 63 | 1.4% |
A | 60 | 1.3% |
U | 56 | 1.2% |
M | 50 | 1.1% |
D | 50 | 1.1% |
Other values (61) | 1011 | 21.8% |
CJK
Value | Count | Frequency (%) |
院 | 1284 | |
科 | 915 | 13.5% |
外 | 349 | 5.1% |
整 | 332 | 4.9% |
形 | 332 | 4.9% |
牙 | 291 | 4.3% |
皮 | 125 | 1.8% |
美 | 108 | 1.6% |
世 | 84 | 1.2% |
首 | 84 | 1.2% |
Other values (494) | 2884 |
None
Value | Count | Frequency (%) |
) | 25 | |
( | 24 | |
, | 1 | 2.0% |
Hangul
Value | Count | Frequency (%) |
슾 | 2 | |
쒧 | 2 | |
쎖 | 2 | |
앖 | 1 | |
쎱 | 1 |
행정시
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.8 KiB |
首?特?市 | |
---|---|
京畿道 | 2 |
<NA> | 1 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 4.9966533 |
Min length | 3 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | 首?特?市 |
---|---|
2nd row | 首?特?市 |
3rd row | 首?特?市 |
4th row | 首?特?市 |
5th row | 首?特?市 |
Common Values
Value | Count | Frequency (%) |
首?特?市 | 1491 | |
京畿道 | 2 | 0.1% |
<NA> | 1 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
首?特?市 | 1491 | |
京畿道 | 2 | 0.1% |
na | 1 | 0.1% |
행정구
Categorical
HIGH CORRELATION
 
Distinct | 28 |
---|---|
Distinct (%) | 1.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 11.8 KiB |
江南? | |
---|---|
瑞草? | |
中? | |
永登浦? | 46 |
松坡? | 45 |
Other values (23) |
Length
Max length | 7 |
---|---|
Median length | 3 |
Mean length | 3.0046854 |
Min length | 2 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 0.2% |
Sample
1st row | 江南? |
---|---|
2nd row | 江南? |
3rd row | ?原? |
4th row | 江南? |
5th row | ?路? |
Common Values
Value | Count | Frequency (%) |
江南? | 740 | |
瑞草? | 193 | 12.9% |
中? | 94 | 6.3% |
永登浦? | 46 | 3.1% |
松坡? | 45 | 3.0% |
江西? | 40 | 2.7% |
麻浦? | 33 | 2.2% |
?大?? | 28 | 1.9% |
?路? | 24 | 1.6% |
冠岳? | 22 | 1.5% |
Other values (18) | 229 | 15.3% |
Length
Value | Count | Frequency (%) |
江南 | 740 | |
瑞草 | 193 | 12.9% |
中 | 94 | 6.3% |
永登浦 | 46 | 3.1% |
松坡 | 45 | 3.0% |
江西 | 40 | 2.7% |
麻浦 | 33 | 2.2% |
大 | 28 | 1.9% |
路 | 24 | 1.6% |
冠岳 | 22 | 1.5% |
Other values (19) | 231 | 15.4% |
행정동
Text
Distinct | 234 |
---|---|
Distinct (%) | 15.7% |
Missing | 1 |
Missing (%) | 0.1% |
Memory size | 11.8 KiB |
Value | Count | Frequency (%) |
狎?亭洞 | 140 | 9.4% |
三1洞 | 137 | 9.2% |
新沙洞 | 123 | 8.2% |
1洞 | 91 | 6.1% |
淸潭洞 | 90 | 6.0% |
瑞草4洞 | 75 | 5.0% |
2洞 | 59 | 4.0% |
明洞 | 56 | 3.8% |
쒧院洞 | 28 | 1.9% |
三成1洞 | 21 | 1.4% |
Other values (222) | 673 |
Most occurring characters
Value | Count | Frequency (%) |
洞 | 1492 | |
? | 869 | |
1 | 404 | 7.3% |
2 | 204 | 3.7% |
三 | 193 | 3.5% |
新 | 169 | 3.0% |
亭 | 146 | 2.6% |
狎 | 140 | 2.5% |
沙 | 126 | 2.3% |
瑞 | 122 | 2.2% |
Other values (153) | 1690 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 3774 | |
Other Punctuation | 927 | 16.7% |
Decimal Number | 854 | 15.4% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
洞 | 1492 | |
三 | 193 | 5.1% |
新 | 169 | 4.5% |
亭 | 146 | 3.9% |
狎 | 140 | 3.7% |
沙 | 126 | 3.3% |
瑞 | 122 | 3.2% |
草 | 122 | 3.2% |
潭 | 90 | 2.4% |
淸 | 90 | 2.4% |
Other values (142) | 1084 |
Decimal Number
Value | Count | Frequency (%) |
1 | 404 | |
2 | 204 | |
4 | 121 | 14.2% |
3 | 76 | 8.9% |
6 | 22 | 2.6% |
5 | 15 | 1.8% |
7 | 10 | 1.2% |
8 | 1 | 0.1% |
0 | 1 | 0.1% |
Other Punctuation
Value | Count | Frequency (%) |
? | 869 | |
. | 58 | 6.3% |
Most occurring scripts
Value | Count | Frequency (%) |
Han | 3713 | |
Common | 1781 | |
Hangul | 61 | 1.1% |
Most frequent character per script
Han
Value | Count | Frequency (%) |
洞 | 1492 | |
三 | 193 | 5.2% |
新 | 169 | 4.6% |
亭 | 146 | 3.9% |
狎 | 140 | 3.8% |
沙 | 126 | 3.4% |
瑞 | 122 | 3.3% |
草 | 122 | 3.3% |
潭 | 90 | 2.4% |
淸 | 90 | 2.4% |
Other values (139) | 1023 |
Common
Value | Count | Frequency (%) |
? | 869 | |
1 | 404 | |
2 | 204 | 11.5% |
4 | 121 | 6.8% |
3 | 76 | 4.3% |
. | 58 | 3.3% |
6 | 22 | 1.2% |
5 | 15 | 0.8% |
7 | 10 | 0.6% |
8 | 1 | 0.1% |
Hangul
Value | Count | Frequency (%) |
쒧 | 53 | |
씉 | 6 | 9.8% |
쑿 | 2 | 3.3% |
Most occurring blocks
Value | Count | Frequency (%) |
CJK | 3711 | |
ASCII | 1781 | |
Hangul | 61 | 1.1% |
CJK Compat Ideographs | 2 | < 0.1% |
Most frequent character per block
CJK
Value | Count | Frequency (%) |
洞 | 1492 | |
三 | 193 | 5.2% |
新 | 169 | 4.6% |
亭 | 146 | 3.9% |
狎 | 140 | 3.8% |
沙 | 126 | 3.4% |
瑞 | 122 | 3.3% |
草 | 122 | 3.3% |
潭 | 90 | 2.4% |
淸 | 90 | 2.4% |
Other values (138) | 1021 |
ASCII
Value | Count | Frequency (%) |
? | 869 | |
1 | 404 | |
2 | 204 | 11.5% |
4 | 121 | 6.8% |
3 | 76 | 4.3% |
. | 58 | 3.3% |
6 | 22 | 1.2% |
5 | 15 | 0.8% |
7 | 10 | 0.6% |
8 | 1 | 0.1% |
Hangul
Value | Count | Frequency (%) |
쒧 | 53 | |
씉 | 6 | 9.8% |
쑿 | 2 | 3.3% |
CJK Compat Ideographs
Value | Count | Frequency (%) |
磻 | 2 |
등록번호 | 행정시 | 행정구 | |
---|---|---|---|
등록번호 | 1.000 | 0.000 | 0.177 |
행정시 | 0.000 | 1.000 | 1.000 |
행정구 | 0.177 | 1.000 | 1.000 |
행정구 | 행정시 | |
---|---|---|
행정구 | 1.000 | 0.992 |
행정시 | 0.992 | 1.000 |
등록번호 | 행정시 | 행정구 | |
---|---|---|---|
등록번호 | 1.000 | 0.000 | 0.078 |
행정시 | 0.000 | 1.000 | 0.992 |
행정구 | 0.078 | 0.992 | 1.000 |
키값 | 등록번호 | 상호 | 행정시 | 행정구 | 행정동 | |
---|---|---|---|---|---|---|
0 | BE_LiST21-0936 | 2688 | 世美整形外科?院 | 首?特?市 | 江南? | 新沙洞 |
1 | BE_LiST21-0937 | 2645 | UD江南牙科?院 | 首?特?市 | 江南? | ?三1洞 |
2 | BE_LiST21-0938 | 2659 | ?挺挺?院 | 首?特?市 | ?原? | 上?6.7洞 |
3 | BE_LiST21-0939 | 2637 | JS美?院 | 首?特?市 | 江南? | ?三1洞 |
4 | BE_LiST21-0940 | 2689 | 三星?耳鼻咽喉科 | 首?特?市 | ?路? | ?路1.2.3.4街洞 |
5 | BE_LiST21-0941 | 2647 | 江南高??院 | 首?特?市 | 冠岳? | 幸?洞 |
6 | BE_LiST21-0942 | 2649 | 我的未?皮?科?院 | 首?特?市 | 永登浦? | 汝矣?洞 |
7 | BE_LiST21-0943 | 2681 | SEBARUN?院 | 首?特?市 | 江西? | 登村1洞 |
8 | BE_LiST21-0944 | 2690 | ?永?院 | 首?特?市 | 瑞草? | 瑞草4洞 |
9 | BE_LiST21-0945 | 2686 | 威尼斯牙科?院 | 首?特?市 | 中? | 光熙洞 |
키값 | 등록번호 | 상호 | 행정시 | 행정구 | 행정동 | |
---|---|---|---|---|---|---|
1484 | BE_LiST21-0464 | 1429 | RAUM整形外科?院 | 首?特?市 | 江南? | 狎?亭洞 |
1485 | BE_LiST21-0465 | 1431 | UD牙科?院 | 首?特?市 | ?原? | 上?6.7洞 |
1486 | BE_LiST21-0466 | 1438 | 松坡第一?院 | 首?特?市 | 松坡? | 松坡1洞 |
1487 | BE_LiST21-0467 | 1448 | S普?普姿整形外科 | 首?特?市 | 江南? | ??1洞 |
1488 | BE_LiST21-0468 | 1451 | UD牙科?院 | 首?特?市 | 瑞草? | 瑞草2洞 |
1489 | BE_LiST21-0469 | 1452 | ?熙??院 | 首?特?市 | ?大?? | 祭基洞 |
1490 | BE_LiST21-0470 | 1456 | ???牙科?院 | 首?特?市 | 中? | 明洞 |
1491 | BE_LiST21-0471 | 1457 | UD牙科?院 | 首?特?市 | 永登浦? | 汝矣?洞 |
1492 | BE_LiST21-0472 | 1461 | UD牙科?院 | 首?特?市 | ?大?? | 典?2洞 |
1493 | BE_LiST21-0473 | 1463 | UD牙科?院 | 首?特?市 | 麻浦? | 西?洞 |