Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 419.9 KiB |
Average record size in memory | 43.0 B |
Variable types
Numeric | 2 |
---|---|
Categorical | 1 |
Text | 1 |
Dataset
Description | 경기도 경기통계시스템 차원정보 |
---|---|
Author | 경기도 |
URL | https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=Y3KQ0RNVT4RZK0SJ7Q1R33499065&infSeq=1 |
Reproduction
Analysis started | 2023-12-10 22:30:38.325642 |
---|---|
Analysis finished | 2023-12-10 22:30:39.062766 |
Duration | 0.74 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
표항목인식번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 7307 |
---|---|
Distinct (%) | 73.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 38319.243 |
Minimum | 1 |
---|---|
Maximum | 512846 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 87 |
Q1 | 735 |
median | 4045 |
Q3 | 65766.75 |
95-th percentile | 197232.6 |
Maximum | 512846 |
Range | 512845 |
Interquartile range (IQR) | 65031.75 |
Descriptive statistics
Standard deviation | 60813.479 |
---|---|
Coefficient of variation (CV) | 1.5870219 |
Kurtosis | 3.3696824 |
Mean | 38319.243 |
Median Absolute Deviation (MAD) | 3881 |
Skewness | 1.8965717 |
Sum | 3.8319243 × 108 |
Variance | 3.6982793 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
74 | 10 | 0.1% |
48 | 10 | 0.1% |
55 | 10 | 0.1% |
101 | 10 | 0.1% |
112 | 10 | 0.1% |
19 | 10 | 0.1% |
27 | 10 | 0.1% |
14 | 10 | 0.1% |
23 | 9 | 0.1% |
110 | 9 | 0.1% |
Other values (7297) | 9902 |
Value | Count | Frequency (%) |
1 | 8 | |
2 | 5 | |
3 | 4 | |
4 | 4 | |
5 | 9 | |
6 | 7 | |
7 | 9 | |
8 | 2 | < 0.1% |
9 | 3 | < 0.1% |
10 | 6 |
Value | Count | Frequency (%) |
512846 | 1 | |
512841 | 1 | |
512840 | 1 | |
431284 | 1 | |
431280 | 1 | |
431279 | 1 | |
202858 | 1 | |
202845 | 1 | |
202840 | 1 | |
202833 | 1 |
조직번호
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
210 |
---|
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 210 |
---|---|
2nd row | 210 |
3rd row | 210 |
4th row | 210 |
5th row | 210 |
Common Values
Value | Count | Frequency (%) |
210 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
210 | 10000 |
통계표ID
Text
Distinct | 253 |
---|---|
Distinct (%) | 2.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 19 |
---|---|
Median length | 11 |
Mean length | 11.9806 |
Min length | 10 |
Characters and Unicode
Total characters | 119806 |
---|---|
Distinct characters | 29 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 28 ? |
---|---|
Unique (%) | 0.3% |
Sample
1st row | DT_1B00003_BK |
---|---|
2nd row | DT_1K00003_BK |
3rd row | DT_1L00004 |
4th row | DT_210J0047 |
5th row | DT_210J0047 |
Value | Count | Frequency (%) |
dt_210j0047 | 3855 | |
dt_210j0045 | 526 | 5.3% |
dt_1b00003_bk | 403 | 4.0% |
dt_210j0044 | 361 | 3.6% |
dt_1p00035 | 205 | 2.1% |
dt_210j0038 | 157 | 1.6% |
dt_1h00001_1_bk | 131 | 1.3% |
dt_21002_k001 | 129 | 1.3% |
dt_21002_k026 | 106 | 1.1% |
dt_21002_l010a | 101 | 1.0% |
Other values (243) | 4026 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 32045 | |
_ | 14131 | |
1 | 13561 | |
T | 10336 | 8.6% |
2 | 10152 | 8.5% |
D | 10079 | 8.4% |
4 | 6936 | 5.8% |
J | 5220 | 4.4% |
7 | 4298 | 3.6% |
K | 2662 | 2.2% |
Other values (19) | 10386 | 8.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 71262 | |
Uppercase Letter | 34413 | |
Connector Punctuation | 14131 | 11.8% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
T | 10336 | |
D | 10079 | |
J | 5220 | |
K | 2662 | 7.7% |
B | 2471 | 7.2% |
A | 801 | 2.3% |
P | 527 | 1.5% |
M | 513 | 1.5% |
E | 442 | 1.3% |
I | 349 | 1.0% |
Other values (8) | 1013 | 2.9% |
Decimal Number
Value | Count | Frequency (%) |
0 | 32045 | |
1 | 13561 | |
2 | 10152 | 14.2% |
4 | 6936 | 9.7% |
7 | 4298 | 6.0% |
3 | 2068 | 2.9% |
5 | 1205 | 1.7% |
6 | 364 | 0.5% |
8 | 349 | 0.5% |
9 | 284 | 0.4% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 14131 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 85393 | |
Latin | 34413 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
T | 10336 | |
D | 10079 | |
J | 5220 | |
K | 2662 | 7.7% |
B | 2471 | 7.2% |
A | 801 | 2.3% |
P | 527 | 1.5% |
M | 513 | 1.5% |
E | 442 | 1.3% |
I | 349 | 1.0% |
Other values (8) | 1013 | 2.9% |
Common
Value | Count | Frequency (%) |
0 | 32045 | |
_ | 14131 | |
1 | 13561 | |
2 | 10152 | 11.9% |
4 | 6936 | 8.1% |
7 | 4298 | 5.0% |
3 | 2068 | 2.4% |
5 | 1205 | 1.4% |
6 | 364 | 0.4% |
8 | 349 | 0.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 119806 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 32045 | |
_ | 14131 | |
1 | 13561 | |
T | 10336 | 8.6% |
2 | 10152 | 8.5% |
D | 10079 | 8.4% |
4 | 6936 | 5.8% |
J | 5220 | 4.4% |
7 | 4298 | 3.6% |
K | 2662 | 2.2% |
Other values (19) | 10386 | 8.7% |
최종변경일
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 69 |
---|---|
Distinct (%) | 0.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20165729 |
Minimum | 20150625 |
---|---|
Maximum | 20230616 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 20150625 |
---|---|
5-th percentile | 20150625 |
Q1 | 20150722 |
median | 20170424 |
Q3 | 20170510 |
95-th percentile | 20210223 |
Maximum | 20230616 |
Range | 79991 |
Interquartile range (IQR) | 19788 |
Descriptive statistics
Standard deviation | 15422.032 |
---|---|
Coefficient of variation (CV) | 0.00076476441 |
Kurtosis | 4.9535042 |
Mean | 20165729 |
Median Absolute Deviation (MAD) | 86 |
Skewness | 1.7859807 |
Sum | 2.0165729 × 1011 |
Variance | 2.3783907 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20170510 | 3855 | |
20170416 | 1044 | 10.4% |
20150625 | 734 | 7.3% |
20150626 | 644 | 6.4% |
20170504 | 429 | 4.3% |
20150629 | 408 | 4.1% |
20150727 | 292 | 2.9% |
20150728 | 177 | 1.8% |
20150724 | 174 | 1.7% |
20150722 | 173 | 1.7% |
Other values (59) | 2070 |
Value | Count | Frequency (%) |
20150625 | 734 | |
20150626 | 644 | |
20150629 | 408 | |
20150702 | 101 | 1.0% |
20150703 | 69 | 0.7% |
20150706 | 10 | 0.1% |
20150707 | 20 | 0.2% |
20150708 | 14 | 0.1% |
20150709 | 45 | 0.4% |
20150714 | 19 | 0.2% |
Value | Count | Frequency (%) |
20230616 | 85 | |
20230203 | 25 | 0.2% |
20221110 | 1 | < 0.1% |
20221109 | 74 | |
20221006 | 32 | 0.3% |
20220328 | 42 | 0.4% |
20210602 | 9 | 0.1% |
20210318 | 8 | 0.1% |
20210316 | 76 | |
20210311 | 129 |
표항목인식번호 | 최종변경일 | |
---|---|---|
표항목인식번호 | 1.000 | 0.726 |
최종변경일 | 0.726 | 1.000 |
표항목인식번호 | 최종변경일 | |
---|---|---|
표항목인식번호 | 1.000 | 0.645 |
최종변경일 | 0.645 | 1.000 |
표항목인식번호 | 조직번호 | 통계표ID | 최종변경일 | |
---|---|---|---|---|
65472 | 10603 | 210 | DT_1B00003_BK | 20150629 |
70685 | 706 | 210 | DT_1K00003_BK | 20150625 |
97852 | 2348 | 210 | DT_1L00004 | 20150724 |
25436 | 85261 | 210 | DT_210J0047 | 20170510 |
23244 | 76628 | 210 | DT_210J0047 | 20170510 |
82042 | 3802 | 210 | DT_1P00035 | 20150727 |
37144 | 57554 | 210 | DT_210J0047 | 20170510 |
31827 | 964 | 210 | DT_210J0038 | 20170416 |
81696 | 1744 | 210 | DT_1I00114 | 20150730 |
67364 | 265 | 210 | DT_1M00009_4_BK | 20150625 |
표항목인식번호 | 조직번호 | 통계표ID | 최종변경일 | |
---|---|---|---|---|
64016 | 54 | 210 | DT_1B00004_BK | 20150626 |
85106 | 2691 | 210 | DT_21002_K001 | 20210311 |
20098 | 60014 | 210 | DT_210J0047 | 20170510 |
73481 | 33 | 210 | DT_1M00013_BK | 20150626 |
92569 | 2107 | 210 | DT_1P00035 | 20150727 |
88824 | 485 | 210 | DT_1L00004 | 20150724 |
33898 | 76081 | 210 | DT_210J0047 | 20170510 |
97687 | 616 | 210 | DT_21002H006A | 20150703 |
87587 | 35 | 210 | DT_1C00009_1 | 20150721 |
16462 | 58850 | 210 | DT_210J0047 | 20170510 |