Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 10000 |
Missing cells | 4470 |
Missing cells (%) | 8.9% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 498.0 KiB |
Average record size in memory | 51.0 B |
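The percentage and per-record figures above follow directly from the reported counts. The short check below is a sketch that recomputes them; the variable names are illustrative and not part of the report, and it assumes the missing-cell percentage is taken over all cells (variables × observations).

```python
# Recompute two derived figures from the reported dataset statistics.
n_variables = 5
n_observations = 10_000
missing_cells = 4_470
total_size_kib = 498.0

# Missing cells as a share of all cells (assumed denominator: variables x observations).
missing_pct = 100 * missing_cells / (n_variables * n_observations)

# Average record size: total bytes divided by the number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```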
Variable types
Categorical | 2 |
---|---|
Text | 1 |
Numeric | 2 |
Dataset
Description | Gyeonggi-do Gyeonggi Statistics System periods (경기도 경기통계시스템 주기) |
---|---|
Author | Gyeonggi-do (경기도) |
URL | https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=9JRU77ZKD8EZ22XTHZ8A33463563&infSeq=1 |
Reproduction
Analysis started | 2023-12-10 21:23:26.551835 |
---|---|
Analysis finished | 2023-12-10 21:23:27.427539 |
Duration | 0.88 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
조직번호 (organization number)
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 210 |
---|---|
2nd row | 210 |
3rd row | 210 |
4th row | 210 |
5th row | 210 |
Common Values
Value | Count | Frequency (%) |
210 | 10000 | 100.0% |
통계표ID (statistical table ID)
Text
Distinct | 3532 |
---|---|
Distinct (%) | 35.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 23 |
---|---|
Median length | 22 |
Mean length | 12.6128 |
Min length | 6 |
Characters and Unicode
Total characters | 126128 |
---|---|
Distinct characters | 35 |
Distinct categories | 3 |
Distinct scripts | 2 |
Distinct blocks | 1 |
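The three distinct categories reported here correspond to Unicode general categories. As a minimal sketch, Python's standard `unicodedata` module can reproduce the per-category tally for a single table ID; the helper name `category_counts` is hypothetical, and the example string is the first sample row from this report.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category (e.g. Lu, Nd, Pc)."""
    return Counter(unicodedata.category(ch) for ch in text)


# Lu = Uppercase Letter, Nd = Decimal Number, Pc = Connector Punctuation
counts = category_counts("DT_21002_P013_BK")
for category, n in sorted(counts.items()):
    print(category, n)
```

Summing such counters over every value in the column would give totals comparable to the category table in this section.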
Unique
Unique | 1503 |
---|---|
Unique (%) | 15.0% |
Sample
1st row | DT_21002_P013_BK |
---|---|
2nd row | DT_1K00004_BK |
3rd row | DT_1E00032 |
4th row | DT_1C00003_BK |
5th row | DT_21002_P012_BK |
Value | Count | Frequency (%) |
dt_1a00004 | 69 | 0.7% |
dt_21002h005_4_bk | 61 | 0.6% |
dt_21002h006_4_bk | 54 | 0.5% |
dt_21002h010_bk | 48 | 0.5% |
dt_2020037_006 | 48 | 0.5% |
dt_21002b011_bk | 47 | 0.5% |
dt_21002h003_bk | 46 | 0.5% |
dt_21002h009_bk | 44 | 0.4% |
dt_21002_j001_bk | 44 | 0.4% |
dt_1f00007 | 44 | 0.4% |
Other values (3522) | 9495 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 31122 | 24.7% |
_ | 16575 | 13.1% |
1 | 14544 | 11.5% |
2 | 11783 | 9.3% |
T | 11002 | 8.7% |
D | 9834 | 7.8% |
B | 3799 | 3.0% |
K | 3753 | 3.0% |
3 | 2368 | 1.9% |
4 | 2279 | 1.8% |
Other values (25) | 19069 | 15.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 68347 | 54.2% |
Uppercase Letter | 41206 | 32.7% |
Connector Punctuation | 16575 | 13.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
T | 11002 | 26.7% |
D | 9834 | 23.9% |
B | 3799 | 9.2% |
K | 3753 | 9.1% |
A | 1792 | 4.3% |
S | 1182 | 2.9% |
G | 1174 | 2.8% |
E | 1059 | 2.6% |
I | 936 | 2.3% |
M | 923 | 2.2% |
Other values (14) | 5752 | 14.0% |
Decimal Number
Value | Count | Frequency (%) |
0 | 31122 | 45.5% |
1 | 14544 | 21.3% |
2 | 11783 | 17.2% |
3 | 2368 | 3.5% |
4 | 2279 | 3.3% |
5 | 1594 | 2.3% |
6 | 1255 | 1.8% |
7 | 1253 | 1.8% |
8 | 1215 | 1.8% |
9 | 934 | 1.4% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 16575 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 84922 | 67.3% |
Latin | 41206 | 32.7% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
T | 11002 | 26.7% |
D | 9834 | 23.9% |
B | 3799 | 9.2% |
K | 3753 | 9.1% |
A | 1792 | 4.3% |
S | 1182 | 2.9% |
G | 1174 | 2.8% |
E | 1059 | 2.6% |
I | 936 | 2.3% |
M | 923 | 2.2% |
Other values (14) | 5752 | 14.0% |
Common
Value | Count | Frequency (%) |
0 | 31122 | 36.6% |
_ | 16575 | 19.5% |
1 | 14544 | 17.1% |
2 | 11783 | 13.9% |
3 | 2368 | 2.8% |
4 | 2279 | 2.7% |
5 | 1594 | 1.9% |
6 | 1255 | 1.5% |
7 | 1253 | 1.5% |
8 | 1215 | 1.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 126128 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 31122 | 24.7% |
_ | 16575 | 13.1% |
1 | 14544 | 11.5% |
2 | 11783 | 9.3% |
T | 11002 | 8.7% |
D | 9834 | 7.8% |
B | 3799 | 3.0% |
K | 3753 | 3.0% |
3 | 2368 | 1.9% |
4 | 2279 | 1.8% |
Other values (25) | 19069 | 15.1% |
주기구분 (period type)
Categorical
IMBALANCE
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | M |
---|---|
2nd row | Y |
3rd row | Y |
4th row | Y |
5th row | M |
Common Values
Value | Count | Frequency (%) |
Y | 7517 | 75.2% |
M | 2117 | 21.2% |
F | 243 | 2.4% |
Q | 119 | 1.2% |
H | 4 | < 0.1% |
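The IMBALANCE alert on this variable reflects how heavily the counts concentrate in Y. One common way to quantify this is one minus the normalized Shannon entropy of the class counts; the sketch below uses that definition. Whether the profiler applies exactly this formula or a particular threshold is an assumption here, and `imbalance_score` is a hypothetical helper name.

```python
import math


def imbalance_score(counts):
    """Return 1 - H/H_max, where H is the Shannon entropy of the class shares.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    """
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    if len(probs) < 2:
        return 0.0  # a single class carries no balance information
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1.0 - entropy / math.log2(len(probs))


# Counts taken from the Common Values table of this variable.
period_counts = {"Y": 7517, "M": 2117, "F": 243, "Q": 119, "H": 4}
score = imbalance_score(list(period_counts.values()))
print(f"imbalance score: {score:.3f}")
```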
수록시점 (recorded time point)
Real number (ℝ)
HIGH CORRELATION
SKEWED
Distinct | 567 |
---|---|
Distinct (%) | 5.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 60545.299 |
Minimum | 1925 |
---|---|
Maximum | 20050401 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1925 |
---|---|
5-th percentile | 1971 |
Q1 | 1997 |
median | 2007 |
Q3 | 2020 |
95-th percentile | 201608 |
Maximum | 20050401 |
Range | 20048476 |
Interquartile range (IQR) | 23 |
Descriptive statistics
Standard deviation | 535168.44 |
---|---|
Coefficient of variation (CV) | 8.8391412 |
Kurtosis | 1356.6712 |
Mean | 60545.299 |
Median Absolute Deviation (MAD) | 11 |
Skewness | 36.415637 |
Sum | 6.0545299 × 10⁸ |
Variance | 2.8640526 × 10¹¹ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2003 | 316 | 3.2% |
2001 | 306 | 3.1% |
2004 | 298 | 3.0% |
2002 | 297 | 3.0% |
2010 | 293 | 2.9% |
2008 | 273 | 2.7% |
2007 | 271 | 2.7% |
2009 | 260 | 2.6% |
2005 | 257 | 2.6% |
2000 | 245 | 2.5% |
Other values (557) | 7184 | 71.8% |
Value | Count | Frequency (%) |
1925 | 1 | < 0.1% |
1951 | 1 | < 0.1% |
1957 | 1 | < 0.1% |
1958 | 1 | < 0.1% |
1960 | 29 | 0.3% |
1961 | 38 | 0.4% |
1962 | 41 | 0.4% |
1963 | 53 | 0.5% |
1964 | 40 | 0.4% |
1965 | 49 | 0.5% |
Value | Count | Frequency (%) |
20050401 | 2 | < 0.1% |
20040101 | 1 | < 0.1% |
20030401 | 3 | < 0.1% |
20020401 | 1 | < 0.1% |
202306 | 1 | < 0.1% |
202305 | 2 | < 0.1% |
202304 | 5 | 0.1% |
202303 | 5 | 0.1% |
202302 | 4 | < 0.1% |
202301 | 5 | 0.1% |
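The extreme values above suggest that this column mixes period codes of different widths: 4-digit years such as 1925, 6-digit codes such as 202306, and 8-digit codes such as 20050401. Treating them as one numeric scale would explain the large mean and the SKEWED alert. The sketch below classifies a code by its digit count; the interpretation of each width is an assumption, and for 6-digit codes the last two digits may denote a month or another sub-period depending on 주기구분. The function names are hypothetical.

```python
def period_granularity(code: int) -> str:
    """Guess the granularity of a period code from its digit count (assumed convention)."""
    width = len(str(code))
    if width == 4:
        return "year"            # e.g. 1925
    if width == 6:
        return "year+subperiod"  # e.g. 202306; the sub-period meaning depends on 주기구분
    if width == 8:
        return "year+month+day"  # e.g. 20050401
    return "unknown"


def period_year(code: int) -> int:
    """Extract the leading 4-digit year, which is comparable across all widths."""
    return int(str(code)[:4])


for code in (1925, 202306, 20050401):
    print(code, period_granularity(code), period_year(code))
```

Comparing `period_year` values rather than raw codes keeps yearly, monthly, and daily entries on the same scale.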
최종변경일 (last modified date)
Real number (ℝ)
HIGH CORRELATION
MISSING
Distinct | 380 |
---|---|
Distinct (%) | 6.9% |
Missing | 4470 |
Missing (%) | 44.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20174121 |
Minimum | 20070830 |
---|---|
Maximum | 20230912 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 20070830 |
---|---|
5-th percentile | 20101203 |
Q1 | 20150324 |
median | 20200924 |
Q3 | 20210209 |
95-th percentile | 20220728 |
Maximum | 20230912 |
Range | 160082 |
Interquartile range (IQR) | 59885 |
Descriptive statistics
Standard deviation | 44594.109 |
---|---|
Coefficient of variation (CV) | 0.0022104611 |
Kurtosis | -1.2682978 |
Mean | 20174121 |
Median Absolute Deviation (MAD) | 19601 |
Skewness | -0.56885699 |
Sum | 1.1156289 × 10¹¹ |
Variance | 1.9886346 × 10⁹ |
Monotonicity | Not monotonic |
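The values in this column (for example 20070830 and 20230912) look like YYYYMMDD dates stored as numbers, so arithmetic statistics such as the mean, range, and IQR above are not calendar quantities. A minimal sketch, assuming that encoding, converts the integers to dates before measuring a span; `parse_yyyymmdd` is a hypothetical helper, and missing entries are passed through as `None`.

```python
from datetime import date, datetime
from typing import Optional


def parse_yyyymmdd(value: Optional[int]) -> Optional[date]:
    """Convert an integer such as 20210208 to a date; keep missing values as None."""
    if value is None:
        return None
    return datetime.strptime(str(value), "%Y%m%d").date()


# Minimum and maximum reported for this variable.
earliest = parse_yyyymmdd(20070830)
latest = parse_yyyymmdd(20230912)
span_days = (latest - earliest).days
print(earliest, latest, span_days)
```

The numeric range reported above (160082) is the difference of the raw integers, whereas `span_days` is the actual number of days between the two dates.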
Value | Count | Frequency (%) |
20210208 | 1130 | 11.3% |
20101203 | 834 | 8.3% |
20210209 | 662 | 6.6% |
20150625 | 461 | 4.6% |
20150624 | 166 | 1.7% |
20150626 | 141 | 1.4% |
20121121 | 120 | 1.2% |
20211015 | 111 | 1.1% |
20121122 | 105 | 1.1% |
20150324 | 71 | 0.7% |
Other values (370) | 1729 | 17.3% |
(Missing) | 4470 | 44.7% |
Value | Count | Frequency (%) |
20070830 | 5 | 0.1% |
20070902 | 1 | < 0.1% |
20080227 | 6 | 0.1% |
20080228 | 3 | < 0.1% |
20090309 | 1 | < 0.1% |
20090327 | 15 | |
20100217 | 1 | < 0.1% |
20100326 | 1 | < 0.1% |
20100330 | 4 | < 0.1% |
20100408 | 5 | 0.1% |
Value | Count | Frequency (%) |
20230912 | 2 | < 0.1% |
20230908 | 2 | < 0.1% |
20230901 | 1 | < 0.1% |
20230830 | 1 | < 0.1% |
20230824 | 3 | < 0.1% |
20230809 | 3 | < 0.1% |
20230728 | 3 | < 0.1% |
20230710 | 9 | 0.1% |
20230707 | 4 | < 0.1% |
20230706 | 1 | < 0.1% |
 | 주기구분 | 수록시점 | 최종변경일 |
---|---|---|---|
주기구분 | 1.000 | 0.136 | 0.551 |
수록시점 | 0.136 | 1.000 | NaN |
최종변경일 | 0.551 | NaN | 1.000 |
 | 수록시점 | 최종변경일 | 주기구분 |
---|---|---|---|
수록시점 | 1.000 | 0.620 | 0.167 |
최종변경일 | 0.620 | 1.000 | 0.260 |
주기구분 | 0.167 | 0.260 | 1.000 |
 | 조직번호 | 통계표ID | 주기구분 | 수록시점 | 최종변경일 |
---|---|---|---|---|---|
50472 | 210 | DT_21002_P013_BK | M | 201001 | 20210209 |
48325 | 210 | DT_1K00004_BK | Y | 1996 | 20150625 |
5929 | 210 | DT_1E00032 | Y | 1987 | 20101203 |
12526 | 210 | DT_1C00003_BK | Y | 2010 | 20150625 |
28002 | 210 | DT_21002_P012_BK | M | 201211 | 20210209 |
60162 | 210 | DT_21002H006_4_BK | M | 200411 | 20210208 |
5231 | 210 | TX_210020633 | Y | 2004 | <NA> |
34501 | 210 | DT_2020037_005 | M | 202012 | 20220824 |
6629 | 210 | DT_1K00331 | Y | 1997 | 20101203 |
35233 | 210 | DT_1E00033_BK | Y | 1994 | 20150728 |
 | 조직번호 | 통계표ID | 주기구분 | 수록시점 | 최종변경일 |
---|---|---|---|---|---|
57409 | 210 | DT_21002H005_BK | M | 201003 | 20210208 |
54417 | 210 | DT_21002I009_BK | Y | 2016 | 20210209 |
36165 | 210 | DT_21002_K002 | Y | 2014 | <NA> |
37496 | 210 | DT_2021057_1_8 | F | 2015 | 20210405 |
60748 | 210 | DT_21002_J016_BK | Y | 2015 | 20210209 |
56020 | 210 | DT_21002H005_4_BK | M | 199905 | 20210208 |
36405 | 210 | DT_210J0013 | Y | 2000 | <NA> |
49535 | 210 | DT_21002C002_BK | Q | 200601 | 20210208 |
48797 | 210 | DT_21002E048 | Y | 2019 | 20210426 |
49257 | 210 | DT_1MB0001 | Y | 1975 | <NA> |