Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 10000 |
Missing cells | 10000 |
Missing cells (%) | 12.5% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 771.5 KiB |
Average record size in memory | 79.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Unsupported | 1 |
Dataset
Description | 경기도_BMS 공통일자 정보 |
---|---|
Author | 경기도 |
URL | https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=Z5AMQ5BX3QVVO7RGOLH333026625&infSeq=1 |
연도 is highly overall correlated with 연월일 | High correlation |
연월일 is highly overall correlated with 연도 | High correlation |
요일순번 is highly overall correlated with 요일 and 1 other fields | High correlation |
요일 is highly overall correlated with 요일순번 and 1 other fields | High correlation |
휴일여부 is highly overall correlated with 요일순번 and 1 other fields | High correlation |
휴일여부명 has 10000 (100.0%) missing values | Missing |
연월일 has unique values | Unique |
휴일여부명 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2023-12-10 22:28:05.173781 |
---|---|
Analysis finished | 2023-12-10 22:28:09.127462 |
Duration | 3.95 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 31 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2014.9793 |
Minimum | 2000 |
---|---|
Maximum | 2030 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2000 |
---|---|
5-th percentile | 2001 |
Q1 | 2007 |
median | 2015 |
Q3 | 2023 |
95-th percentile | 2029 |
Maximum | 2030 |
Range | 30 |
Interquartile range (IQR) | 16 |
Descriptive statistics
Standard deviation | 8.928821 |
---|---|
Coefficient of variation (CV) | 0.0044312222 |
Kurtosis | -1.1965978 |
Mean | 2014.9793 |
Median Absolute Deviation (MAD) | 8 |
Skewness | 0.00033060113 |
Sum | 20149793 |
Variance | 79.723844 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2014 | 340 | 3.4% |
2017 | 330 | 3.3% |
2018 | 330 | 3.3% |
2000 | 329 | 3.3% |
2026 | 328 | 3.3% |
2008 | 327 | 3.3% |
2010 | 327 | 3.3% |
2024 | 327 | 3.3% |
2022 | 327 | 3.3% |
2002 | 326 | 3.3% |
Other values (21) | 6709 |
Value | Count | Frequency (%) |
2000 | 329 | |
2001 | 322 | |
2002 | 326 | |
2003 | 313 | |
2004 | 319 | |
2005 | 323 | |
2006 | 321 | |
2007 | 319 | |
2008 | 327 | |
2009 | 323 |
Value | Count | Frequency (%) |
2030 | 318 | |
2029 | 318 | |
2028 | 321 | |
2027 | 313 | |
2026 | 328 | |
2025 | 318 | |
2024 | 327 | |
2023 | 325 | |
2022 | 327 | |
2021 | 314 |
월
Real number (ℝ)
Distinct | 12 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.5282 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 4 |
median | 7 |
Q3 | 10 |
95-th percentile | 12 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 3.4495209 |
---|---|
Coefficient of variation (CV) | 0.52840307 |
Kurtosis | -1.2052583 |
Mean | 6.5282 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.0038003324 |
Sum | 65282 |
Variance | 11.899195 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
12 | 869 | |
7 | 852 | |
5 | 845 | |
3 | 841 | |
8 | 840 | |
10 | 839 | |
1 | 839 | |
6 | 839 | |
4 | 837 | |
11 | 819 | |
Other values (2) | 1580 |
Value | Count | Frequency (%) |
1 | 839 | |
2 | 774 | |
3 | 841 | |
4 | 837 | |
5 | 845 | |
6 | 839 | |
7 | 852 | |
8 | 840 | |
9 | 806 | |
10 | 839 |
Value | Count | Frequency (%) |
12 | 869 | |
11 | 819 | |
10 | 839 | |
9 | 806 | |
8 | 840 | |
7 | 852 | |
6 | 839 | |
5 | 845 | |
4 | 837 | |
3 | 841 |
일
Real number (ℝ)
Distinct | 31 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.6627 |
Minimum | 1 |
---|---|
Maximum | 31 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 8 |
median | 16 |
Q3 | 23 |
95-th percentile | 29 |
Maximum | 31 |
Range | 30 |
Interquartile range (IQR) | 15 |
Descriptive statistics
Standard deviation | 8.7986516 |
---|---|
Coefficient of variation (CV) | 0.56175829 |
Kurtosis | -1.1915853 |
Mean | 15.6627 |
Median Absolute Deviation (MAD) | 8 |
Skewness | 0.015663845 |
Sum | 156627 |
Variance | 77.41627 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 343 | 3.4% |
17 | 339 | 3.4% |
6 | 338 | 3.4% |
8 | 337 | 3.4% |
13 | 335 | 3.4% |
5 | 335 | 3.4% |
9 | 334 | 3.3% |
26 | 334 | 3.3% |
24 | 333 | 3.3% |
2 | 333 | 3.3% |
Other values (21) | 6639 |
Value | Count | Frequency (%) |
1 | 343 | |
2 | 333 | |
3 | 314 | |
4 | 330 | |
5 | 335 | |
6 | 338 | |
7 | 321 | |
8 | 337 | |
9 | 334 | |
10 | 325 |
Value | Count | Frequency (%) |
31 | 196 | |
30 | 291 | |
29 | 299 | |
28 | 324 | |
27 | 324 | |
26 | 334 | |
25 | 324 | |
24 | 333 | |
23 | 332 | |
22 | 321 |
연월일
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20150461 |
Minimum | 20000101 |
---|---|
Maximum | 20301230 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 20000101 |
---|---|
5-th percentile | 20010711 |
Q1 | 20071015 |
median | 20150618 |
Q3 | 20230315 |
95-th percentile | 20290601 |
Maximum | 20301230 |
Range | 301129 |
Interquartile range (IQR) | 159300.5 |
Descriptive statistics
Standard deviation | 89288.071 |
---|---|
Coefficient of variation (CV) | 0.0044310683 |
Kurtosis | -1.1965249 |
Mean | 20150461 |
Median Absolute Deviation (MAD) | 79650 |
Skewness | 0.00033144059 |
Sum | 2.0150461 × 1011 |
Variance | 7.9723596 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20271206 | 1 | < 0.1% |
20190706 | 1 | < 0.1% |
20000116 | 1 | < 0.1% |
20100906 | 1 | < 0.1% |
20170808 | 1 | < 0.1% |
20081028 | 1 | < 0.1% |
20071018 | 1 | < 0.1% |
20190811 | 1 | < 0.1% |
20100116 | 1 | < 0.1% |
20120522 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
20000101 | 1 | |
20000102 | 1 | |
20000103 | 1 | |
20000105 | 1 | |
20000107 | 1 | |
20000108 | 1 | |
20000109 | 1 | |
20000110 | 1 | |
20000111 | 1 | |
20000112 | 1 |
Value | Count | Frequency (%) |
20301230 | 1 | |
20301229 | 1 | |
20301228 | 1 | |
20301227 | 1 | |
20301226 | 1 | |
20301225 | 1 | |
20301224 | 1 | |
20301223 | 1 | |
20301222 | 1 | |
20301221 | 1 |
요일순번
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 7 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.9924 |
Minimum | 1 |
---|---|
Maximum | 7 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 4 |
Q3 | 6 |
95-th percentile | 7 |
Maximum | 7 |
Range | 6 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.0039321 |
---|---|
Coefficient of variation (CV) | 0.5019367 |
Kurtosis | -1.2538323 |
Mean | 3.9924 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.0075265714 |
Sum | 39924 |
Variance | 4.0157438 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 1445 | |
1 | 1439 | |
7 | 1437 | |
4 | 1431 | |
3 | 1421 | |
6 | 1414 | |
5 | 1413 |
Value | Count | Frequency (%) |
1 | 1439 | |
2 | 1445 | |
3 | 1421 | |
4 | 1431 | |
5 | 1413 | |
6 | 1414 | |
7 | 1437 |
Value | Count | Frequency (%) |
7 | 1437 | |
6 | 1414 | |
5 | 1413 | |
4 | 1431 | |
3 | 1421 | |
2 | 1445 | |
1 | 1439 |
요일
Categorical
HIGH CORRELATION
 
Distinct | 7 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
월 | |
---|---|
일 | |
토 | |
수 | |
화 | |
Other values (2) |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 월 |
---|---|
2nd row | 일 |
3rd row | 수 |
4th row | 토 |
5th row | 일 |
Common Values
Value | Count | Frequency (%) |
월 | 1445 | |
일 | 1439 | |
토 | 1437 | |
수 | 1431 | |
화 | 1421 | |
금 | 1414 | |
목 | 1413 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
월 | 1445 | |
일 | 1439 | |
토 | 1437 | |
수 | 1431 | |
화 | 1421 | |
금 | 1414 | |
목 | 1413 |
휴일여부
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
8 | |
---|---|
7 | |
1 | |
9 | 109 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 8 |
---|---|
2nd row | 1 |
3rd row | 8 |
4th row | 7 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
8 | 7040 | |
7 | 1426 | 14.3% |
1 | 1425 | 14.2% |
9 | 109 | 1.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
8 | 7040 | |
7 | 1426 | 14.3% |
1 | 1425 | 14.2% |
9 | 109 | 1.1% |
휴일여부명
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
연도 | 월 | 일 | 연월일 | 요일순번 | 요일 | 휴일여부 | |
---|---|---|---|---|---|---|---|
연도 | 1.000 | 0.000 | 0.000 | 0.994 | 0.000 | 0.000 | 0.151 |
월 | 0.000 | 1.000 | 0.000 | 0.009 | 0.000 | 0.000 | 0.060 |
일 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.045 |
연월일 | 0.994 | 0.009 | 0.000 | 1.000 | 0.000 | 0.000 | 0.151 |
요일순번 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.874 |
요일 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.874 |
휴일여부 | 0.151 | 0.060 | 0.045 | 0.151 | 0.874 | 0.874 | 1.000 |
휴일여부 | 요일 | |
---|---|---|
휴일여부 | 1.000 | 0.813 |
요일 | 0.813 | 1.000 |
연도 | 월 | 일 | 연월일 | 요일순번 | 요일 | 휴일여부 | |
---|---|---|---|---|---|---|---|
연도 | 1.000 | -0.002 | -0.005 | 0.999 | -0.001 | 0.000 | 0.091 |
월 | -0.002 | 1.000 | 0.012 | 0.030 | -0.001 | 0.000 | 0.036 |
일 | -0.005 | 0.012 | 1.000 | -0.002 | -0.001 | 0.000 | 0.018 |
연월일 | 0.999 | 0.030 | -0.002 | 1.000 | -0.002 | 0.000 | 0.091 |
요일순번 | -0.001 | -0.001 | -0.001 | -0.002 | 1.000 | 1.000 | 0.813 |
요일 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.813 |
휴일여부 | 0.091 | 0.036 | 0.018 | 0.091 | 0.813 | 0.813 | 1.000 |
연도 | 월 | 일 | 연월일 | 요일순번 | 요일 | 휴일여부 | 휴일여부명 | |
---|---|---|---|---|---|---|---|---|
6293 | 2027 | 12 | 6 | 20271206 | 2 | 월 | 8 | <NA> |
7616 | 2000 | 10 | 15 | 20001015 | 1 | 일 | 1 | <NA> |
8175 | 2015 | 7 | 22 | 20150722 | 4 | 수 | 8 | <NA> |
7791 | 2030 | 8 | 10 | 20300810 | 7 | 토 | 7 | <NA> |
637 | 2014 | 1 | 5 | 20140105 | 1 | 일 | 1 | <NA> |
6509 | 2028 | 7 | 24 | 20280724 | 2 | 월 | 8 | <NA> |
267 | 2012 | 12 | 8 | 20121208 | 7 | 토 | 7 | <NA> |
9782 | 2020 | 3 | 24 | 20200324 | 3 | 화 | 8 | <NA> |
8880 | 2017 | 8 | 7 | 20170807 | 2 | 월 | 8 | <NA> |
9526 | 2019 | 6 | 24 | 20190624 | 2 | 월 | 8 | <NA> |
연도 | 월 | 일 | 연월일 | 요일순번 | 요일 | 휴일여부 | 휴일여부명 | |
---|---|---|---|---|---|---|---|---|
5389 | 2004 | 4 | 13 | 20040413 | 3 | 화 | 8 | <NA> |
9226 | 2018 | 8 | 10 | 20180810 | 6 | 금 | 8 | <NA> |
3966 | 2021 | 3 | 1 | 20210301 | 2 | 월 | 9 | <NA> |
6322 | 2028 | 1 | 4 | 20280104 | 3 | 화 | 8 | <NA> |
3292 | 2025 | 7 | 9 | 20250709 | 4 | 수 | 8 | <NA> |
1515 | 2011 | 1 | 31 | 20110131 | 2 | 월 | 8 | <NA> |
5560 | 2004 | 10 | 10 | 20041010 | 1 | 일 | 1 | <NA> |
4261 | 2000 | 12 | 30 | 20001230 | 7 | 토 | 7 | <NA> |
2895 | 2024 | 5 | 19 | 20240519 | 1 | 일 | 1 | <NA> |
4161 | 2000 | 9 | 12 | 20000912 | 3 | 화 | 8 | <NA> |