Overview

Dataset statistics

Number of variables: 7
Number of observations: 6575
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 379.0 KiB
Average record size in memory: 59.0 B
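
The average record size can be cross-checked against the two figures it depends on. The sketch below is only an arithmetic check, assuming 1 KiB = 1024 bytes and that the per-record figure is total memory divided by the number of observations; the variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics.
total_size_kib = 379.0
n_observations = 6575

# Assumption: 1 KiB = 1024 bytes.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

print(f"{avg_record_bytes:.1f} B")
```

Under these assumptions the result rounds to the 59.0 B shown in the report.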

Variable types

Numeric: 3
DateTime: 2
Categorical: 1
Boolean: 1

Dataset

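Description: Date information stored in the main website database of the Korea Internet & Security Agency (한국인터넷진흥원).
Author: Korea Internet & Security Agency (한국인터넷진흥원)
URL: https://www.data.go.kr/data/15092576/fileData.do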

Alerts

휴일여부 is highly overall correlated with 요일 (High correlation)
요일 is highly overall correlated with 휴일여부 (High correlation)
날짜코드 is highly overall correlated with 년월코드 and 1 other field (High correlation)
년월코드 is highly overall correlated with 날짜코드 and 1 other field (High correlation)
주코드 is highly overall correlated with 날짜코드 and 1 other field (High correlation)
날짜코드 has unique values (Unique)
날짜 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 05:02:34.791907
Analysis finished: 2023-12-12 05:02:36.559847
Duration: 1.77 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
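
The reported duration can be recomputed from the two timestamps. This is a minimal sketch using only the standard library; it assumes both timestamps are in the same time zone, which the report does not state.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2023-12-12 05:02:34.791907")
finished = datetime.fromisoformat("2023-12-12 05:02:36.559847")

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```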

Variables

날짜코드 (date code)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 6575
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20035672
Minimum: 19950101
Maximum: 20121231
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 57.9 KiB

Quantile statistics

Minimum: 19950101
5-th percentile: 19951126
Q1: 19990702
Median: 20040101
Q3: 20080702
95-th percentile: 20120206
Maximum: 20121231
Range: 171130
Interquartile range (IQR): 89999
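
The IQR of 89999 is one less than the difference between the displayed Q3 and Q1 (20080702 - 19990702 = 90000). One plausible explanation is that the quartiles are linearly interpolated between neighbouring values and shown rounded. The sketch below illustrates this under two assumptions that the report does not confirm: the dataset holds exactly one row per calendar day between the minimum and maximum dates, and quantiles use linear interpolation at position q * (n - 1). The helper `linear_quantile` is hypothetical.

```python
from datetime import date, timedelta


def linear_quantile(sorted_values, q):
    """Quantile by linear interpolation at position q * (n - 1)."""
    pos = q * (len(sorted_values) - 1)
    lower = int(pos)
    frac = pos - lower
    if frac == 0:
        return float(sorted_values[lower])
    step = sorted_values[lower + 1] - sorted_values[lower]
    return sorted_values[lower] + frac * step


# Assumption: one row per calendar day from 1995-01-01 through 2012-12-31.
start, end = date(1995, 1, 1), date(2012, 12, 31)
n_days = (end - start).days + 1
codes = [int((start + timedelta(days=i)).strftime("%Y%m%d")) for i in range(n_days)]

q1 = linear_quantile(codes, 0.25)
median = linear_quantile(codes, 0.50)
q3 = linear_quantile(codes, 0.75)
value_range = codes[-1] - codes[0]

print(n_days, q1, median, q3, q3 - q1, value_range)
```

Under these assumptions Q1 and Q3 each fall halfway between two consecutive day codes, so their difference is 89999 even though the rounded values differ by 90000.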

Descriptive statistics

Standard deviation: 51890.245
Coefficient of variation (CV): 0.0025898929
Kurtosis: -1.2073593
Mean: 20035672
Median Absolute Deviation (MAD): 41130
Skewness: 3.8346224 × 10^-5
Sum: 1.3173454 × 10^11
Variance: 2.6925975 × 10^9
Monotonicity: Not monotonic
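
Several of these statistics are tied together by standard identities: variance is the square of the standard deviation, the CV is the standard deviation divided by the mean, and the sum is the mean times the number of observations. The sketch below checks the displayed values against these identities. It works from the rounded figures shown in the report, so it compares with a relative tolerance (chosen here for illustration) rather than exactly.

```python
import math

# Values copied from the report (rounded as displayed).
n = 6575
mean = 20035672
std = 51890.245
variance = 2.6925975e9
cv = 0.0025898929
total = 1.3173454e11

# Recompute each derived statistic from the others.
variance_from_std = std ** 2
cv_from_ratio = std / mean
sum_from_mean = mean * n

REL_TOL = 1e-6  # illustrative tolerance for display rounding
print(math.isclose(variance_from_std, variance, rel_tol=REL_TOL))
print(math.isclose(cv_from_ratio, cv, rel_tol=REL_TOL))
print(math.isclose(sum_from_mean, total, rel_tol=REL_TOL))
```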
Histogram with fixed size bins (bins=50)
Common values

Value                 Count   Frequency (%)
19950101              1       < 0.1%
20061229              1       < 0.1%
20070102              1       < 0.1%
20070101              1       < 0.1%
20070717              1       < 0.1%
20070716              1       < 0.1%
20070715              1       < 0.1%
20070714              1       < 0.1%
20070713              1       < 0.1%
20070712              1       < 0.1%
Other values (6565)   6565    99.8%

Minimum 10 values

Value      Count   Frequency (%)
19950101   1       < 0.1%
19950102   1       < 0.1%
19950103   1       < 0.1%
19950104   1       < 0.1%
19950105   1       < 0.1%
19950106   1       < 0.1%
19950107   1       < 0.1%
19950108   1       < 0.1%
19950109   1       < 0.1%
19950110   1       < 0.1%

Maximum 10 values

Value      Count   Frequency (%)
20121231   1       < 0.1%
20121230   1       < 0.1%
20121229   1       < 0.1%
20121228   1       < 0.1%
20121227   1       < 0.1%
20121226   1       < 0.1%
20121225   1       < 0.1%
20121224   1       < 0.1%
20121223   1       < 0.1%
20121222   1       < 0.1%

날짜 (date)
Date

UNIQUE 

Distinct: 6575
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 51.5 KiB
Minimum: 1995-01-01 00:00:00
Maximum: 2012-12-31 00:00:00
Histogram with fixed size bins (bins=50)

년월코드 (year-month code)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 216
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 200356.56
Minimum: 199501
Maximum: 201212
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 57.9 KiB

Quantile statistics

Minimum: 199501
5-th percentile: 199511
Q1: 199907
Median: 200401
Q3: 200807
95-th percentile: 201202
Maximum: 201212
Range: 1711
Interquartile range (IQR): 900

Descriptive statistics

Standard deviation: 518.90242
Coefficient of variation (CV): 0.0025898948
Kurtosis: -1.2073595
Mean: 200356.56
Median Absolute Deviation (MAD): 411
Skewness: 3.8231665 × 10^-5
Sum: 1.3173444 × 10^9
Variance: 269259.73
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                Count   Frequency (%)
199501               31      0.5%
200601               31      0.5%
200501               31      0.5%
200503               31      0.5%
200505               31      0.5%
200507               31      0.5%
200508               31      0.5%
200510               31      0.5%
200512               31      0.5%
200610               31      0.5%
Other values (206)   6265    95.3%

Minimum 10 values

Value    Count   Frequency (%)
199501   31      0.5%
199502   28      0.4%
199503   31      0.5%
199504   30      0.5%
199505   31      0.5%
199506   30      0.5%
199507   31      0.5%
199508   31      0.5%
199509   30      0.5%
199510   31      0.5%

Maximum 10 values

Value    Count   Frequency (%)
201212   31      0.5%
201211   30      0.5%
201210   31      0.5%
201209   30      0.5%
201208   31      0.5%
201207   31      0.5%
201206   30      0.5%
201205   31      0.5%
201204   30      0.5%
201203   31      0.5%

년월 (year-month)
Date

Distinct: 216
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 51.5 KiB
Minimum: 1995-01-01 00:00:00
Maximum: 2012-12-01 00:00:00
Histogram with fixed size bins (bins=50)

주코드 (week code)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 944
Distinct (%): 14.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 200376.65
Minimum: 199452
Maximum: 201253
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 57.9 KiB

Quantile statistics

Minimum: 199452
5-th percentile: 199547
Q1: 199926
Median: 200401
Q3: 200827
95-th percentile: 201206
Maximum: 201253
Range: 1801
Interquartile range (IQR): 901

Descriptive statistics

Standard deviation: 519.0849
Coefficient of variation (CV): 0.0025905458
Kurtosis: -1.2051324
Mean: 200376.65
Median Absolute Deviation (MAD): 450
Skewness: -4.14344 × 10^-5
Sum: 1.3174765 × 10^9
Variance: 269449.13
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                Count   Frequency (%)
200701               8       0.1%
200403               7       0.1%
200627               7       0.1%
200628               7       0.1%
200629               7       0.1%
200630               7       0.1%
200631               7       0.1%
200632               7       0.1%
200649               7       0.1%
200650               7       0.1%
Other values (934)   6504    98.9%

Minimum 10 values

Value    Count   Frequency (%)
199452   1       < 0.1%
199501   7       0.1%
199502   7       0.1%
199503   7       0.1%
199504   7       0.1%
199505   7       0.1%
199506   7       0.1%
199507   7       0.1%
199508   7       0.1%
199509   7       0.1%

Maximum 10 values

Value    Count   Frequency (%)
201253   2       < 0.1%
201252   7       0.1%
201251   7       0.1%
201250   7       0.1%
201249   7       0.1%
201248   7       0.1%
201247   7       0.1%
201246   7       0.1%
201245   7       0.1%
201244   7       0.1%

요일 (day of the week)
Categorical

HIGH CORRELATION 

Distinct: 7
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 51.5 KiB

Value              Count
                   940
                   940
                   939
                   939
                   939
Other values (2)   1878

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

Value   Count   Frequency (%)
        940     14.3%
        940     14.3%
        939     14.3%
        939     14.3%
        939     14.3%
        939     14.3%
        939     14.3%
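
The split into two categories with 940 rows and five with 939 is what one would expect if the dataset contains every calendar day in its range: 6575 = 7 × 939 + 2. The sketch below counts weekdays over that range. It assumes one row per calendar day from 1995-01-01 through 2012-12-31, which the report suggests but does not state, and it uses English weekday names purely for illustration, since the actual category labels are not visible here.

```python
from collections import Counter
from datetime import date, timedelta

# Illustrative labels; date.weekday() returns 0 for Monday through 6 for Sunday.
WEEKDAY_NAMES = ["Monday", "Tuesday", "Wednesday", "Thursday",
                 "Friday", "Saturday", "Sunday"]

# Assumption: one row per calendar day in the reported date range.
start, end = date(1995, 1, 1), date(2012, 12, 31)
n_days = (end - start).days + 1

weekday_counts = Counter(
    WEEKDAY_NAMES[(start + timedelta(days=i)).weekday()] for i in range(n_days)
)

for name, count in weekday_counts.most_common():
    print(name, count, f"{100 * count / n_days:.1f}%")
```

Under this assumption the two weekdays with 940 rows are the ones on which the range starts and ends.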

Length

Histogram of lengths of the category

Common Values (Plot)

Value   Count   Frequency (%)
        940     14.3%
        940     14.3%
        939     14.3%
        939     14.3%
        939     14.3%
        939     14.3%
        939     14.3%

휴일여부 (holiday flag)
Boolean

HIGH CORRELATION 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.5 KiB

Value   Count   Frequency (%)
False   4529    68.9%
True    2046    31.1%

Interactions

[Figure: 9 interaction plots; images not included]

Correlations

            날짜코드   년월코드   주코드   요일    휴일여부
날짜코드    1.000      1.000      0.995    0.000   0.000
년월코드    1.000      1.000      0.995    0.000   0.000
주코드      0.995      0.995      1.000    0.000   0.000
요일        0.000      0.000      0.000    1.000   0.871
휴일여부    0.000      0.000      0.000    0.871   1.000

            휴일여부   요일
휴일여부    1.000      0.941
요일        0.941      1.000

            날짜코드   년월코드   주코드   요일    휴일여부
날짜코드    1.000      1.000      1.000    0.000   0.000
년월코드    1.000      1.000      1.000    0.000   0.000
주코드      1.000      1.000      1.000    0.000   0.000
요일        0.000      0.000      0.000    1.000   0.941
휴일여부    0.000      0.000      0.000    0.941   1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.

Sample

First rows

      날짜코드   날짜         년월코드   년월      주코드   요일   휴일여부
0     19950101   1995-01-01   199501     1995-01   199452          Y
1     19950102   1995-01-02   199501     1995-01   199501          N
2     19950103   1995-01-03   199501     1995-01   199501          N
3     19950104   1995-01-04   199501     1995-01   199501          N
4     19950105   1995-01-05   199501     1995-01   199501          N
5     19950106   1995-01-06   199501     1995-01   199501          N
6     19950108   1995-01-08   199501     1995-01   199501          Y
7     19950109   1995-01-09   199501     1995-01   199502          N
8     19950110   1995-01-10   199501     1995-01   199502          N
9     19950111   1995-01-11   199501     1995-01   199502          N

Last rows

      날짜코드   날짜         년월코드   년월      주코드   요일   휴일여부
6565  20121222   2012-12-22   201212     2012-12   201251          Y
6566  20121223   2012-12-23   201212     2012-12   201252          Y
6567  20121224   2012-12-24   201212     2012-12   201252          N
6568  20121225   2012-12-25   201212     2012-12   201252          N
6569  20121226   2012-12-26   201212     2012-12   201252          N
6570  20121227   2012-12-27   201212     2012-12   201252          N
6571  20121228   2012-12-28   201212     2012-12   201252          N
6572  20121229   2012-12-29   201212     2012-12   201252          Y
6573  20121230   2012-12-30   201212     2012-12   201253          Y
6574  20121231   2012-12-31   201212     2012-12   201253          N
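
In the sample rows, 날짜, 년월코드 and 년월 all appear to be derivable from 날짜코드, which would explain the near-perfect correlations reported among the code columns. The sketch below shows that relationship on three rows copied from the sample. The helper `split_date_code` is hypothetical, and the rule it encodes (YYYYMMDD integer, year-month code obtained by dropping the last two digits) is inferred from the sample, not documented by the dataset.

```python
from datetime import datetime

# (날짜코드, 날짜, 년월코드, 년월) copied from rows in the sample.
sample_rows = [
    (19950101, "1995-01-01", 199501, "1995-01"),
    (19950108, "1995-01-08", 199501, "1995-01"),
    (20121231, "2012-12-31", 201212, "2012-12"),
]


def split_date_code(date_code):
    """Derive (날짜, 년월코드, 년월) from a YYYYMMDD integer."""
    parsed = datetime.strptime(str(date_code), "%Y%m%d").date()
    return parsed.isoformat(), date_code // 100, parsed.strftime("%Y-%m")


for date_code, date_str, ym_code, ym_str in sample_rows:
    derived = split_date_code(date_code)
    print(date_code, derived, derived == (date_str, ym_code, ym_str))
```

The 주코드 column is left out on purpose: the sample does not make its week-numbering rule clear enough to reproduce.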