Overview

Dataset statistics

Number of variables3
Number of observations209
Missing cells5
Missing cells (%)0.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.2 KiB
Average record size in memory25.6 B

Variable types

Numeric1
Categorical1
DateTime1

Dataset

Description한국인터넷진흥원 대표홈페이지 메뉴내용에 대한 정보입니다. 메뉴번호, 언어유형, 메뉴내용 등의 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15119687/fileData.do

Alerts

언어유형 is highly imbalanced (59.3%)Imbalance
등록일자 has 5 (2.4%) missing valuesMissing

Reproduction

Analysis started2023-12-12 01:18:29.309984
Analysis finished2023-12-12 01:18:29.754347
Duration0.44 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

메뉴번호
Real number (ℝ)

Distinct206
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean973456.99
Minimum5
Maximum9010520
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2023-12-12T10:18:29.856727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile301.4
Q120204
median1030201
Q31040901
95-th percentile5040102.6
Maximum9010520
Range9010515
Interquartile range (IQR)1020697

Descriptive statistics

Standard deviation1417786.1
Coefficient of variation (CV)1.4564446
Kurtosis7.5541185
Mean973456.99
Median Absolute Deviation (MAD)969392
Skewness2.6160417
Sum2.0345251 × 108
Variance2.0101175 × 1012
MonotonicityNot monotonic
2023-12-12T10:18:30.037249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
201 2
 
1.0%
301 2
 
1.0%
401 2
 
1.0%
5060102 1
 
0.5%
402 1
 
0.5%
1040302 1
 
0.5%
203 1
 
0.5%
3050202 1
 
0.5%
3050102 1
 
0.5%
20204 1
 
0.5%
Other values (196) 196
93.8%
ValueCountFrequency (%)
5 1
0.5%
6 1
0.5%
101 1
0.5%
102 1
0.5%
103 1
0.5%
104 1
0.5%
201 2
1.0%
203 1
0.5%
301 2
1.0%
302 1
0.5%
ValueCountFrequency (%)
9010520 1
0.5%
5060104 1
0.5%
5060102 1
0.5%
5060101 1
0.5%
5040302 1
0.5%
5040301 1
0.5%
5040204 1
0.5%
5040203 1
0.5%
5040202 1
0.5%
5040201 1
0.5%

언어유형
Categorical

IMBALANCE 

Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
KO
192 
EN
 
17

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowKO
2nd rowKO
3rd rowKO
4th rowKO
5th rowKO

Common Values

ValueCountFrequency (%)
KO 192
91.9%
EN 17
 
8.1%

Length

2023-12-12T10:18:30.221456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:18:30.356068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
ko 192
91.9%
en 17
 
8.1%

등록일자
Date

MISSING 

Distinct19
Distinct (%)9.3%
Missing5
Missing (%)2.4%
Memory size1.8 KiB
Minimum2020-02-08 09:56:00
Maximum2023-08-14 15:27:00
2023-12-12T10:18:30.508083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:18:30.644487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)

Interactions

2023-12-12T10:18:29.434638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T10:18:30.733150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
메뉴번호언어유형등록일자
메뉴번호1.0000.2650.000
언어유형0.2651.0000.000
등록일자0.0000.0001.000
2023-12-12T10:18:30.861964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
메뉴번호언어유형
메뉴번호1.0000.321
언어유형0.3211.000

Missing values

2023-12-12T10:18:29.599343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:18:29.711202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

메뉴번호언어유형등록일자
05060102KO2022-02-08 00:00:00
15060101KO2022-02-08 00:00:00
230404KO2022-02-08 00:00:00
31050202KO2022-02-08 00:00:00
41050201KO2022-02-08 00:00:00
51040802KO2022-02-08 00:00:00
61040801KO2022-06-14 16:54:00
71040703KO2022-02-08 00:00:00
81040702KO2022-02-08 00:00:00
91040701KO2022-02-08 00:00:00
메뉴번호언어유형등록일자
1991020502KO2022-01-28 13:40
2001020503KO2022-01-28 13:40
20151303KO<NA>
20251304KO<NA>
203805KO2023-04-18 16:10
204806KO2023-04-18 16:31
205807KO2023-04-18 16:36
206808KO2023-04-18 16:38
2075060104KO2023-08-07 0:00
20810412KO2023-08-14 15:27