Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows1183
Duplicate rows (%)11.8%
Total size in memory410.2 KiB
Average record size in memory42.0 B

Variable types

Categorical2
DateTime1
Numeric1

Dataset

Description홈페이지에 메뉴, 회원, 콘텐츠 관련 기본정보DB에 대한 내용입니다.
Author국가평생교육진흥원
URLhttps://www.data.go.kr/data/15071858/fileData.do

Alerts

방문자수 has constant value ""Constant
Dataset has 1183 (11.8%) duplicate rowsDuplicates
도메인명 is highly imbalanced (89.8%)Imbalance
재방문자수 has 684 (6.8%) zerosZeros

Reproduction

Analysis started2023-12-12 19:02:17.477720
Analysis finished2023-12-12 19:02:17.875511
Duration0.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

도메인명
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
www.parents.go.kr
9636 
m.parents.go.kr
 
345
www.parents.go.kr.
 
16
221.146.210.194
 
2
parents
 
1

Length

Max length18
Median length17
Mean length16.9312
Min length7

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowwww.parents.go.kr
2nd rowm.parents.go.kr
3rd rowwww.parents.go.kr
4th rowwww.parents.go.kr
5th rowwww.parents.go.kr

Common Values

ValueCountFrequency (%)
www.parents.go.kr 9636
96.4%
m.parents.go.kr 345
 
3.5%
www.parents.go.kr. 16
 
0.2%
221.146.210.194 2
 
< 0.1%
parents 1
 
< 0.1%

Length

2023-12-13T04:02:17.934480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:02:18.031238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
www.parents.go.kr 9652
96.5%
m.parents.go.kr 345
 
3.5%
221.146.210.194 2
 
< 0.1%
parents 1
 
< 0.1%
Distinct1306
Distinct (%)13.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2012-01-14 00:00:00
Maximum2015-09-01 00:00:00
2023-12-13T04:02:18.145592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:02:18.279189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

방문자수
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 10000
100.0%

Length

2023-12-13T04:02:18.407673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:02:18.522889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 10000
100.0%

재방문자수
Real number (ℝ)

ZEROS 

Distinct173
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean18.9527
Minimum0
Maximum376
Zeros684
Zeros (%)6.8%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T04:02:18.635872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q14
median10
Q324
95-th percentile69
Maximum376
Range376
Interquartile range (IQR)20

Descriptive statistics

Standard deviation25.635889
Coefficient of variation (CV)1.3526246
Kurtosis16.187426
Mean18.9527
Median Absolute Deviation (MAD)8
Skewness3.214048
Sum189527
Variance657.19878
MonotonicityNot monotonic
2023-12-13T04:02:18.823498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 684
 
6.8%
2 601
 
6.0%
1 599
 
6.0%
3 553
 
5.5%
4 509
 
5.1%
5 458
 
4.6%
6 406
 
4.1%
7 384
 
3.8%
8 312
 
3.1%
9 303
 
3.0%
Other values (163) 5191
51.9%
ValueCountFrequency (%)
0 684
6.8%
1 599
6.0%
2 601
6.0%
3 553
5.5%
4 509
5.1%
5 458
4.6%
6 406
4.1%
7 384
3.8%
8 312
3.1%
9 303
3.0%
ValueCountFrequency (%)
376 1
< 0.1%
268 1
< 0.1%
266 1
< 0.1%
250 1
< 0.1%
244 1
< 0.1%
229 1
< 0.1%
218 1
< 0.1%
214 1
< 0.1%
205 2
< 0.1%
196 1
< 0.1%

Interactions

2023-12-13T04:02:17.621061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:02:18.930263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도메인명재방문자수
도메인명1.0000.046
재방문자수0.0461.000
2023-12-13T04:02:19.035689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
재방문자수도메인명
재방문자수1.0000.026
도메인명0.0261.000

Missing values

2023-12-13T04:02:17.768496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:02:17.842175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

도메인명등록일방문자수재방문자수
10092www.parents.go.kr2013-05-07123
25973m.parents.go.kr2015-02-1810
22767www.parents.go.kr2014-10-25113
27388www.parents.go.kr2015-04-1716
18347www.parents.go.kr2014-04-28114
29646www.parents.go.kr2015-07-1718
7618www.parents.go.kr2013-01-23142
15434www.parents.go.kr2013-12-23113
5469www.parents.go.kr2012-10-231106
17362www.parents.go.kr2014-03-17143
도메인명등록일방문자수재방문자수
19243www.parents.go.kr2014-06-05121
21746www.parents.go.kr2014-09-1812
5528www.parents.go.kr2012-10-2617
4274www.parents.go.kr2012-09-03133
1981www.parents.go.kr2012-06-01186
486www.parents.go.kr2012-03-31112
9319www.parents.go.kr2013-04-05114
16025www.parents.go.kr2014-01-1814
17868www.parents.go.kr2014-04-0814
16433www.parents.go.kr2014-02-05124

Duplicate rows

Most frequently occurring

도메인명등록일방문자수재방문자수# duplicates
49m.parents.go.kr2015-01-19105
62m.parents.go.kr2015-06-28105
875www.parents.go.kr2014-12-28155
21m.parents.go.kr2014-11-17104
33m.parents.go.kr2014-12-21104
37m.parents.go.kr2014-12-26104
42m.parents.go.kr2015-01-07104
44m.parents.go.kr2015-01-10104
63m.parents.go.kr2015-06-29104
65m.parents.go.kr2015-06-30104