Overview

Dataset statistics

Number of variables4
Number of observations200
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.6 KiB
Average record size in memory33.7 B

Variable types

Numeric1
Categorical2
Text1

Dataset

Description국립생태원 기관 대표 홈페이지 게시물(출판도서) 목록이며, 해당 게시물은 기관 대표 홈페이지에 접속하여 확인하실 수 있습니다.
Author국립생태원
URLhttps://www.data.go.kr/data/15090283/fileData.do

Alerts

사이트 has constant value ""Constant
순번 is highly overall correlated with 카테고리High correlation
카테고리 is highly overall correlated with 순번High correlation
카테고리 is highly imbalanced (53.1%)Imbalance
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 20:29:40.598271
Analysis finished2023-12-12 20:29:41.208278
Duration0.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct200
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean100.5
Minimum1
Maximum200
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-13T05:29:41.326504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile10.95
Q150.75
median100.5
Q3150.25
95-th percentile190.05
Maximum200
Range199
Interquartile range (IQR)99.5

Descriptive statistics

Standard deviation57.879185
Coefficient of variation (CV)0.57591228
Kurtosis-1.2
Mean100.5
Median Absolute Deviation (MAD)50
Skewness0
Sum20100
Variance3350
MonotonicityStrictly increasing
2023-12-13T05:29:41.543357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.5%
139 1
 
0.5%
129 1
 
0.5%
130 1
 
0.5%
131 1
 
0.5%
132 1
 
0.5%
133 1
 
0.5%
134 1
 
0.5%
135 1
 
0.5%
136 1
 
0.5%
Other values (190) 190
95.0%
ValueCountFrequency (%)
1 1
0.5%
2 1
0.5%
3 1
0.5%
4 1
0.5%
5 1
0.5%
6 1
0.5%
7 1
0.5%
8 1
0.5%
9 1
0.5%
10 1
0.5%
ValueCountFrequency (%)
200 1
0.5%
199 1
0.5%
198 1
0.5%
197 1
0.5%
196 1
0.5%
195 1
0.5%
194 1
0.5%
193 1
0.5%
192 1
0.5%
191 1
0.5%

사이트
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
국립생태원
200 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국립생태원
2nd row국립생태원
3rd row국립생태원
4th row국립생태원
5th row국립생태원

Common Values

ValueCountFrequency (%)
국립생태원 200
100.0%

Length

2023-12-13T05:29:41.740114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:29:41.883932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국립생태원 200
100.0%

카테고리
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
교육_출판_출판도서_수어영상도서
180 
교육_출판_출판도서_생태영상동화
20 

Length

Max length17
Median length17
Mean length17
Min length17

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row교육_출판_출판도서_수어영상도서
2nd row교육_출판_출판도서_수어영상도서
3rd row교육_출판_출판도서_수어영상도서
4th row교육_출판_출판도서_수어영상도서
5th row교육_출판_출판도서_수어영상도서

Common Values

ValueCountFrequency (%)
교육_출판_출판도서_수어영상도서 180
90.0%
교육_출판_출판도서_생태영상동화 20
 
10.0%

Length

2023-12-13T05:29:42.013363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:29:42.138776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
교육_출판_출판도서_수어영상도서 180
90.0%
교육_출판_출판도서_생태영상동화 20
 
10.0%
Distinct180
Distinct (%)90.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-13T05:29:42.565180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length64
Median length40
Mean length22.78
Min length13

Characters and Unicode

Total characters4556
Distinct characters453
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique160 ?
Unique (%)80.0%

Sample

1st row이솝우화 01 성질 급한 사자
2nd row이솝우화 02 날고 싶은 거북
3rd row이솝우화 03 늑대와 개
4th row이솝우화 04 겁쟁이 사자
5th row이솝우화 05 은혜를 갚은 생쥐
ValueCountFrequency (%)
동화 50
 
3.8%
속담 45
 
3.4%
세계 45
 
3.4%
우리속담 40
 
3.1%
이솝우화 40
 
3.1%
그림형제 25
 
1.9%
옛이야기 25
 
1.9%
우리 25
 
1.9%
안데르센 25
 
1.9%
04 10
 
0.8%
Other values (662) 977
74.8%
2023-12-13T05:29:43.233130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1107
24.3%
128
 
2.8%
118
 
2.6%
104
 
2.3%
95
 
2.1%
94
 
2.1%
0 92
 
2.0%
86
 
1.9%
86
 
1.9%
1 82
 
1.8%
Other values (443) 2564
56.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3049
66.9%
Space Separator 1107
 
24.3%
Decimal Number 400
 
8.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
128
 
4.2%
118
 
3.9%
104
 
3.4%
95
 
3.1%
94
 
3.1%
86
 
2.8%
86
 
2.8%
60
 
2.0%
52
 
1.7%
50
 
1.6%
Other values (432) 2176
71.4%
Decimal Number
ValueCountFrequency (%)
0 92
23.0%
1 82
20.5%
2 55
13.8%
3 52
13.0%
4 25
 
6.2%
5 22
 
5.5%
8 18
 
4.5%
9 18
 
4.5%
6 18
 
4.5%
7 18
 
4.5%
Space Separator
ValueCountFrequency (%)
1107
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3049
66.9%
Common 1507
33.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
128
 
4.2%
118
 
3.9%
104
 
3.4%
95
 
3.1%
94
 
3.1%
86
 
2.8%
86
 
2.8%
60
 
2.0%
52
 
1.7%
50
 
1.6%
Other values (432) 2176
71.4%
Common
ValueCountFrequency (%)
1107
73.5%
0 92
 
6.1%
1 82
 
5.4%
2 55
 
3.6%
3 52
 
3.5%
4 25
 
1.7%
5 22
 
1.5%
8 18
 
1.2%
9 18
 
1.2%
6 18
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3049
66.9%
ASCII 1507
33.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1107
73.5%
0 92
 
6.1%
1 82
 
5.4%
2 55
 
3.6%
3 52
 
3.5%
4 25
 
1.7%
5 22
 
1.5%
8 18
 
1.2%
9 18
 
1.2%
6 18
 
1.2%
Hangul
ValueCountFrequency (%)
128
 
4.2%
118
 
3.9%
104
 
3.4%
95
 
3.1%
94
 
3.1%
86
 
2.8%
86
 
2.8%
60
 
2.0%
52
 
1.7%
50
 
1.6%
Other values (432) 2176
71.4%

Interactions

2023-12-13T05:29:40.883418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:29:43.359758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번카테고리
순번1.0001.000
카테고리1.0001.000
2023-12-13T05:29:43.465039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번카테고리
순번1.0000.980
카테고리0.9801.000

Missing values

2023-12-13T05:29:41.054186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:29:41.166495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번사이트카테고리게시물명
01국립생태원교육_출판_출판도서_수어영상도서이솝우화 01 성질 급한 사자
12국립생태원교육_출판_출판도서_수어영상도서이솝우화 02 날고 싶은 거북
23국립생태원교육_출판_출판도서_수어영상도서이솝우화 03 늑대와 개
34국립생태원교육_출판_출판도서_수어영상도서이솝우화 04 겁쟁이 사자
45국립생태원교육_출판_출판도서_수어영상도서이솝우화 05 은혜를 갚은 생쥐
56국립생태원교육_출판_출판도서_수어영상도서이솝우화 06 까마귀 깃털
67국립생태원교육_출판_출판도서_수어영상도서이솝우화 07 여우와 포도
78국립생태원교육_출판_출판도서_수어영상도서이솝우화 08 매미와 개미
89국립생태원교육_출판_출판도서_수어영상도서이솝우화 09 어부와 원숭이
910국립생태원교육_출판_출판도서_수어영상도서이솝우화 10 포도밭의 보물
순번사이트카테고리게시물명
190191국립생태원교육_출판_출판도서_생태영상동화세계 속담 01 낙타 등에서 평평한 곳을 찾지 마라
191192국립생태원교육_출판_출판도서_생태영상동화세계 속담 02 앵무새처럼 반복한다
192193국립생태원교육_출판_출판도서_생태영상동화세계 속담 03 덜 익은 무화과가 다 익은 무화과와 맞닿으면 익기 시작한다
193194국립생태원교육_출판_출판도서_생태영상동화세계 속담 04 늙은 말도 조랑말에게 배운다
194195국립생태원교육_출판_출판도서_생태영상동화세계 속담 05 닭이 꼬끼오 하고 울었다고 달걀을 낳는 것은 아니다
195196국립생태원교육_출판_출판도서_생태영상동화우리 옛이야기 01 고조선을 세운 단군왕검
196197국립생태원교육_출판_출판도서_생태영상동화우리 옛이야기 02 슬픈 그리움의 꽃 백일홍
197198국립생태원교육_출판_출판도서_생태영상동화우리 옛이야기 03 배고픈 여우와 메추라기
198199국립생태원교육_출판_출판도서_생태영상동화우리 옛이야기 04 개와 닭이 사람으로 변신한 까닭
199200국립생태원교육_출판_출판도서_생태영상동화우리 옛이야기 05 바위에 실려 간 연오랑과 세오녀