Overview

Dataset statistics

Number of variables5
Number of observations365
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.8 KiB
Average record size in memory44.4 B

Variable types

DateTime1
Numeric4

Dataset

Description우리나라의 다양한 생물들을 체험할 수 있는 전시관을 운영하고 있는 국립생물자원관을 방문한 2017년 관람객 현황 자료
Author환경부 국립생물자원관
URLhttps://www.data.go.kr/data/15039204/fileData.do

Alerts

예약인원 is highly overall correlated with 예약 실제방문인원High correlation
예약 실제방문인원 is highly overall correlated with 예약인원High correlation
비예약 방문인원 is highly overall correlated with 총방문인원High correlation
총방문인원 is highly overall correlated with 비예약 방문인원High correlation
날짜 has unique valuesUnique
예약인원 has 165 (45.2%) zerosZeros
예약 실제방문인원 has 165 (45.2%) zerosZeros
비예약 방문인원 has 57 (15.6%) zerosZeros
총방문인원 has 57 (15.6%) zerosZeros

Reproduction

Analysis started2023-12-12 08:58:10.201538
Analysis finished2023-12-12 08:58:13.787055
Duration3.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

날짜
Date

UNIQUE 

Distinct365
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
Minimum2017-01-01 00:00:00
Maximum2017-12-31 00:00:00
2023-12-12T17:58:13.898199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:58:14.103702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

예약인원
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct173
Distinct (%)47.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean173.23836
Minimum0
Maximum1088
Zeros165
Zeros (%)45.2%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2023-12-12T17:58:14.333289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median40
Q3285
95-th percentile685.6
Maximum1088
Range1088
Interquartile range (IQR)285

Descriptive statistics

Standard deviation246.84833
Coefficient of variation (CV)1.4249058
Kurtosis2.1192621
Mean173.23836
Median Absolute Deviation (MAD)40
Skewness1.6249259
Sum63232
Variance60934.1
MonotonicityNot monotonic
2023-12-12T17:58:14.525677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 165
45.2%
40 7
 
1.9%
100 3
 
0.8%
27 2
 
0.5%
45 2
 
0.5%
105 2
 
0.5%
172 2
 
0.5%
210 2
 
0.5%
254 2
 
0.5%
80 2
 
0.5%
Other values (163) 176
48.2%
ValueCountFrequency (%)
0 165
45.2%
13 1
 
0.3%
16 1
 
0.3%
18 1
 
0.3%
19 1
 
0.3%
22 1
 
0.3%
24 1
 
0.3%
27 2
 
0.5%
33 1
 
0.3%
34 1
 
0.3%
ValueCountFrequency (%)
1088 1
0.3%
1050 1
0.3%
1032 1
0.3%
1010 1
0.3%
1008 1
0.3%
991 1
0.3%
979 1
0.3%
952 1
0.3%
932 1
0.3%
903 1
0.3%

예약 실제방문인원
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct173
Distinct (%)47.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean172.43288
Minimum0
Maximum1088
Zeros165
Zeros (%)45.2%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2023-12-12T17:58:14.679634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median40
Q3285
95-th percentile685.6
Maximum1088
Range1088
Interquartile range (IQR)285

Descriptive statistics

Standard deviation245.76469
Coefficient of variation (CV)1.4252775
Kurtosis2.1387577
Mean172.43288
Median Absolute Deviation (MAD)40
Skewness1.6292593
Sum62938
Variance60400.285
MonotonicityNot monotonic
2023-12-12T17:58:14.861702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 165
45.2%
40 7
 
1.9%
100 3
 
0.8%
210 3
 
0.8%
122 2
 
0.5%
45 2
 
0.5%
105 2
 
0.5%
172 2
 
0.5%
254 2
 
0.5%
95 2
 
0.5%
Other values (163) 175
47.9%
ValueCountFrequency (%)
0 165
45.2%
13 1
 
0.3%
16 1
 
0.3%
18 1
 
0.3%
19 1
 
0.3%
22 1
 
0.3%
24 1
 
0.3%
27 2
 
0.5%
33 1
 
0.3%
34 1
 
0.3%
ValueCountFrequency (%)
1088 1
0.3%
1050 1
0.3%
1032 1
0.3%
1010 1
0.3%
1008 1
0.3%
991 1
0.3%
952 1
0.3%
932 1
0.3%
929 1
0.3%
903 1
0.3%

비예약 방문인원
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct289
Distinct (%)79.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1021.189
Minimum0
Maximum11606
Zeros57
Zeros (%)15.6%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2023-12-12T17:58:15.023746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1210
median602
Q31828
95-th percentile2812.8
Maximum11606
Range11606
Interquartile range (IQR)1618

Descriptive statistics

Standard deviation1140.3039
Coefficient of variation (CV)1.1166433
Kurtosis19.361657
Mean1021.189
Median Absolute Deviation (MAD)554
Skewness2.7942155
Sum372734
Variance1300292.9
MonotonicityNot monotonic
2023-12-12T17:58:15.198071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 57
 
15.6%
602 3
 
0.8%
403 3
 
0.8%
332 2
 
0.5%
398 2
 
0.5%
1405 2
 
0.5%
2714 2
 
0.5%
806 2
 
0.5%
276 2
 
0.5%
102 2
 
0.5%
Other values (279) 288
78.9%
ValueCountFrequency (%)
0 57
15.6%
35 1
 
0.3%
42 1
 
0.3%
48 1
 
0.3%
54 1
 
0.3%
57 1
 
0.3%
59 1
 
0.3%
85 1
 
0.3%
101 1
 
0.3%
102 2
 
0.5%
ValueCountFrequency (%)
11606 1
0.3%
4243 1
0.3%
4233 1
0.3%
4179 1
0.3%
3755 1
0.3%
3507 1
0.3%
3325 1
0.3%
3225 1
0.3%
3198 1
0.3%
3192 1
0.3%

총방문인원
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct286
Distinct (%)78.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1193.6219
Minimum0
Maximum11606
Zeros57
Zeros (%)15.6%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2023-12-12T17:58:15.400324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1430
median890
Q31956
95-th percentile2943.8
Maximum11606
Range11606
Interquartile range (IQR)1526

Descriptive statistics

Standard deviation1116.3346
Coefficient of variation (CV)0.93524976
Kurtosis19.665517
Mean1193.6219
Median Absolute Deviation (MAD)669
Skewness2.6647812
Sum435672
Variance1246203
MonotonicityNot monotonic
2023-12-12T17:58:15.638022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 57
 
15.6%
178 2
 
0.5%
444 2
 
0.5%
729 2
 
0.5%
736 2
 
0.5%
1588 2
 
0.5%
660 2
 
0.5%
1046 2
 
0.5%
552 2
 
0.5%
1828 2
 
0.5%
Other values (276) 290
79.5%
ValueCountFrequency (%)
0 57
15.6%
114 1
 
0.3%
159 1
 
0.3%
178 2
 
0.5%
186 1
 
0.3%
187 1
 
0.3%
204 1
 
0.3%
206 1
 
0.3%
210 1
 
0.3%
227 2
 
0.5%
ValueCountFrequency (%)
11606 1
0.3%
4243 1
0.3%
4233 1
0.3%
4179 1
0.3%
3755 1
0.3%
3589 1
0.3%
3507 1
0.3%
3325 1
0.3%
3225 1
0.3%
3198 1
0.3%

Interactions

2023-12-12T17:58:12.403491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:58:10.401099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:58:11.044600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:58:11.704008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:58:12.605276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:58:10.548558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:58:11.208503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:58:11.836933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:58:12.791380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:58:10.700781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:58:11.378919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:58:12.017202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:58:13.100581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:58:10.862766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:58:11.551019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:58:12.199326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:58:15.752751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
예약인원예약 실제방문인원비예약 방문인원총방문인원
예약인원1.0001.0000.4020.434
예약 실제방문인원1.0001.0000.4020.435
비예약 방문인원0.4020.4021.0000.996
총방문인원0.4340.4350.9961.000
2023-12-12T17:58:16.309844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
예약인원예약 실제방문인원비예약 방문인원총방문인원
예약인원1.0001.000-0.0840.070
예약 실제방문인원1.0001.000-0.0830.071
비예약 방문인원-0.084-0.0831.0000.965
총방문인원0.0700.0710.9651.000

Missing values

2023-12-12T17:58:13.483314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:58:13.712580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

날짜예약인원예약 실제방문인원비예약 방문인원총방문인원
02017-01-010000
12017-01-020000
22017-01-030013021302
32017-01-04272710831110
42017-01-05117117848965
52017-01-060010761076
62017-01-070027792779
72017-01-080037553755
82017-01-090000
92017-01-10181181443624
날짜예약인원예약 실제방문인원비예약 방문인원총방문인원
3552017-12-2200210210
3562017-12-23404015401580
3572017-12-240026922692
3582017-12-250000
3592017-12-26347347126473
3602017-12-2700187187
3612017-12-2813613642178
3622017-12-291919140159
3632017-12-3000552552
3642017-12-3100789789