Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells72
Missing cells (%)0.1%
Duplicate rows2082
Duplicate rows (%)20.8%
Total size in memory644.5 KiB
Average record size in memory66.0 B

Variable types

Numeric2
Categorical4
DateTime1

Dataset

Description남한산성 탐방객조사 결과 집계 현황
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=9QCMTO2Z55T02DE8323P1577405&infSeq=1

Alerts

Dataset has 2082 (20.8%) duplicate rowsDuplicates
입장객수 has 707 (7.1%) zerosZeros

Reproduction

Analysis started2023-12-10 21:50:14.708021
Analysis finished2023-12-10 21:50:15.724113
Duration1.02 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

집계년도
Real number (ℝ)

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2014.0064
Minimum2009
Maximum2019
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:50:15.767760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2009
5-th percentile2009
Q12012
median2014
Q32016
95-th percentile2019
Maximum2019
Range10
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.9829933
Coefficient of variation (CV)0.001481124
Kurtosis-0.92628006
Mean2014.0064
Median Absolute Deviation (MAD)2
Skewness-0.091621932
Sum20140064
Variance8.8982489
MonotonicityNot monotonic
2023-12-11T06:50:15.857569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
2014 1434
14.3%
2015 1398
14.0%
2013 1101
11.0%
2010 1038
10.4%
2012 1006
10.1%
2009 968
9.7%
2017 802
8.0%
2018 800
8.0%
2019 753
7.5%
2016 700
7.0%
ValueCountFrequency (%)
2009 968
9.7%
2010 1038
10.4%
2012 1006
10.1%
2013 1101
11.0%
2014 1434
14.3%
2015 1398
14.0%
2016 700
7.0%
2017 802
8.0%
2018 800
8.0%
2019 753
7.5%
ValueCountFrequency (%)
2019 753
7.5%
2018 800
8.0%
2017 802
8.0%
2016 700
7.0%
2015 1398
14.0%
2014 1434
14.3%
2013 1101
11.0%
2012 1006
10.1%
2010 1038
10.4%
2009 968
9.7%

계절구분명
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
가을
3509 
겨울
2989 
여름
1758 
1744 

Length

Max length2
Median length2
Mean length1.8256
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row여름
2nd row겨울
3rd row가을
4th row가을
5th row겨울

Common Values

ValueCountFrequency (%)
가을 3509
35.1%
겨울 2989
29.9%
여름 1758
17.6%
1744
17.4%

Length

2023-12-11T06:50:15.953999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:50:16.043885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
가을 3509
35.1%
겨울 2989
29.9%
여름 1758
17.6%
1744
17.4%
Distinct214
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2009-02-22 00:00:00
Maximum2019-10-27 00:00:00
2023-12-11T06:50:16.177191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:50:16.294121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

조사요일
Categorical

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1727 
1711 
1462 
1343 
1288 
Other values (2)
2469 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
1727
17.3%
1711
17.1%
1462
14.6%
1343
13.4%
1288
12.9%
1258
12.6%
1211
12.1%

Length

2023-12-11T06:50:16.412758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:50:16.513650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1727
17.3%
1711
17.1%
1462
14.6%
1343
13.4%
1288
12.9%
1258
12.6%
1211
12.1%

장소구분명
Categorical

Distinct36
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
남문
859 
서문
823 
남문갓길 주차장
736 
로터리주차장
718 
중앙주차장
677 
Other values (31)
6187 

Length

Max length14
Median length12
Mean length6.1022
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row연주봉 5암문
2nd row남문
3rd row역사관주차장
4th row노선버스 9번, 9-1번
5th row로터리주차장

Common Values

ValueCountFrequency (%)
남문 859
 
8.6%
서문 823
 
8.2%
남문갓길 주차장 736
 
7.4%
로터리주차장 718
 
7.2%
중앙주차장 677
 
6.8%
남문주차장 659
 
6.6%
동문주차장 643
 
6.4%
역사관주차장 523
 
5.2%
수어장대(6)암문 432
 
4.3%
제1옹성(7)암문 405
 
4.0%
Other values (26) 3525
35.2%

Length

2023-12-11T06:50:16.626025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
주차장 1194
 
9.4%
노선버스 950
 
7.5%
남문 859
 
6.8%
서문 823
 
6.5%
남문갓길 736
 
5.8%
로터리주차장 718
 
5.7%
중앙주차장 677
 
5.3%
남문주차장 659
 
5.2%
동문주차장 643
 
5.1%
역사관주차장 523
 
4.1%
Other values (28) 4875
38.5%

시간대
Categorical

Distinct19
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
11:00~12:00
1118 
14:00~15:00
1081 
16:00~17:00
1079 
10:00~11:00
1075 
12:00~13:00
1067 
Other values (14)
4580 

Length

Max length12
Median length11
Mean length11.0396
Min length11

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row12:00~13:00
2nd row15:00~16:00
3rd row10:00~11:00
4th row12:00 ~13:00
5th row15:00~16:00

Common Values

ValueCountFrequency (%)
11:00~12:00 1118
11.2%
14:00~15:00 1081
10.8%
16:00~17:00 1079
10.8%
10:00~11:00 1075
10.8%
12:00~13:00 1067
10.7%
15:00~16:00 1063
10.6%
13:00~14:00 1042
10.4%
09:00~10:00 1036
10.4%
17:00~18:00 900
9.0%
08:00~09:00 143
 
1.4%
Other values (9) 396
 
4.0%

Length

2023-12-11T06:50:16.739743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
11:00~12:00 1118
10.8%
14:00~15:00 1081
10.4%
16:00~17:00 1079
10.4%
10:00~11:00 1075
10.3%
12:00~13:00 1067
10.3%
15:00~16:00 1063
10.2%
13:00~14:00 1042
10.0%
09:00~10:00 1036
10.0%
17:00~18:00 900
8.7%
08:00~09:00 143
 
1.4%
Other values (10) 792
7.6%

입장객수
Real number (ℝ)

ZEROS 

Distinct463
Distinct (%)4.7%
Missing72
Missing (%)0.7%
Infinite0
Infinite (%)0.0%
Mean63.909649
Minimum0
Maximum3349
Zeros707
Zeros (%)7.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:50:16.861386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q19
median26
Q367
95-th percentile230
Maximum3349
Range3349
Interquartile range (IQR)58

Descriptive statistics

Standard deviation133.60587
Coefficient of variation (CV)2.0905431
Kurtosis101.55503
Mean63.909649
Median Absolute Deviation (MAD)21
Skewness7.9032977
Sum634495
Variance17850.529
MonotonicityNot monotonic
2023-12-11T06:50:16.989845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 707
 
7.1%
4 267
 
2.7%
2 266
 
2.7%
3 251
 
2.5%
6 235
 
2.4%
8 222
 
2.2%
5 205
 
2.1%
7 187
 
1.9%
18 173
 
1.7%
15 173
 
1.7%
Other values (453) 7242
72.4%
ValueCountFrequency (%)
0 707
7.1%
1 113
 
1.1%
2 266
 
2.7%
3 251
 
2.5%
4 267
 
2.7%
5 205
 
2.1%
6 235
 
2.4%
7 187
 
1.9%
8 222
 
2.2%
9 140
 
1.4%
ValueCountFrequency (%)
3349 1
 
< 0.1%
2250 2
< 0.1%
2225 1
 
< 0.1%
1880 1
 
< 0.1%
1730 1
 
< 0.1%
1627 1
 
< 0.1%
1620 1
 
< 0.1%
1583 3
< 0.1%
1580 1
 
< 0.1%
1569 3
< 0.1%

Interactions

2023-12-11T06:50:15.380716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:50:15.211695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:50:15.458671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:50:15.285440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T06:50:17.097934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
집계년도계절구분명조사요일장소구분명시간대입장객수
집계년도1.0000.5150.2550.8390.4330.094
계절구분명0.5151.0000.1250.5420.2140.112
조사요일0.2550.1251.0000.1870.0630.148
장소구분명0.8390.5420.1871.0000.5270.256
시간대0.4330.2140.0630.5271.0000.102
입장객수0.0940.1120.1480.2560.1021.000
2023-12-11T06:50:17.194535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
계절구분명조사요일장소구분명시간대
계절구분명1.0000.0860.2850.118
조사요일0.0861.0000.0770.028
장소구분명0.2850.0771.0000.162
시간대0.1180.0280.1621.000
2023-12-11T06:50:17.284304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
집계년도입장객수계절구분명조사요일장소구분명시간대
집계년도1.000-0.0310.3290.1290.4570.196
입장객수-0.0311.0000.0510.0800.1000.043
계절구분명0.3290.0511.0000.0860.2850.118
조사요일0.1290.0800.0861.0000.0770.028
장소구분명0.4570.1000.2850.0771.0000.162
시간대0.1960.0430.1180.0280.1621.000

Missing values

2023-12-11T06:50:15.568624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:50:15.674213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

집계년도계절구분명조사일자조사요일장소구분명시간대입장객수
10562019여름2019-09-01연주봉 5암문12:00~13:00283
399062012겨울2012-02-20남문15:00~16:0077
296342013가을2013-10-27역사관주차장10:00~11:0026
158422015가을2015-10-24노선버스 9번, 9-1번12:00 ~13:00113
378562012겨울2012-02-25로터리주차장15:00~16:0021
1397620162016-04-22노선버스52번10:00~11:0051
39652018가을2018-10-28로터리주차장14:00~15:0056
267142014겨울2014-02-27남문13:00~14:00145
117292016가을2016-10-16로터리주차장15:00~16:0032
335542013겨울2013-02-13남문11:00~12:00107
집계년도계절구분명조사일자조사요일장소구분명시간대입장객수
459222009가을2009-11-02중앙주차장13:00~14:00121
344352013겨울2013-02-12로터리주차장10:00~11:006
328072013가을2013-10-21제1옹성(7)암문10:00~11:0038
308242013가을2013-10-25하행선주차장11:00~12:000
332582013겨울2013-02-14남문10:00~11:0076
487832009겨울2009-02-28역사관주차장13:00~14:0014
402562010가을2010-10-21수어장대(6)암문13:00~14:0047
86002017가을2017-10-24수어장대암문14:00~15:00135
1400320162016-04-22노선버스52번16:00~17:009
292772014겨울2014-02-22제1옹성(7)암문11:00~12:00242

Duplicate rows

Most frequently occurring

집계년도계절구분명조사일자조사요일장소구분명시간대입장객수# duplicates
16392015가을2015-10-23동문 주차장10:00~11:0046
18682015여름2015-08-12노선버스 15-1번12:00 ~13:00296
1062009겨울2009-02-23남문주차장14:00~15:00445
2282009여름2009-08-10남문갓길 주차장15:00~16:00385
7472012겨울2012-02-25남문12:00~13:005085
8122013가을2013-10-21서문15:00~16:00645
9352013가을2013-10-26남문12:00~13:007725
11622014가을2014-10-28중앙주차장13:00~14:001235
138920142014-05-13노선버스52번11:00~12:00325
140520142014-05-13서문09:00~10:00405