Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells546
Missing cells (%)1.4%
Duplicate rows124
Duplicate rows (%)1.2%
Total size in memory410.2 KiB
Average record size in memory42.0 B

Variable types

Text2
Numeric2

Dataset

Description대구광역시 관내에 있는 공공도서관의 휴관일(2000년부터) 정보입니다.도서관명, 년도, 월, 일자 항목을 포함합니다.
Author대구광역시
URLhttps://www.data.go.kr/data/15089208/fileData.do

Alerts

Dataset has 124 (1.2%) duplicate rowsDuplicates
has 546 (5.5%) missing valuesMissing

Reproduction

Analysis started2023-12-12 15:00:34.068541
Analysis finished2023-12-12 15:00:34.939477
Duration0.87 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct101
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T00:00:35.150072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length10.3686
Min length4

Characters and Unicode

Total characters103686
Distinct characters166
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row달서구 장기동작은도서관
2nd row수성구립 용학도서관
3rd row하빈면작은도서관
4th row대구광역시립달성도서관
5th row대신동작은도서관
ValueCountFrequency (%)
수성구립 936
 
6.0%
달서구립 924
 
5.9%
동구 869
 
5.6%
북구 719
 
4.6%
달서구 713
 
4.6%
작은도서관 648
 
4.2%
서구 459
 
3.0%
어린이도서관 374
 
2.4%
대구광역시립동부도서관 303
 
1.9%
대구광역시립서부도서관 295
 
1.9%
Other values (98) 9315
59.9%
2023-12-13T00:00:35.522422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12500
 
12.1%
9941
 
9.6%
9844
 
9.5%
7703
 
7.4%
5555
 
5.4%
3924
 
3.8%
3471
 
3.3%
3121
 
3.0%
3121
 
3.0%
2775
 
2.7%
Other values (156) 41731
40.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 94915
91.5%
Space Separator 5555
 
5.4%
Decimal Number 2257
 
2.2%
Other Punctuation 451
 
0.4%
Close Punctuation 254
 
0.2%
Open Punctuation 254
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12500
 
13.2%
9941
 
10.5%
9844
 
10.4%
7703
 
8.1%
3924
 
4.1%
3471
 
3.7%
3121
 
3.3%
3121
 
3.3%
2775
 
2.9%
2386
 
2.5%
Other values (142) 36129
38.1%
Decimal Number
ValueCountFrequency (%)
2 1085
48.1%
1 471
20.9%
8 402
 
17.8%
3 144
 
6.4%
4 75
 
3.3%
5 68
 
3.0%
7 12
 
0.5%
Other Punctuation
ValueCountFrequency (%)
· 402
89.1%
. 49
 
10.9%
Close Punctuation
ValueCountFrequency (%)
] 181
71.3%
) 73
28.7%
Open Punctuation
ValueCountFrequency (%)
[ 181
71.3%
( 73
28.7%
Space Separator
ValueCountFrequency (%)
5555
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 94915
91.5%
Common 8771
 
8.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12500
 
13.2%
9941
 
10.5%
9844
 
10.4%
7703
 
8.1%
3924
 
4.1%
3471
 
3.7%
3121
 
3.3%
3121
 
3.3%
2775
 
2.9%
2386
 
2.5%
Other values (142) 36129
38.1%
Common
ValueCountFrequency (%)
5555
63.3%
2 1085
 
12.4%
1 471
 
5.4%
8 402
 
4.6%
· 402
 
4.6%
] 181
 
2.1%
[ 181
 
2.1%
3 144
 
1.6%
4 75
 
0.9%
) 73
 
0.8%
Other values (4) 202
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 94915
91.5%
ASCII 8369
 
8.1%
None 402
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
12500
 
13.2%
9941
 
10.5%
9844
 
10.4%
7703
 
8.1%
3924
 
4.1%
3471
 
3.7%
3121
 
3.3%
3121
 
3.3%
2775
 
2.9%
2386
 
2.5%
Other values (142) 36129
38.1%
ASCII
ValueCountFrequency (%)
5555
66.4%
2 1085
 
13.0%
1 471
 
5.6%
8 402
 
4.8%
] 181
 
2.2%
[ 181
 
2.2%
3 144
 
1.7%
4 75
 
0.9%
) 73
 
0.9%
( 73
 
0.9%
Other values (3) 129
 
1.5%
None
ValueCountFrequency (%)
· 402
100.0%

연도
Real number (ℝ)

Distinct26
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2016.8709
Minimum2000
Maximum2025
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T00:00:35.656213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2000
5-th percentile2005
Q12014
median2019
Q32021
95-th percentile2023
Maximum2025
Range25
Interquartile range (IQR)7

Descriptive statistics

Standard deviation5.6283391
Coefficient of variation (CV)0.0027906293
Kurtosis0.30374284
Mean2016.8709
Median Absolute Deviation (MAD)3
Skewness-1.0533916
Sum20168709
Variance31.678201
MonotonicityNot monotonic
2023-12-13T00:00:35.795584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
2021 1082
10.8%
2020 1045
10.4%
2023 1029
10.3%
2022 1027
10.3%
2018 731
 
7.3%
2019 719
 
7.2%
2017 566
 
5.7%
2016 538
 
5.4%
2015 446
 
4.5%
2014 416
 
4.2%
Other values (16) 2401
24.0%
ValueCountFrequency (%)
2000 11
 
0.1%
2001 115
1.1%
2002 103
1.0%
2003 104
1.0%
2004 159
1.6%
2005 106
1.1%
2006 145
1.5%
2007 164
1.6%
2008 217
2.2%
2009 199
2.0%
ValueCountFrequency (%)
2025 12
 
0.1%
2024 102
 
1.0%
2023 1029
10.3%
2022 1027
10.3%
2021 1082
10.8%
2020 1045
10.4%
2019 719
7.2%
2018 731
7.3%
2017 566
5.7%
2016 538
5.4%


Real number (ℝ)

Distinct12
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.5344
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T00:00:35.916950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q14
median7
Q310
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.4528262
Coefficient of variation (CV)0.52840754
Kurtosis-1.215882
Mean6.5344
Median Absolute Deviation (MAD)3
Skewness-0.013721452
Sum65344
Variance11.922009
MonotonicityNot monotonic
2023-12-13T00:00:36.033482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
10 864
8.6%
6 854
8.5%
12 850
8.5%
9 842
8.4%
5 834
8.3%
3 833
8.3%
8 830
8.3%
1 829
8.3%
11 826
8.3%
7 816
8.2%
Other values (2) 1622
16.2%
ValueCountFrequency (%)
1 829
8.3%
2 811
8.1%
3 833
8.3%
4 811
8.1%
5 834
8.3%
6 854
8.5%
7 816
8.2%
8 830
8.3%
9 842
8.4%
10 864
8.6%
ValueCountFrequency (%)
12 850
8.5%
11 826
8.3%
10 864
8.6%
9 842
8.4%
8 830
8.3%
7 816
8.2%
6 854
8.5%
5 834
8.3%
4 811
8.1%
3 833
8.3%


Text

MISSING 

Distinct1268
Distinct (%)13.4%
Missing546
Missing (%)5.5%
Memory size156.2 KiB
2023-12-13T00:00:36.215469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length113
Median length69
Mean length21.115507
Min length1

Characters and Unicode

Total characters199626
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique553 ?
Unique (%)5.8%

Sample

1st row1, 2, 3, 8, 9, 10, 15, 16, 22, 23, 29, 30
2nd row1, 8, 15, 22, 29
3rd row1, 2, 8, 9, 15, 16, 22, 23, 27, 28, 29, 30
4th row8, 13, 14, 15, 22
5th row5, 6, 12, 13, 19, 20, 26, 27
ValueCountFrequency (%)
1 3047
 
5.1%
25 2466
 
4.1%
5 2445
 
4.1%
9 2392
 
4.0%
3 2373
 
4.0%
6 2373
 
4.0%
15 2318
 
3.9%
2 1920
 
3.2%
11 1907
 
3.2%
12 1903
 
3.2%
Other values (30) 36395
61.1%
2023-12-13T00:00:36.528597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
, 50085
25.1%
50085
25.1%
1 26367
13.2%
2 23959
12.0%
3 8700
 
4.4%
5 7297
 
3.7%
6 5958
 
3.0%
9 5841
 
2.9%
0 5572
 
2.8%
4 5483
 
2.7%
Other values (2) 10279
 
5.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 99456
49.8%
Other Punctuation 50085
25.1%
Space Separator 50085
25.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 26367
26.5%
2 23959
24.1%
3 8700
 
8.7%
5 7297
 
7.3%
6 5958
 
6.0%
9 5841
 
5.9%
0 5572
 
5.6%
4 5483
 
5.5%
7 5171
 
5.2%
8 5108
 
5.1%
Other Punctuation
ValueCountFrequency (%)
, 50085
100.0%
Space Separator
ValueCountFrequency (%)
50085
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 199626
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
, 50085
25.1%
50085
25.1%
1 26367
13.2%
2 23959
12.0%
3 8700
 
4.4%
5 7297
 
3.7%
6 5958
 
3.0%
9 5841
 
2.9%
0 5572
 
2.8%
4 5483
 
2.7%
Other values (2) 10279
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 199626
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 50085
25.1%
50085
25.1%
1 26367
13.2%
2 23959
12.0%
3 8700
 
4.4%
5 7297
 
3.7%
6 5958
 
3.0%
9 5841
 
2.9%
0 5572
 
2.8%
4 5483
 
2.7%
Other values (2) 10279
 
5.1%

Interactions

2023-12-13T00:00:34.551686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:00:34.365739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:00:34.645876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:00:34.465118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:00:36.604955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도
연도1.0000.000
0.0001.000
2023-12-13T00:00:36.672072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도
연도1.000-0.006
-0.0061.000

Missing values

2023-12-13T00:00:34.808557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:00:34.900510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

도서관명연도
9099달서구 장기동작은도서관2022101, 2, 3, 8, 9, 10, 15, 16, 22, 23, 29, 30
8923수성구립 용학도서관202281, 8, 15, 22, 29
3907하빈면작은도서관201711, 2, 8, 9, 15, 16, 22, 23, 27, 28, 29, 30
1116대구광역시립달성도서관200898, 13, 14, 15, 22
9224대신동작은도서관2022115, 6, 12, 13, 19, 20, 26, 27
3149대구광역시립서부도서관2015810, 14, 15, 24
4742서구 어린이도서관201842, 9, 16, 23, 30
2917다사읍서재작은도서관201521, 7, 8, 14, 15, 18, 19, 20, 21, 22, 28
7205동구 안심도서관202111, 4, 11, 18, 25
3814달서구립 본리도서관2016114, 11, 18, 25
도서관명연도
4918논공읍작은도서관201871, 8, 15, 22, 29
168대구2·28기념학생도서관2002501, 05, 15, 19
3991수성구립 용학도서관201726, 13, 20, 27
4626북구 태전도서관201825, 15, 16, 17, 19
8197옥포읍작은도서관2021124, 5, 11, 12, 18, 19, 25, 26
7117동구 신천도서관2020127, 14, 21, 25, 28
7039북구 서변동작은도서관20201213, 25
7841동구 신암5동 작은도서관202181, 7, 8, 14, 15, 21, 22, 28, 29
7269동구 방촌동 작은도서관202126, 7, 11, 12, 13, 14, 20, 21, 27, 28
4139대구광역시립서부도서관201753, 5, 8, 9, 22

Duplicate rows

Most frequently occurring

도서관명연도# duplicates
61대구광역시립동부도서관2007101, 3, 153
62대구광역시립동부도서관2007115, 193
112대구광역시립수성도서관200811, 14, 283
114대구광역시립수성도서관200831, 10, 243
115대구광역시립수성도서관200849, 14, 283
117대구광역시립수성도서관200866, 9, 233
118대구광역시립수성도서관2008714, 283
119대구광역시립수성도서관2008811, 15, 253
120대구광역시립수성도서관200898, 13, 14, 15, 223
121대구광역시립수성도서관2008103, 13, 273