Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells66
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory722.7 KiB
Average record size in memory74.0 B

Variable types

Text2
Numeric2
Boolean3
DateTime1

Dataset

Description경기도 광주시 시립도서관 홈페이지의 게시판 내용에 대한 데이터로 제목, 조회수, 공개여부, 등록일자 등의 항목을 제공합니다.
Author경기도 광주시
URLhttps://www.data.go.kr/data/15121574/fileData.do

Alerts

공지여부 is highly imbalanced (97.0%)Imbalance
게시물번호 has unique valuesUnique
조회수 has 336 (3.4%) zerosZeros

Reproduction

Analysis started2023-12-12 14:22:39.216364
Analysis finished2023-12-12 14:22:41.404888
Duration2.19 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

게시물번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T23:22:41.717332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length5
Mean length4.8689
Min length4

Characters and Unicode

Total characters48689
Distinct characters87
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st row9709
2nd row50622
3rd row12313
4th row11864
5th row9441
ValueCountFrequency (%)
3
 
< 0.1%
너무 2
 
< 0.1%
불편합니다 2
 
< 0.1%
문제가 2
 
< 0.1%
2
 
< 0.1%
14618 1
 
< 0.1%
11623 1
 
< 0.1%
9735 1
 
< 0.1%
50446 1
 
< 0.1%
50637 1
 
< 0.1%
Other values (10013) 10013
99.8%
2023-12-12T23:22:42.264352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 9968
20.5%
5 5681
11.7%
4 4937
10.1%
2 4665
9.6%
3 4540
9.3%
0 4435
9.1%
9 3839
 
7.9%
8 3785
 
7.8%
6 3634
 
7.5%
7 3026
 
6.2%
Other values (77) 179
 
0.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 48510
99.6%
Other Letter 145
 
0.3%
Space Separator 29
 
0.1%
Other Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
5.5%
6
 
4.1%
5
 
3.4%
5
 
3.4%
5
 
3.4%
4
 
2.8%
4
 
2.8%
3
 
2.1%
3
 
2.1%
3
 
2.1%
Other values (63) 99
68.3%
Decimal Number
ValueCountFrequency (%)
1 9968
20.5%
5 5681
11.7%
4 4937
10.2%
2 4665
9.6%
3 4540
9.4%
0 4435
9.1%
9 3839
 
7.9%
8 3785
 
7.8%
6 3634
 
7.5%
7 3026
 
6.2%
Other Punctuation
ValueCountFrequency (%)
. 3
60.0%
, 1
 
20.0%
" 1
 
20.0%
Space Separator
ValueCountFrequency (%)
29
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 48544
99.7%
Hangul 145
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
 
5.5%
6
 
4.1%
5
 
3.4%
5
 
3.4%
5
 
3.4%
4
 
2.8%
4
 
2.8%
3
 
2.1%
3
 
2.1%
3
 
2.1%
Other values (63) 99
68.3%
Common
ValueCountFrequency (%)
1 9968
20.5%
5 5681
11.7%
4 4937
10.2%
2 4665
9.6%
3 4540
9.4%
0 4435
9.1%
9 3839
 
7.9%
8 3785
 
7.8%
6 3634
 
7.5%
7 3026
 
6.2%
Other values (4) 34
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 48544
99.7%
Hangul 145
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 9968
20.5%
5 5681
11.7%
4 4937
10.2%
2 4665
9.6%
3 4540
9.4%
0 4435
9.1%
9 3839
 
7.9%
8 3785
 
7.8%
6 3634
 
7.5%
7 3026
 
6.2%
Other values (4) 34
 
0.1%
Hangul
ValueCountFrequency (%)
8
 
5.5%
6
 
4.1%
5
 
3.4%
5
 
3.4%
5
 
3.4%
4
 
2.8%
4
 
2.8%
3
 
2.1%
3
 
2.1%
3
 
2.1%
Other values (63) 99
68.3%

게시판번호
Real number (ℝ)

Distinct18
Distinct (%)0.2%
Missing8
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean13.190753
Minimum1
Maximum611
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T23:22:42.404234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q34
95-th percentile81
Maximum611
Range610
Interquartile range (IQR)3

Descriptive statistics

Standard deviation44.897241
Coefficient of variation (CV)3.4036906
Kurtosis19.987231
Mean13.190753
Median Absolute Deviation (MAD)1
Skewness4.4062651
Sum131802
Variance2015.7622
MonotonicityNot monotonic
2023-12-12T23:22:42.564703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
2 3797
38.0%
1 3096
31.0%
4 2378
23.8%
221 229
 
2.3%
81 167
 
1.7%
62 89
 
0.9%
161 82
 
0.8%
241 65
 
0.7%
5 21
 
0.2%
281 18
 
0.2%
Other values (8) 50
 
0.5%
ValueCountFrequency (%)
1 3096
31.0%
2 3797
38.0%
3 8
 
0.1%
4 2378
23.8%
5 21
 
0.2%
61 7
 
0.1%
62 89
 
0.9%
81 167
 
1.7%
121 4
 
< 0.1%
161 82
 
0.8%
ValueCountFrequency (%)
611 1
 
< 0.1%
301 6
 
0.1%
281 18
 
0.2%
261 1
 
< 0.1%
241 65
 
0.7%
221 229
2.3%
201 7
 
0.1%
181 16
 
0.2%
161 82
 
0.8%
121 4
 
< 0.1%

제목
Text

Distinct7703
Distinct (%)77.1%
Missing9
Missing (%)0.1%
Memory size156.2 KiB
2023-12-12T23:22:42.878724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length93
Median length56
Mean length19.929436
Min length1

Characters and Unicode

Total characters199115
Distinct characters991
Distinct categories16 ?
Distinct scripts4 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6979 ?
Unique (%)69.9%

Sample

1st row3월 1주차 시립도서관 식당 주간식단표
2nd row공립 작은도서관 기간제 근로자(도서정리원) 채용 공고
3rd row학교변경요청
4th row봉사취소신청
5th row[광주시청] 3월 시민 무료 정보화 교육
ValueCountFrequency (%)
안내 1064
 
2.8%
주말프로그램 571
 
1.5%
도서관 440
 
1.2%
프로그램 365
 
1.0%
읽어주는 341
 
0.9%
오포도서관 303
 
0.8%
자원봉사 302
 
0.8%
초월도서관 301
 
0.8%
취소 295
 
0.8%
시립도서관 294
 
0.8%
Other values (9193) 33846
88.8%
2023-12-12T23:22:43.426713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28433
 
14.3%
4796
 
2.4%
1 4727
 
2.4%
4180
 
2.1%
2 4050
 
2.0%
3777
 
1.9%
3457
 
1.7%
] 2914
 
1.5%
[ 2914
 
1.5%
2846
 
1.4%
Other values (981) 137021
68.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 132082
66.3%
Space Separator 28433
 
14.3%
Decimal Number 17543
 
8.8%
Other Punctuation 6909
 
3.5%
Close Punctuation 5472
 
2.7%
Open Punctuation 5471
 
2.7%
Lowercase Letter 1302
 
0.7%
Uppercase Letter 642
 
0.3%
Math Symbol 575
 
0.3%
Dash Punctuation 563
 
0.3%
Other values (6) 123
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4796
 
3.6%
4180
 
3.2%
3777
 
2.9%
3457
 
2.6%
2846
 
2.2%
2779
 
2.1%
2446
 
1.9%
2445
 
1.9%
2331
 
1.8%
2243
 
1.7%
Other values (872) 100782
76.3%
Lowercase Letter
ValueCountFrequency (%)
t 302
23.2%
e 185
14.2%
s 163
12.5%
o 119
 
9.1%
i 55
 
4.2%
a 47
 
3.6%
l 45
 
3.5%
n 38
 
2.9%
k 36
 
2.8%
h 32
 
2.5%
Other values (15) 280
21.5%
Uppercase Letter
ValueCountFrequency (%)
T 84
13.1%
B 78
12.1%
A 60
 
9.3%
P 54
 
8.4%
I 47
 
7.3%
S 39
 
6.1%
U 37
 
5.8%
E 33
 
5.1%
L 31
 
4.8%
G 28
 
4.4%
Other values (15) 151
23.5%
Other Punctuation
ValueCountFrequency (%)
. 2822
40.8%
, 1331
19.3%
/ 859
 
12.4%
· 606
 
8.8%
' 546
 
7.9%
! 420
 
6.1%
" 195
 
2.8%
: 63
 
0.9%
& 44
 
0.6%
; 14
 
0.2%
Other values (6) 9
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 4727
26.9%
2 4050
23.1%
0 2158
12.3%
3 1243
 
7.1%
9 934
 
5.3%
4 933
 
5.3%
8 930
 
5.3%
5 904
 
5.2%
7 858
 
4.9%
6 806
 
4.6%
Math Symbol
ValueCountFrequency (%)
~ 366
63.7%
> 96
 
16.7%
< 91
 
15.8%
+ 16
 
2.8%
3
 
0.5%
= 2
 
0.3%
1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
] 2914
53.3%
) 2448
44.7%
97
 
1.8%
10
 
0.2%
2
 
< 0.1%
1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
[ 2914
53.3%
( 2447
44.7%
97
 
1.8%
10
 
0.2%
2
 
< 0.1%
1
 
< 0.1%
Other Symbol
ValueCountFrequency (%)
6
31.6%
5
26.3%
4
21.1%
4
21.1%
Modifier Symbol
ValueCountFrequency (%)
^ 40
97.6%
¸ 1
 
2.4%
Final Punctuation
ValueCountFrequency (%)
6
60.0%
4
40.0%
Initial Punctuation
ValueCountFrequency (%)
6
60.0%
4
40.0%
Space Separator
ValueCountFrequency (%)
28433
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 563
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 42
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 132060
66.3%
Common 65089
32.7%
Latin 1944
 
1.0%
Han 22
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4796
 
3.6%
4180
 
3.2%
3777
 
2.9%
3457
 
2.6%
2846
 
2.2%
2779
 
2.1%
2446
 
1.9%
2445
 
1.9%
2331
 
1.8%
2243
 
1.7%
Other values (861) 100760
76.3%
Common
ValueCountFrequency (%)
28433
43.7%
1 4727
 
7.3%
2 4050
 
6.2%
] 2914
 
4.5%
[ 2914
 
4.5%
. 2822
 
4.3%
) 2448
 
3.8%
( 2447
 
3.8%
0 2158
 
3.3%
, 1331
 
2.0%
Other values (49) 10845
 
16.7%
Latin
ValueCountFrequency (%)
t 302
 
15.5%
e 185
 
9.5%
s 163
 
8.4%
o 119
 
6.1%
T 84
 
4.3%
B 78
 
4.0%
A 60
 
3.1%
i 55
 
2.8%
P 54
 
2.8%
I 47
 
2.4%
Other values (40) 797
41.0%
Han
ValueCountFrequency (%)
9
40.9%
3
 
13.6%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 132010
66.3%
ASCII 66160
33.2%
None 827
 
0.4%
Compat Jamo 50
 
< 0.1%
Punctuation 23
 
< 0.1%
CJK 22
 
< 0.1%
Geometric Shapes 10
 
< 0.1%
Misc Symbols 9
 
< 0.1%
Arrows 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
28433
43.0%
1 4727
 
7.1%
2 4050
 
6.1%
] 2914
 
4.4%
[ 2914
 
4.4%
. 2822
 
4.3%
) 2448
 
3.7%
( 2447
 
3.7%
0 2158
 
3.3%
, 1331
 
2.0%
Other values (77) 11916
18.0%
Hangul
ValueCountFrequency (%)
4796
 
3.6%
4180
 
3.2%
3777
 
2.9%
3457
 
2.6%
2846
 
2.2%
2779
 
2.1%
2446
 
1.9%
2445
 
1.9%
2331
 
1.8%
2243
 
1.7%
Other values (849) 100710
76.3%
None
ValueCountFrequency (%)
· 606
73.3%
97
 
11.7%
97
 
11.7%
10
 
1.2%
10
 
1.2%
2
 
0.2%
2
 
0.2%
¸ 1
 
0.1%
1
 
0.1%
1
 
0.1%
Compat Jamo
ValueCountFrequency (%)
24
48.0%
8
 
16.0%
5
 
10.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
1
 
2.0%
1
 
2.0%
1
 
2.0%
Other values (2) 2
 
4.0%
CJK
ValueCountFrequency (%)
9
40.9%
3
 
13.6%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Geometric Shapes
ValueCountFrequency (%)
6
60.0%
4
40.0%
Punctuation
ValueCountFrequency (%)
6
26.1%
6
26.1%
4
17.4%
4
17.4%
2
 
8.7%
1
 
4.3%
Misc Symbols
ValueCountFrequency (%)
5
55.6%
4
44.4%
Arrows
ValueCountFrequency (%)
3
75.0%
1
 
25.0%

조회수
Real number (ℝ)

ZEROS 

Distinct2390
Distinct (%)23.9%
Missing9
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean814.43869
Minimum0
Maximum24937
Zeros336
Zeros (%)3.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T23:22:43.597755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q161
median657
Q31270
95-th percentile2170
Maximum24937
Range24937
Interquartile range (IQR)1209

Descriptive statistics

Standard deviation858.27143
Coefficient of variation (CV)1.0538196
Kurtosis87.382168
Mean814.43869
Median Absolute Deviation (MAD)604
Skewness4.7609284
Sum8137057
Variance736629.85
MonotonicityNot monotonic
2023-12-12T23:22:43.746024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 336
 
3.4%
1 261
 
2.6%
2 170
 
1.7%
3 160
 
1.6%
5 138
 
1.4%
4 108
 
1.1%
8 100
 
1.0%
6 96
 
1.0%
7 91
 
0.9%
9 84
 
0.8%
Other values (2380) 8447
84.5%
ValueCountFrequency (%)
0 336
3.4%
1 261
2.6%
2 170
1.7%
3 160
1.6%
4 108
 
1.1%
5 138
1.4%
6 96
 
1.0%
7 91
 
0.9%
8 100
 
1.0%
9 84
 
0.8%
ValueCountFrequency (%)
24937 1
< 0.1%
18124 1
< 0.1%
11273 1
< 0.1%
9919 1
< 0.1%
9014 1
< 0.1%
8042 1
< 0.1%
7960 1
< 0.1%
7810 1
< 0.1%
7750 1
< 0.1%
7560 1
< 0.1%
Distinct2
Distinct (%)< 0.1%
Missing10
Missing (%)0.1%
Memory size97.7 KiB
True
8073 
False
1917 
(Missing)
 
10
ValueCountFrequency (%)
True 8073
80.7%
False 1917
 
19.2%
(Missing) 10
 
0.1%
2023-12-12T23:22:43.858631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

공지여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing10
Missing (%)0.1%
Memory size97.7 KiB
False
9959 
True
 
31
(Missing)
 
10
ValueCountFrequency (%)
False 9959
99.6%
True 31
 
0.3%
(Missing) 10
 
0.1%
2023-12-12T23:22:43.940135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct3430
Distinct (%)34.3%
Missing10
Missing (%)0.1%
Memory size156.2 KiB
Minimum2010-01-08 00:00:00
Maximum2023-08-25 00:00:00
2023-12-12T23:22:44.092123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:22:44.244230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)< 0.1%
Missing10
Missing (%)0.1%
Memory size97.7 KiB
False
7940 
True
2050 
(Missing)
 
10
ValueCountFrequency (%)
False 7940
79.4%
True 2050
 
20.5%
(Missing) 10
 
0.1%
2023-12-12T23:22:44.347050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2023-12-12T23:22:40.452852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:22:40.277806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:22:40.544048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:22:40.371022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:22:44.414474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
게시판번호조회수공개비공개여부공지여부삭제여부
게시판번호1.0000.0000.1060.0000.069
조회수0.0001.0000.0810.0000.027
공개비공개여부0.1060.0811.0000.0280.258
공지여부0.0000.0000.0281.0000.126
삭제여부0.0690.0270.2580.1261.000
2023-12-12T23:22:44.528086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공개비공개여부삭제여부공지여부
공개비공개여부1.0000.1660.018
삭제여부0.1661.0000.080
공지여부0.0180.0801.000
2023-12-12T23:22:44.643140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
게시판번호조회수공개비공개여부공지여부삭제여부
게시판번호1.0000.1260.1200.0000.077
조회수0.1261.0000.0870.0000.029
공개비공개여부0.1200.0871.0000.0180.166
공지여부0.0000.0000.0181.0000.080
삭제여부0.0770.0290.1660.0801.000

Missing values

2023-12-12T23:22:40.983222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:22:41.122463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T23:22:41.299845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

게시물번호게시판번호제목조회수공개비공개여부공지여부등록날짜삭제여부
3901970913월 1주차 시립도서관 식당 주간식단표348YN2017-03-02Y
9396506221공립 작은도서관 기간제 근로자(도서정리원) 채용 공고3024YN2019-12-04N
2864123132학교변경요청19NN2016-03-02N
2218118642봉사취소신청10NN2015-03-13N
395594411[광주시청] 3월 시민 무료 정보화 교육599YN2016-02-04Y
2121118372봉사취소10NN2015-02-18N
2619117102곤지암도서관 봉사취소 요청563YN2014-12-13N
10426542332빌려온 책이 나의 대출내역에 안떠요2NN2021-07-12N
7475254621test0YN2018-05-24Y
343297481개관시간연장사업 기간제 근로자 채용(시립)586YN2017-05-11Y
게시물번호게시판번호제목조회수공개비공개여부공지여부등록날짜삭제여부
2145116042자원봉사 신청날짜22NN2014-08-20N
92785591문해교육사(3급) 양성과정 교육생 모집 안내1341YN2011-03-25Y
64151814113월 신규개설 재능나눔 프로그램 안내661YN2018-02-06N
1131655563281내가 만든 그림책展123YN2022-04-07N
7174149144[초월]꿈다락토요문화학교 고학년(11.14)1392YN2015-11-16N
4460103182자원봉사 취소1991YN2011-06-20N
2479121972자원봉사취소12NN2015-12-26N
7797262811중앙도서관 식당 주간식단표(6.5~6.10)712YN2018-06-05N
2713122792자원봉사취소(초월)12NN2016-01-30N
83824158113월 오포도서관 문화가있는날 영화상영 안내(3/27)380YN2019-03-04N