Overview

Dataset statistics

Number of variables: 8
Number of observations: 1772
Missing cells: 1
Missing cells (%): < 0.1%
Duplicate rows: 1
Duplicate rows (%): 0.1%
Total size in memory: 114.3 KiB
Average record size in memory: 66.1 B
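The percentages and the average record size follow from the raw counts. The sketch below recomputes them from the figures listed above, assuming that the cell count is variables times observations and that 1 KiB is 1024 bytes; the variable names are illustrative only.

```python
# Figures copied from the dataset statistics above.
n_variables = 8
n_observations = 1772
missing_cells = 1
duplicate_rows = 1
total_size_kib = 114.3

total_cells = n_variables * n_observations                  # 14176 cells in total
missing_pct = 100 * missing_cells / total_cells             # well below 0.1 %
duplicate_pct = 100 * duplicate_rows / n_observations       # rounds to 0.1 %
avg_record_bytes = total_size_kib * 1024 / n_observations   # rounds to 66.1 B

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.4f}%")
print(f"duplicate rows: {duplicate_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Because the memory total is itself rounded to one decimal place, the recomputed record size only agrees with the reported value after rounding.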

Variable types

Numeric: 2
Text: 1
DateTime: 1
Categorical: 4

Dataset

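Description: Information on the ebook table (publication list) in the Yangju City publication management system: serial number (순번), title (제목), production date (제작일), category (분류), page count (페이지수), managing agency name (관리기관명), and managing agency telephone number (관리기관 전화번호). Note: 순번 is not a consecutive running number; some numbers are absent because records were modified or deleted after publications were uploaded.
URL: https://www.data.go.kr/data/15063222/fileData.do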
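The note about absent serial numbers can be quantified from figures reported later in this profile (minimum 16, maximum 3094, 1771 distinct values in 1772 rows). The sketch below is a minimal illustration; `missing_ids` is a hypothetical helper, and the toy list at the end is made up rather than taken from the dataset.

```python
def missing_ids(ids):
    """Return the integers between min(ids) and max(ids) that never occur in ids."""
    present = set(ids)
    return [i for i in range(min(present), max(present) + 1) if i not in present]

# Figures reported for 순번 in this profile.
min_id, max_id = 16, 3094
n_rows, n_distinct = 1772, 1771

span = max_id - min_id + 1           # 3079 possible numbers in the closed range
unused = span - n_distinct           # 1308 numbers in the range never appear
repeated_rows = n_rows - n_distinct  # 1 row reuses an existing number

print(f"{unused} of {span} numbers between {min_id} and {max_id} are unused")
print(f"{repeated_rows} row repeats an existing number")

# Toy list (hypothetical values, not taken from the dataset).
print(missing_ids([16, 17, 19, 22, 22]))  # [18, 20, 21]
```

With the full column loaded, passing its values to `missing_ids` would list the absent numbers themselves.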

Alerts

관리기관명 has constant value "양주시 자치행정과" (Constant)
데이터기준일자 has constant value "2023-08-07" (Constant)
Dataset has 1 (0.1%) duplicate row (Duplicates)
순번 is highly overall correlated with 관리기관 전화번호 (High correlation)
관리기관 전화번호 is highly overall correlated with 순번 (High correlation)
관리기관 전화번호 is highly imbalanced (57.8%) (Imbalance)
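The 57.8% imbalance figure can be reproduced from the two telephone-number counts reported further down (1620 and 152). The sketch assumes the score is one minus the Shannon entropy of the class distribution divided by its maximum possible entropy; that assumption matches the reported value here, but the function itself is illustrative.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a perfectly balanced column, near 1 for a skewed one."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)

# Counts of the two values of 관리기관 전화번호 in this profile.
score = imbalance_score([1620, 152])
print(f"{score:.1%}")
```

A column split evenly between its classes scores 0, so the score grows as one value dominates.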

Reproduction

Analysis started: 2023-12-12 06:44:22.099368
Analysis finished: 2023-12-12 06:44:23.340109
Duration: 1.24 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION 

Distinct: 1771
Distinct (%): 99.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1743.5824
Minimum: 16
Maximum: 3094
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 15.7 KiB

Quantile statistics

Minimum: 16
5-th percentile: 119.55
Q1: 977.5
Median: 1852.5
Q3: 2609.75
95-th percentile: 3006.45
Maximum: 3094
Range: 3078
Interquartile range (IQR): 1632.25

Descriptive statistics

Standard deviation: 952.76247
Coefficient of variation (CV): 0.5464396
Kurtosis: -1.114178
Mean: 1743.5824
Median Absolute Deviation (MAD): 789
Skewness: -0.3602071
Sum: 3089628
Variance: 907756.33
Monotonicity: Increasing
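Several of the statistics for 순번 are derived from one another. The sketch below re-derives them from the reported values using the standard definitions (CV as standard deviation over mean, variance as the squared standard deviation, IQR as Q3 minus Q1); small differences from the report come from the rounding of the inputs.

```python
# Values reported for 순번 in this profile.
n = 1772
mean = 1743.5824
std = 952.76247
minimum, maximum = 16, 3094
q1, q3 = 977.5, 2609.75

cv = std / mean                   # coefficient of variation
variance = std ** 2               # variance is the squared standard deviation
value_range = maximum - minimum   # range
iqr = q3 - q1                     # interquartile range
total = mean * n                  # sum of all values

print(f"CV: {cv:.7f}")
print(f"Variance: {variance:.2f}")
print(f"Range: {value_range}")
print(f"IQR: {iqr}")
print(f"Sum: {total:.0f}")
```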
Histogram with fixed size bins (bins=50)
Common Values
Value | Count | Frequency (%)
3022 | 2 | 0.1%
16 | 1 | 0.1%
2416 | 1 | 0.1%
2433 | 1 | 0.1%
2432 | 1 | 0.1%
2431 | 1 | 0.1%
2425 | 1 | 0.1%
2424 | 1 | 0.1%
2423 | 1 | 0.1%
2422 | 1 | 0.1%
Other values (1761) | 1761 | 99.4%

Minimum 10 values
Value | Count | Frequency (%)
16 | 1 | 0.1%
17 | 1 | 0.1%
18 | 1 | 0.1%
19 | 1 | 0.1%
20 | 1 | 0.1%
21 | 1 | 0.1%
22 | 1 | 0.1%
23 | 1 | 0.1%
24 | 1 | 0.1%
25 | 1 | 0.1%

Maximum 10 values
Value | Count | Frequency (%)
3094 | 1 | 0.1%
3093 | 1 | 0.1%
3092 | 1 | 0.1%
3091 | 1 | 0.1%
3090 | 1 | 0.1%
3089 | 1 | 0.1%
3088 | 1 | 0.1%
3087 | 1 | 0.1%
3086 | 1 | 0.1%
3085 | 1 | 0.1%

제목
Text

Distinct: 1675
Distinct (%): 94.5%
Missing: 0
Missing (%): 0.0%
Memory size: 14.0 KiB

Length

Max length: 44
Median length: 31
Mean length: 10.480248
Min length: 2

Characters and Unicode

Total characters: 18571
Distinct characters: 286
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
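As a small illustration of that idea, the sketch below counts Unicode general categories with Python's standard `unicodedata` module for four titles that appear in the sample rows of this profile. Python returns two-letter codes (`Nd`, `Lo`, `Lu`, `Ll`, `Zs`, `Pd`, `Pc`), which correspond to the long names used in this report (Decimal Number, Other Letter, Uppercase Letter, Lowercase Letter, Space Separator, Dash Punctuation, Connector Punctuation). The helper name `category_counts` is illustrative.

```python
import unicodedata
from collections import Counter

# Four titles taken from the sample rows of this profile.
titles = ["green1", "2022 - 20호", "GREEN_328", "의정광장65호"]

def category_counts(strings):
    """Count the Unicode general category of every character in the given strings."""
    counts = Counter()
    for text in strings:
        for ch in text:
            counts[unicodedata.category(ch)] += 1
    return counts

counts = category_counts(titles)
for code, count in counts.most_common():
    print(code, count)
```

Even in this tiny sample, decimal digits are the most frequent category, in line with the full column.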

Unique

Unique: 1605
Unique (%): 90.6%

Sample

1st row: green1
2nd row: green2
3rd row: green3
4th row: green4
5th row: green5
Value | Count | Frequency (%)
(not captured) | 640 | 18.6%
2018 | 75 | 2.2%
2017 | 68 | 2.0%
2016 | 67 | 1.9%
2015 | 63 | 1.8%
2020 | 61 | 1.8%
2013 | 60 | 1.7%
2019 | 57 | 1.7%
2014 | 56 | 1.6%
2021 | 54 | 1.6%
Other values (1207) | 2244 | 65.1%

Most occurring characters

Value | Count | Frequency (%)
2 | 1834 | 9.9%
0 | 1681 | 9.1%
(space) | 1673 | 9.0%
1 | 1456 | 7.8%
(not captured) | 1066 | 5.7%
- | 1058 | 5.7%
3 | 469 | 2.5%
4 | 449 | 2.4%
(not captured) | 418 | 2.3%
E | 412 | 2.2%
Other values (276) | 8055 | 43.4%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 7647 | 41.2%
Other Letter | 5373 | 28.9%
Space Separator | 1673 | 9.0%
Dash Punctuation | 1058 | 5.7%
Uppercase Letter | 1044 | 5.6%
Lowercase Letter | 633 | 3.4%
Open Punctuation | 361 | 1.9%
Close Punctuation | 358 | 1.9%
Connector Punctuation | 317 | 1.7%
Other Punctuation | 105 | 0.6%
Other values (2) | 2 | < 0.1%

Most frequent character per category

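Other Letter
Value | Count | Frequency (%)
(not captured) | 1066 | 19.8%
(not captured) | 418 | 7.8%
(not captured) | 394 | 7.3%
(not captured) | 372 | 6.9%
(not captured) | 352 | 6.6%
(not captured) | 177 | 3.3%
(not captured) | 129 | 2.4%
(not captured) | 121 | 2.3%
(not captured) | 102 | 1.9%
(not captured) | 83 | 1.5%
Other values (234) | 2159 | 40.2%
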
Lowercase Letter
Value | Count | Frequency (%)
e | 250 | 39.5%
r | 125 | 19.7%
n | 125 | 19.7%
g | 119 | 18.8%
d | 4 | 0.6%
a | 2 | 0.3%
p | 2 | 0.3%
f | 2 | 0.3%
o | 1 | 0.2%
l | 1 | 0.2%
Other values (2) | 2 | 0.3%

Decimal Number
Value | Count | Frequency (%)
2 | 1834 | 24.0%
0 | 1681 | 22.0%
1 | 1456 | 19.0%
3 | 469 | 6.1%
4 | 449 | 5.9%
9 | 405 | 5.3%
5 | 354 | 4.6%
7 | 349 | 4.6%
8 | 348 | 4.6%
6 | 302 | 3.9%

Uppercase Letter
Value | Count | Frequency (%)
E | 412 | 39.5%
G | 213 | 20.4%
N | 207 | 19.8%
R | 205 | 19.6%
C | 3 | 0.3%
P | 2 | 0.2%
T | 1 | 0.1%
V | 1 | 0.1%

Other Punctuation
Value | Count | Frequency (%)
. | 102 | 97.1%
# | 2 | 1.9%
· | 1 | 1.0%

Open Punctuation
Value | Count | Frequency (%)
( | 359 | 99.4%
[ | 2 | 0.6%

Close Punctuation
Value | Count | Frequency (%)
) | 356 | 99.4%
] | 2 | 0.6%

Space Separator
Value | Count | Frequency (%)
(space) | 1673 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1058 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 317 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not captured) | 1 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 11521 | 62.0%
Hangul | 5373 | 28.9%
Latin | 1677 | 9.0%

Most frequent character per script

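Hangul
Value | Count | Frequency (%)
(not captured) | 1066 | 19.8%
(not captured) | 418 | 7.8%
(not captured) | 394 | 7.3%
(not captured) | 372 | 6.9%
(not captured) | 352 | 6.6%
(not captured) | 177 | 3.3%
(not captured) | 129 | 2.4%
(not captured) | 121 | 2.3%
(not captured) | 102 | 1.9%
(not captured) | 83 | 1.5%
Other values (234) | 2159 | 40.2%
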
Common
Value | Count | Frequency (%)
2 | 1834 | 15.9%
0 | 1681 | 14.6%
(space) | 1673 | 14.5%
1 | 1456 | 12.6%
- | 1058 | 9.2%
3 | 469 | 4.1%
4 | 449 | 3.9%
9 | 405 | 3.5%
( | 359 | 3.1%
) | 356 | 3.1%
Other values (12) | 1781 | 15.5%

Latin
Value | Count | Frequency (%)
E | 412 | 24.6%
e | 250 | 14.9%
G | 213 | 12.7%
N | 207 | 12.3%
R | 205 | 12.2%
r | 125 | 7.5%
n | 125 | 7.5%
g | 119 | 7.1%
d | 4 | 0.2%
C | 3 | 0.2%
Other values (10) | 14 | 0.8%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 13196 | 71.1%
Hangul | 5373 | 28.9%
Misc Symbols | 1 | < 0.1%
None | 1 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
2 | 1834 | 13.9%
0 | 1681 | 12.7%
(space) | 1673 | 12.7%
1 | 1456 | 11.0%
- | 1058 | 8.0%
3 | 469 | 3.6%
4 | 449 | 3.4%
E | 412 | 3.1%
9 | 405 | 3.1%
( | 359 | 2.7%
Other values (30) | 3400 | 25.8%

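Hangul
Value | Count | Frequency (%)
(not captured) | 1066 | 19.8%
(not captured) | 418 | 7.8%
(not captured) | 394 | 7.3%
(not captured) | 372 | 6.9%
(not captured) | 352 | 6.6%
(not captured) | 177 | 3.3%
(not captured) | 129 | 2.4%
(not captured) | 121 | 2.3%
(not captured) | 102 | 1.9%
(not captured) | 83 | 1.5%
Other values (234) | 2159 | 40.2%
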
Misc Symbols
Value | Count | Frequency (%)
(not captured) | 1 | 100.0%

None
Value | Count | Frequency (%)
· | 1 | 100.0%

제작일
DateTime

Distinct: 1144
Distinct (%): 64.6%
Missing: 0
Missing (%): 0.0%
Memory size: 14.0 KiB
Minimum: 1994-01-01 00:00:00
Maximum: 2022-07-01 00:00:00
Histogram with fixed size bins (bins=50)

분류
Categorical

Distinct: 21
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 14.0 KiB
양주시보: 913
함께그린양주: 330
반상회보: 82
양주시 건설자재 모음집: 74
예산서: 70
Other values (16): 303

Length

Max length: 12
Median length: 4
Mean length: 4.8922122
Min length: 2

Unique

Unique: 4
Unique (%): 0.2%

Sample

1st row: 함께그린양주
2nd row: 함께그린양주
3rd row: 함께그린양주
4th row: 함께그린양주
5th row: 함께그린양주

Common Values

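Value | Count | Frequency (%)
양주시보 (Yangju city bulletin) | 913 | 51.5%
함께그린양주 | 330 | 18.6%
반상회보 (neighborhood meeting bulletin) | 82 | 4.6%
양주시 건설자재 모음집 (Yangju construction materials collection) | 74 | 4.2%
예산서 (budget document) | 70 | 4.0%
의정소식지 (council news bulletin) | 68 | 3.8%
업무보고/연설문 (work reports/speeches) | 63 | 3.6%
기타 (other) | 53 | 3.0%
통계연보 (statistical yearbook) | 49 | 2.8%
지구단위계획 (district unit plan) | 29 | 1.6%
Other values (11) | 41 | 2.3%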

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
양주시보 | 913 | 46.7%
함께그린양주 | 330 | 16.9%
양주시 | 86 | 4.4%
반상회보 | 82 | 4.2%
건설자재 | 74 | 3.8%
모음집 | 74 | 3.8%
예산서 | 70 | 3.6%
의정소식지 | 68 | 3.5%
업무보고/연설문 | 63 | 3.2%
기타 | 53 | 2.7%
Other values (18) | 143 | 7.3%

페이지수
Real number (ℝ)

Distinct: 261
Distinct (%): 14.7%
Missing: 1
Missing (%): 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 60.508187
Minimum: 1
Maximum: 965
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 15.7 KiB

Quantile statistics

Minimum: 1
5-th percentile: 4
Q1: 12
Median: 26
Q3: 59
95-th percentile: 256
Maximum: 965
Range: 964
Interquartile range (IQR): 47

Descriptive statistics

Standard deviation: 106.55941
Coefficient of variation (CV): 1.7610743
Kurtosis: 19.425635
Mean: 60.508187
Median Absolute Deviation (MAD): 16
Skewness: 4.070017
Sum: 107160
Variance: 11354.909
Monotonicity: Not monotonic
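Because 페이지수 has one missing value, its mean is consistent with dividing the sum by the 1771 non-missing rows rather than by all 1772 rows. The sketch below checks this with the reported figures; it is a consistency check under that assumption, not a description of how the profiler computes the mean internally.

```python
# Values reported for 페이지수 in this profile.
n_rows = 1772
n_missing = 1
total_pages = 107160
reported_mean = 60.508187

n_valid = n_rows - n_missing            # rows that actually carry a page count
mean_valid = total_pages / n_valid      # mean over non-missing rows
mean_all_rows = total_pages / n_rows    # what dividing by every row would give

print(f"mean over {n_valid} non-missing rows: {mean_valid:.6f}")
print(f"mean over all {n_rows} rows: {mean_all_rows:.6f}")
```

Only the first figure matches the reported mean; the second is lower by about 0.03 pages.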
Histogram with fixed size bins (bins=50)
Common Values
Value | Count | Frequency (%)
12 | 222 | 12.5%
16 | 140 | 7.9%
4 | 54 | 3.0%
8 | 48 | 2.7%
20 | 35 | 2.0%
3 | 31 | 1.7%
2 | 29 | 1.6%
18 | 26 | 1.5%
17 | 24 | 1.4%
21 | 23 | 1.3%
Other values (251) | 1139 | 64.3%

Minimum 10 values
Value | Count | Frequency (%)
1 | 23 | 1.3%
2 | 29 | 1.6%
3 | 31 | 1.7%
4 | 54 | 3.0%
5 | 17 | 1.0%
6 | 17 | 1.0%
7 | 15 | 0.8%
8 | 48 | 2.7%
9 | 17 | 1.0%
10 | 20 | 1.1%

Maximum 10 values
Value | Count | Frequency (%)
965 | 1 | 0.1%
938 | 1 | 0.1%
797 | 1 | 0.1%
766 | 1 | 0.1%
710 | 1 | 0.1%
671 | 2 | 0.1%
670 | 1 | 0.1%
669 | 1 | 0.1%
668 | 1 | 0.1%
663 | 2 | 0.1%

관리기관명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 14.0 KiB
양주시 자치행정과: 1772

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 양주시 자치행정과
2nd row: 양주시 자치행정과
3rd row: 양주시 자치행정과
4th row: 양주시 자치행정과
5th row: 양주시 자치행정과

Common Values

Value | Count | Frequency (%)
양주시 자치행정과 | 1772 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
양주시 | 1772 | 50.0%
자치행정과 | 1772 | 50.0%

관리기관 전화번호
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 14.0 KiB
031-8082-5253: 1620
031-8082-5252: 152

Length

Max length: 13
Median length: 13
Mean length: 13
Min length: 13

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 031-8082-5253
2nd row: 031-8082-5253
3rd row: 031-8082-5253
4th row: 031-8082-5253
5th row: 031-8082-5253

Common Values

Value | Count | Frequency (%)
031-8082-5253 | 1620 | 91.4%
031-8082-5252 | 152 | 8.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
031-8082-5253 | 1620 | 91.4%
031-8082-5252 | 152 | 8.6%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 14.0 KiB
2023-08-07: 1772

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-08-07
2nd row: 2023-08-07
3rd row: 2023-08-07
4th row: 2023-08-07
5th row: 2023-08-07

Common Values

Value | Count | Frequency (%)
2023-08-07 | 1772 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2023-08-07 | 1772 | 100.0%

Interactions

[Figures: four pairwise interaction plots]

Correlations

Variable | 순번 | 분류 | 페이지수 | 관리기관 전화번호
순번 | 1.000 | 0.763 | 0.407 | 0.851
분류 | 0.763 | 1.000 | 0.793 | 0.166
페이지수 | 0.407 | 0.793 | 1.000 | 0.098
관리기관 전화번호 | 0.851 | 0.166 | 0.098 | 1.000

Variable | 분류 | 관리기관 전화번호
분류 | 1.000 | 0.145
관리기관 전화번호 | 0.145 | 1.000

Variable | 순번 | 페이지수 | 분류 | 관리기관 전화번호
순번 | 1.000 | 0.207 | 0.405 | 0.682
페이지수 | 0.207 | 1.000 | 0.439 | 0.075
분류 | 0.405 | 0.439 | 1.000 | 0.145
관리기관 전화번호 | 0.682 | 0.075 | 0.145 | 1.000
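To read a correlation matrix like the first one above more easily, the variable pairs can be ranked by the strength of their coefficient. The sketch below does this for the first matrix; `ranked_pairs` is an illustrative helper, and no alert threshold is assumed.

```python
labels = ["순번", "분류", "페이지수", "관리기관 전화번호"]

# First correlation matrix from this section, in the same row and column order.
matrix = [
    [1.000, 0.763, 0.407, 0.851],
    [0.763, 1.000, 0.793, 0.166],
    [0.407, 0.793, 1.000, 0.098],
    [0.851, 0.166, 0.098, 1.000],
]

def ranked_pairs(names, values):
    """Return each unordered variable pair once, strongest coefficient first."""
    pairs = []
    for i in range(len(names)):
        for j in range(i + 1, len(names)):
            pairs.append((names[i], names[j], values[i][j]))
    return sorted(pairs, key=lambda pair: abs(pair[2]), reverse=True)

pairs = ranked_pairs(labels, matrix)
for first, second, coefficient in pairs:
    print(f"{first} / {second}: {coefficient:.3f}")
```

The strongest pair in this ranking, 순번 and 관리기관 전화번호, is the one named in the high-correlation alerts.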

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 순번 | 제목 | 제작일 | 분류 | 페이지수 | 관리기관명 | 관리기관 전화번호 | 데이터기준일자
0 | 16 | green1 | 2005-05-04 | 함께그린양주 | 16 | 양주시 자치행정과 | 031-8082-5253 | 2023-08-07
1 | 17 | green2 | 2005-05-18 | 함께그린양주 | 8 | 양주시 자치행정과 | 031-8082-5253 | 2023-08-07
2 | 18 | green3 | 2005-06-01 | 함께그린양주 | 12 | 양주시 자치행정과 | 031-8082-5253 | 2023-08-07
3 | 19 | green4 | 2005-06-15 | 함께그린양주 | 12 | 양주시 자치행정과 | 031-8082-5253 | 2023-08-07
4 | 20 | green5 | 2005-06-29 | 함께그린양주 | 12 | 양주시 자치행정과 | 031-8082-5253 | 2023-08-07
5 | 21 | green6 | 2005-07-13 | 함께그린양주 | 8 | 양주시 자치행정과 | 031-8082-5253 | 2023-08-07
6 | 22 | green7 | 2005-07-27 | 함께그린양주 | 8 | 양주시 자치행정과 | 031-8082-5253 | 2023-08-07
7 | 23 | green8 | 2005-08-10 | 함께그린양주 | 12 | 양주시 자치행정과 | 031-8082-5253 | 2023-08-07
8 | 24 | green9 | 2005-08-24 | 함께그린양주 | 12 | 양주시 자치행정과 | 031-8082-5253 | 2023-08-07
9 | 25 | green10 | 2005-09-07 | 함께그린양주 | 12 | 양주시 자치행정과 | 031-8082-5253 | 2023-08-07

Last rows
Index | 순번 | 제목 | 제작일 | 분류 | 페이지수 | 관리기관명 | 관리기관 전화번호 | 데이터기준일자
1762 | 3085 | 2022 - 20호 | 2022-05-17 | 양주시보 | 51 | 양주시 자치행정과 | 031-8082-5252 | 2023-08-07
1763 | 3086 | 2022 - 21호 | 2022-05-24 | 양주시보 | 59 | 양주시 자치행정과 | 031-8082-5252 | 2023-08-07
1764 | 3087 | GREEN_328 | 2022-05-31 | 함께그린양주 | 16 | 양주시 자치행정과 | 031-8082-5252 | 2023-08-07
1765 | 3088 | 2022 - 22호 | 2022-06-08 | 양주시보 | 344 | 양주시 자치행정과 | 031-8082-5252 | 2023-08-07
1766 | 3089 | 2022 - 23호 | 2022-06-08 | 양주시보 | 74 | 양주시 자치행정과 | 031-8082-5252 | 2023-08-07
1767 | 3090 | 2022 - 24호 | 2022-06-13 | 양주시보 | 59 | 양주시 자치행정과 | 031-8082-5252 | 2023-08-07
1768 | 3091 | 2022 - 25호 | 2022-06-22 | 양주시보 | 113 | 양주시 자치행정과 | 031-8082-5252 | 2023-08-07
1769 | 3092 | 의정광장65호 | 2022-06-28 | 의정소식지 | 32 | 양주시 자치행정과 | 031-8082-5252 | 2023-08-07
1770 | 3093 | 2022 - 26호 | 2022-06-28 | 양주시보 | 30 | 양주시 자치행정과 | 031-8082-5252 | 2023-08-07
1771 | 3094 | GREEN_329 | 2022-07-01 | 함께그린양주 | 16 | 양주시 자치행정과 | 031-8082-5252 | 2023-08-07

Duplicate rows

Most frequently occurring

Index | 순번 | 제목 | 제작일 | 분류 | 페이지수 | 관리기관명 | 관리기관 전화번호 | 데이터기준일자 | # duplicates
0 | 3022 | GREEN_318 | 2021-07-28 | 함께그린양주 | 16 | 양주시 자치행정과 | 031-8082-5252 | 2023-08-07 | 2