Overview

Dataset statistics

Number of variables3
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory322.3 KiB
Average record size in memory33.0 B

Variable types

Numeric1
Text1
DateTime1

Dataset

Description경남 창원시 도서관사업소 홈페이지의 인기 검색어 목록에 대한 데이터 입니다. 항목은 일련번호, 검색어, 검색일을 제공합니다.
Author경상남도 창원시
URLhttps://www.data.go.kr/data/15088775/fileData.do

Alerts

연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 17:03:00.160566
Analysis finished2023-12-12 17:03:00.681855
Duration0.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6449.9976
Minimum1
Maximum12871
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T02:03:00.760024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile635.9
Q13242.75
median6458.5
Q39648.25
95-th percentile12247.1
Maximum12871
Range12870
Interquartile range (IQR)6405.5

Descriptive statistics

Standard deviation3710.9131
Coefficient of variation (CV)0.57533558
Kurtosis-1.1876059
Mean6449.9976
Median Absolute Deviation (MAD)3204
Skewness-0.0067010055
Sum64499976
Variance13770876
MonotonicityNot monotonic
2023-12-13T02:03:00.947537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10863 1
 
< 0.1%
7895 1
 
< 0.1%
2360 1
 
< 0.1%
9346 1
 
< 0.1%
8531 1
 
< 0.1%
12009 1
 
< 0.1%
10906 1
 
< 0.1%
9200 1
 
< 0.1%
8049 1
 
< 0.1%
11718 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
12871 1
< 0.1%
12868 1
< 0.1%
12867 1
< 0.1%
12866 1
< 0.1%
12865 1
< 0.1%
12863 1
< 0.1%
12862 1
< 0.1%
12861 1
< 0.1%
12859 1
< 0.1%
12858 1
< 0.1%
Distinct947
Distinct (%)9.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T02:03:01.353148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length2.5221
Min length1

Characters and Unicode

Total characters25221
Distinct characters493
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique636 ?
Unique (%)6.4%

Sample

1st row구연
2nd row사물함
3rd row구연
4th row
5th row구연
ValueCountFrequency (%)
영화 1033
 
10.3%
북스타트 949
 
9.5%
886
 
8.9%
반납 886
 
8.9%
구연 875
 
8.8%
사물함 791
 
7.9%
꾸러미 299
 
3.0%
인문학 296
 
3.0%
기간제 280
 
2.8%
독서회 238
 
2.4%
Other values (935) 3467
34.7%
2023-12-13T02:03:01.908551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1194
 
4.7%
1076
 
4.3%
1075
 
4.3%
999
 
4.0%
982
 
3.9%
979
 
3.9%
937
 
3.7%
924
 
3.7%
896
 
3.6%
890
 
3.5%
Other values (483) 15269
60.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 24854
98.5%
Lowercase Letter 239
 
0.9%
Decimal Number 90
 
0.4%
Uppercase Letter 34
 
0.1%
Other Punctuation 3
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1194
 
4.8%
1076
 
4.3%
1075
 
4.3%
999
 
4.0%
982
 
4.0%
979
 
3.9%
937
 
3.8%
924
 
3.7%
896
 
3.6%
890
 
3.6%
Other values (432) 14902
60.0%
Lowercase Letter
ValueCountFrequency (%)
d 57
23.8%
v 22
 
9.2%
r 16
 
6.7%
t 14
 
5.9%
h 14
 
5.9%
i 13
 
5.4%
w 12
 
5.0%
k 11
 
4.6%
s 11
 
4.6%
j 9
 
3.8%
Other values (14) 60
25.1%
Uppercase Letter
ValueCountFrequency (%)
D 12
35.3%
V 4
 
11.8%
O 3
 
8.8%
W 2
 
5.9%
M 2
 
5.9%
A 2
 
5.9%
K 1
 
2.9%
B 1
 
2.9%
P 1
 
2.9%
C 1
 
2.9%
Other values (5) 5
14.7%
Decimal Number
ValueCountFrequency (%)
0 22
24.4%
2 19
21.1%
1 13
14.4%
3 9
10.0%
4 6
 
6.7%
5 6
 
6.7%
9 5
 
5.6%
8 5
 
5.6%
6 4
 
4.4%
7 1
 
1.1%
Other Punctuation
ValueCountFrequency (%)
. 3
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 24854
98.5%
Latin 273
 
1.1%
Common 94
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1194
 
4.8%
1076
 
4.3%
1075
 
4.3%
999
 
4.0%
982
 
4.0%
979
 
3.9%
937
 
3.8%
924
 
3.7%
896
 
3.6%
890
 
3.6%
Other values (432) 14902
60.0%
Latin
ValueCountFrequency (%)
d 57
20.9%
v 22
 
8.1%
r 16
 
5.9%
t 14
 
5.1%
h 14
 
5.1%
i 13
 
4.8%
D 12
 
4.4%
w 12
 
4.4%
k 11
 
4.0%
s 11
 
4.0%
Other values (29) 91
33.3%
Common
ValueCountFrequency (%)
0 22
23.4%
2 19
20.2%
1 13
13.8%
3 9
9.6%
4 6
 
6.4%
5 6
 
6.4%
9 5
 
5.3%
8 5
 
5.3%
6 4
 
4.3%
. 3
 
3.2%
Other values (2) 2
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 24842
98.5%
ASCII 367
 
1.5%
Compat Jamo 12
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1194
 
4.8%
1076
 
4.3%
1075
 
4.3%
999
 
4.0%
982
 
4.0%
979
 
3.9%
937
 
3.8%
924
 
3.7%
896
 
3.6%
890
 
3.6%
Other values (422) 14890
59.9%
ASCII
ValueCountFrequency (%)
d 57
 
15.5%
v 22
 
6.0%
0 22
 
6.0%
2 19
 
5.2%
r 16
 
4.4%
t 14
 
3.8%
h 14
 
3.8%
i 13
 
3.5%
1 13
 
3.5%
D 12
 
3.3%
Other values (41) 165
45.0%
Compat Jamo
ValueCountFrequency (%)
2
16.7%
2
16.7%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
Distinct182
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2021-03-18 00:00:00
Maximum2021-09-15 00:00:00
2023-12-13T02:03:02.042835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:03:02.186646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-13T02:03:00.452226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-13T02:03:00.573317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:03:00.648207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번검색어검색일
1086210863구연2021-09-06
73507351사물함2021-07-15
71207121구연2021-07-14
11115111162021-09-06
49344935구연2021-06-11
1215612157영화2021-09-08
1158311584반납2021-09-07
65746575영화2021-07-03
1257712578추석2021-09-13
12291230반납2021-04-11
연번검색어검색일
88648865다락2021-08-09
65466547교과2021-07-02
640464052021-06-29
68156816복사2021-07-09
92329233도서회원증2021-08-14
621962202021-06-25
73927393사물함2021-07-15
72177218영화2021-07-14
77167717초등2021-07-20
784978502021-07-22