Overview

Dataset statistics

Number of variables: 4
Number of observations: 1850
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 61.6 KiB
Average record size in memory: 34.1 B

Variable types

Numeric: 2
Text: 1
Categorical: 1

Dataset

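Description: Popular search terms from the Gunpo City (군포시) library website over the most recent three months. The data provide the search rank, search term, search count, search date, and data reference date fields. Only terms searched 20 or more times are included.
Author: Gunpo City, Gyeonggi Province (경기도 군포시)
URL: https://www.data.go.kr/data/15123369/fileData.do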

Alerts

기준일자 (reference date) has constant value "2024-01-18" [Constant]
검색순위 (search rank) is highly overall correlated with 검색횟수 (search count) [High correlation]
검색횟수 is highly overall correlated with 검색순위 [High correlation]
검색횟수 is highly skewed (γ1 = 31.07890412) [Skewed]
검색순위 has unique values [Unique]
인기검색어 (popular search term) has unique values [Unique]

Reproduction

Analysis started: 2024-03-15 01:55:43.791412
Analysis finished: 2024-03-15 01:55:46.043424
Duration: 2.25 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

검색순위 (search rank)
Real number (ℝ)

Alerts: HIGH CORRELATION, UNIQUE

Distinct: 1850
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 925.5
Minimum: 1
Maximum: 1850
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 16.4 KiB

Quantile statistics

Minimum: 1
5-th percentile: 93.45
Q1: 463.25
Median: 925.5
Q3: 1387.75
95-th percentile: 1757.55
Maximum: 1850
Range: 1849
Interquartile range (IQR): 924.5
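
The quantiles above can be checked by hand, because the rank column is simply the integers 1 to 1850. The sketch below is an illustration, not the profiler's own code: it assumes linear interpolation between order statistics, which Python's `statistics.quantiles` provides with `method="inclusive"`.

```python
import statistics

# Ranks 1..1850, matching the minimum, maximum and distinct count reported above.
ranks = list(range(1, 1851))

# method="inclusive" interpolates linearly between neighbouring order
# statistics, treating the sample minimum and maximum as the 0% and 100% points.
quartiles = statistics.quantiles(ranks, n=4, method="inclusive")
twentieths = statistics.quantiles(ranks, n=20, method="inclusive")

q1, median, q3 = quartiles
p5, p95 = twentieths[0], twentieths[-1]
iqr = q3 - q1

print("Q1, median, Q3:", q1, median, q3)
print("5th and 95th percentile:", p5, p95)
print("IQR:", iqr)
```

Under that assumption the computed values agree with the report: the 5th percentile is 1 + 0.05 × 1849 = 93.45, and the IQR is 1387.75 − 463.25 = 924.5.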

Descriptive statistics

Standard deviation: 534.19332
Coefficient of variation (CV): 0.57719429
Kurtosis: -1.2
Mean: 925.5
Median Absolute Deviation (MAD): 462.5
Skewness: 0
Sum: 1712175
Variance: 285362.5
Monotonicity: Strictly increasing
Histogram with fixed size bins (bins=50)
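
Because 검색순위 is the sequence 1, 2, ..., n with n = 1850, its descriptive statistics have closed forms: the sum is n(n+1)/2, the mean is (n+1)/2, and the sample variance (n − 1 denominator) is n(n+1)/12. The following sketch recomputes them with the standard library. It assumes the report uses the sample variance, which the reported figure is consistent with.

```python
import math
import statistics

# Ranks 1..1850, as described by the minimum, maximum and distinct count.
ranks = list(range(1, 1851))
n = len(ranks)

total = sum(ranks)
mean = statistics.mean(ranks)
variance = statistics.variance(ranks)   # sample variance, n - 1 denominator
std_dev = math.sqrt(variance)
cv = std_dev / mean                     # coefficient of variation

# Median absolute deviation around the median.
median = statistics.median(ranks)
mad = statistics.median(abs(x - median) for x in ranks)

print("sum:", total)
print("mean:", mean)
print("variance:", variance)
print("standard deviation:", round(std_dev, 5))
print("CV:", round(cv, 8))
print("MAD:", mad)
```

The closed forms give 1850 × 1851 / 2 = 1712175 for the sum and 1850 × 1851 / 12 = 285362.5 for the variance, matching the table.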
Common values
Value  Count  Frequency (%)
1  1  0.1%
1244  1  0.1%
1242  1  0.1%
1241  1  0.1%
1240  1  0.1%
1239  1  0.1%
1238  1  0.1%
1237  1  0.1%
1236  1  0.1%
1235  1  0.1%
Other values (1840)  1840  99.5%

Minimum 10 values
Value  Count  Frequency (%)
1  1  0.1%
2  1  0.1%
3  1  0.1%
4  1  0.1%
5  1  0.1%
6  1  0.1%
7  1  0.1%
8  1  0.1%
9  1  0.1%
10  1  0.1%

Maximum 10 values
Value  Count  Frequency (%)
1850  1  0.1%
1849  1  0.1%
1848  1  0.1%
1847  1  0.1%
1846  1  0.1%
1845  1  0.1%
1844  1  0.1%
1843  1  0.1%
1842  1  0.1%
1841  1  0.1%

인기검색어 (popular search term)
Text

Alerts: UNIQUE

Distinct: 1850
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 14.6 KiB

Length

Max length: 19
Median length: 17
Mean length: 5.2762162
Min length: 1

Characters and Unicode

Total characters: 9761
Distinct characters: 792
Distinct categories: 11
Distinct scripts: 6
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
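
As a rough illustration of how such per-category counts can be derived, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. The term list is a small sample taken from rows shown in this report, not the full column, and the code is not the profiler's own implementation.

```python
import unicodedata
from collections import Counter

# A few search terms that appear in the report's sample rows (not the full column).
terms = ["흔한남매", "삼국지", "쇼펜하우어", "바다가 들리는 편의점", "역행자", "Ai"]

# unicodedata.category returns the two-letter general category of a character,
# e.g. "Lo" (Other Letter), "Ll" (Lowercase Letter), "Zs" (Space Separator).
category_counts = Counter(
    unicodedata.category(ch) for term in terms for ch in term
)

for category, count in category_counts.most_common():
    print(category, count)
```

Hangul syllables fall under "Lo" (Other Letter), which is why that category dominates the tables that follow.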

Unique

Unique: 1850
Unique (%): 100.0%

Sample

1st row: 흔한남매
2nd row: 삼국지
3rd row: 쇼펜하우어
4th row: 바다가 들리는 편의점
5th row: 역행자
Value  Count  Frequency (%)
삼국지  13  0.5%
한국사  12  0.5%
설민석  8  0.3%
읽는  8  0.3%
1  7  0.3%
신화  7  0.3%
장편소설  6  0.2%
로마  6  0.2%
그리스로마신화  5  0.2%
만화로  5  0.2%
Other values (2245)  2586  97.1%

Most occurring characters

Value  Count  Frequency (%)
(space)  817  8.4%
(missing)  249  2.6%
(missing)  154  1.6%
(missing)  137  1.4%
(missing)  133  1.4%
(missing)  123  1.3%
(missing)  107  1.1%
e  101  1.0%
(missing)  94  1.0%
(missing)  86  0.9%
Other values (782)  7760  79.5%

Most occurring categories

Value  Count  Frequency (%)
Other Letter  7647  78.3%
Lowercase Letter  827  8.5%
Space Separator  817  8.4%
Decimal Number  189  1.9%
Uppercase Letter  121  1.2%
Other Punctuation  107  1.1%
Open Punctuation  25  0.3%
Close Punctuation  24  0.2%
Math Symbol  2  < 0.1%
Initial Punctuation  1  < 0.1%

Most frequent character per category

Other Letter
Value  Count  Frequency (%)
(missing)  249  3.3%
(missing)  154  2.0%
(missing)  137  1.8%
(missing)  133  1.7%
(missing)  123  1.6%
(missing)  107  1.4%
(missing)  94  1.2%
(missing)  86  1.1%
(missing)  84  1.1%
(missing)  80  1.0%
Other values (704)  6400  83.7%

Lowercase Letter
Value  Count  Frequency (%)
e  101  12.2%
o  69  8.3%
a  68  8.2%
r  59  7.1%
t  57  6.9%
s  56  6.8%
i  50  6.0%
n  41  5.0%
d  34  4.1%
l  30  3.6%
Other values (16)  262  31.7%

Uppercase Letter
Value  Count  Frequency (%)
E  10  8.3%
S  10  8.3%
T  9  7.4%
R  8  6.6%
D  8  6.6%
H  7  5.8%
G  7  5.8%
A  7  5.8%
O  6  5.0%
W  6  5.0%
Other values (16)  43  35.5%

Decimal Number
Value  Count  Frequency (%)
1  66  34.9%
2  30  15.9%
4  24  12.7%
0  16  8.5%
3  13  6.9%
5  12  6.3%
8  12  6.3%
6  7  3.7%
7  5  2.6%
9  4  2.1%

Other Punctuation
Value  Count  Frequency (%)
.  63  58.9%
,  13  12.1%
:  13  12.1%
!  7  6.5%
?  5  4.7%
·  2  1.9%
'  1  0.9%
;  1  0.9%
%  1  0.9%
\  1  0.9%

Space Separator
Value  Count  Frequency (%)
(space)  817  100.0%

Open Punctuation
Value  Count  Frequency (%)
(  25  100.0%

Close Punctuation
Value  Count  Frequency (%)
)  24  100.0%

Math Symbol
Value  Count  Frequency (%)
+  2  100.0%

Initial Punctuation
Value  Count  Frequency (%)
(missing)  1  100.0%

Dash Punctuation
Value  Count  Frequency (%)
-  1  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  7635  78.2%
Common  1166  11.9%
Latin  948  9.7%
Han  5  0.1%
Katakana  4  < 0.1%
Hiragana  3  < 0.1%

Most frequent character per script

Hangul
Value  Count  Frequency (%)
(missing)  249  3.3%
(missing)  154  2.0%
(missing)  137  1.8%
(missing)  133  1.7%
(missing)  123  1.6%
(missing)  107  1.4%
(missing)  94  1.2%
(missing)  86  1.1%
(missing)  84  1.1%
(missing)  80  1.0%
Other values (692)  6388  83.7%

Latin
Value  Count  Frequency (%)
e  101  10.7%
o  69  7.3%
a  68  7.2%
r  59  6.2%
t  57  6.0%
s  56  5.9%
i  50  5.3%
n  41  4.3%
d  34  3.6%
l  30  3.2%
Other values (42)  383  40.4%

Common
Value  Count  Frequency (%)
(space)  817  70.1%
1  66  5.7%
.  63  5.4%
2  30  2.6%
(  25  2.1%
4  24  2.1%
)  24  2.1%
0  16  1.4%
3  13  1.1%
,  13  1.1%
Other values (16)  75  6.4%

Han
Value  Count  Frequency (%)
(missing)  1  20.0%
(missing)  1  20.0%
(missing)  1  20.0%
(missing)  1  20.0%
(missing)  1  20.0%

Katakana
Value  Count  Frequency (%)
(missing)  1  25.0%
(missing)  1  25.0%
(missing)  1  25.0%
(missing)  1  25.0%

Hiragana
Value  Count  Frequency (%)
(missing)  1  33.3%
(missing)  1  33.3%
(missing)  1  33.3%

Most occurring blocks

Value  Count  Frequency (%)
Hangul  7629  78.2%
ASCII  2111  21.6%
Compat Jamo  6  0.1%
CJK  5  0.1%
Katakana  4  < 0.1%
Hiragana  3  < 0.1%
None  2  < 0.1%
Punctuation  1  < 0.1%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
(space)  817  38.7%
e  101  4.8%
o  69  3.3%
a  68  3.2%
1  66  3.1%
.  63  3.0%
r  59  2.8%
t  57  2.7%
s  56  2.7%
i  50  2.4%
Other values (66)  705  33.4%

Hangul
Value  Count  Frequency (%)
(missing)  249  3.3%
(missing)  154  2.0%
(missing)  137  1.8%
(missing)  133  1.7%
(missing)  123  1.6%
(missing)  107  1.4%
(missing)  94  1.2%
(missing)  86  1.1%
(missing)  84  1.1%
(missing)  80  1.0%
Other values (687)  6382  83.7%

Compat Jamo
Value  Count  Frequency (%)
(missing)  2  33.3%
(missing)  1  16.7%
(missing)  1  16.7%
(missing)  1  16.7%
(missing)  1  16.7%

None
Value  Count  Frequency (%)
·  2  100.0%

CJK
Value  Count  Frequency (%)
(missing)  1  20.0%
(missing)  1  20.0%
(missing)  1  20.0%
(missing)  1  20.0%
(missing)  1  20.0%

Punctuation
Value  Count  Frequency (%)
(missing)  1  100.0%

Katakana
Value  Count  Frequency (%)
(missing)  1  25.0%
(missing)  1  25.0%
(missing)  1  25.0%
(missing)  1  25.0%

Hiragana
Value  Count  Frequency (%)
(missing)  1  33.3%
(missing)  1  33.3%
(missing)  1  33.3%

검색횟수 (search count)
Real number (ℝ)

Alerts: HIGH CORRELATION, SKEWED

Distinct: 172
Distinct (%): 9.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 51.765405
Minimum: 20
Maximum: 8006
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 16.4 KiB

Quantile statistics

Minimum: 20
5-th percentile: 20
Q1: 24
Median: 26
Q3: 41
95-th percentile: 120.55
Maximum: 8006
Range: 7986
Interquartile range (IQR): 17

Descriptive statistics

Standard deviation: 208.8754
Coefficient of variation (CV): 4.0350384
Kurtosis: 1146.396
Mean: 51.765405
Median Absolute Deviation (MAD): 4
Skewness: 31.078904
Sum: 95766
Variance: 43628.932
Monotonicity: Decreasing
Histogram with fixed size bins (bins=50)
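
The skewness of 31.08 reflects a long right tail: most terms were searched 20 to 41 times, while the top term was searched 8006 times. The sketch below shows one common way to compute sample skewness, the bias-adjusted Fisher-Pearson coefficient. That the report uses exactly this estimator is an assumption, and the input is a small hypothetical sample built from values seen in the report, not the full column.

```python
import math


def sample_skewness(values):
    """Bias-adjusted Fisher-Pearson skewness coefficient (G1)."""
    n = len(values)
    if n < 3:
        raise ValueError("skewness needs at least three observations")
    mean = sum(values) / n
    # Second and third central moments (population form, divided by n).
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    if m2 == 0:
        return 0.0  # constant data has no asymmetry
    g1 = m3 / m2 ** 1.5
    # Small-sample correction factor.
    return math.sqrt(n * (n - 1)) / (n - 2) * g1


# Hypothetical illustration: typical low counts plus the report's maximum (8006).
heavy_tailed = [20, 21, 22, 25, 25, 25, 26, 27, 41, 8006]
symmetric = [1, 2, 3, 4, 5]

print("heavy-tailed sample:", sample_skewness(heavy_tailed))
print("symmetric sample:", sample_skewness(symmetric))
```

A single extreme value is enough to push the coefficient well above zero, whereas a symmetric sample yields zero.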
Common values
Value  Count  Frequency (%)
25  401  21.7%
26  129  7.0%
20  121  6.5%
21  100  5.4%
27  94  5.1%
22  92  5.0%
23  86  4.6%
24  69  3.7%
28  67  3.6%
29  39  2.1%
Other values (162)  652  35.2%

Minimum 10 values
Value  Count  Frequency (%)
20  121  6.5%
21  100  5.4%
22  92  5.0%
23  86  4.6%
24  69  3.7%
25  401  21.7%
26  129  7.0%
27  94  5.1%
28  67  3.6%
29  39  2.1%

Maximum 10 values
Value  Count  Frequency (%)
8006  1  0.1%
1767  1  0.1%
1652  1  0.1%
1635  1  0.1%
1583  1  0.1%
1006  1  0.1%
858  1  0.1%
786  1  0.1%
728  1  0.1%
715  1  0.1%

기준일자 (reference date)
Categorical

Alerts: CONSTANT

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 14.6 KiB

Value  Count
2024-01-18  1850

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2024-01-18
2nd row: 2024-01-18
3rd row: 2024-01-18
4th row: 2024-01-18
5th row: 2024-01-18

Common Values

Value  Count  Frequency (%)
2024-01-18  1850  100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
2024-01-18  1850  100.0%

Interactions

[Four interaction plots for 검색순위 and 검색횟수; images not preserved]

Correlations

          검색순위  검색횟수
검색순위  1.000  0.135
검색횟수  0.135  1.000

          검색순위  검색횟수
검색순위  1.000  -0.994
검색횟수  -0.994  1.000
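
A coefficient near -1 is what a rank-based measure produces when one variable falls as the other rises, regardless of how uneven the drops are. The sketch below is a minimal standard-library implementation of Spearman's rank correlation, applied to the ten top rows shown in this report. Whether the -0.994 figure is a Spearman coefficient is an assumption, since the extracted text does not name the method.

```python
import math


def average_ranks(values):
    """Rank values from 1, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        tied_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = tied_rank
        i = j + 1
    return ranks


def pearson(xs, ys):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))


# The ten highest-ranked rows from the report's sample.
search_rank = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
search_count = [8006, 1767, 1652, 1635, 1583, 1006, 858, 786, 728, 715]

print("Spearman:", spearman(search_rank, search_count))
print("Pearson:", pearson(search_rank, search_count))
```

On these ten rows the counts strictly decrease, so the rank correlation is -1. In the full column many terms share the same count (for example, 401 terms have a count of 25), and such ties would keep a rank-based coefficient slightly short of -1.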

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index  검색순위  인기검색어  검색횟수  기준일자
0  1  흔한남매  8006  2024-01-18
1  2  삼국지  1767  2024-01-18
2  3  쇼펜하우어  1652  2024-01-18
3  4  바다가 들리는 편의점  1635  2024-01-18
4  5  역행자  1583  2024-01-18
5  6  마흔에 읽는 쇼펜하우어  1006  2024-01-18
6  7  해리포터  858  2024-01-18
7  8  불편한 편의점  786  2024-01-18
8  9  마인크래프트  728  2024-01-18
9  10  도둑맞은 집중력  715  2024-01-18

Last rows
Index  검색순위  인기검색어  검색횟수  기준일자
1840  1841  고고학  20  2024-01-18
1841  1842  심슨  20  2024-01-18
1842  1843  더 마인드  20  2024-01-18
1843  1844  Ai  20  2024-01-18
1844  1845  권귀헌  20  2024-01-18
1845  1846  엉덩이탐정  20  2024-01-18
1846  1847  개미  20  2024-01-18
1847  1848  수면  20  2024-01-18
1848  1849  포카혼타스  20  2024-01-18
1849  1850  수학동화  20  2024-01-18