Overview

Dataset statistics

Number of variables16
Number of observations8745
Missing cells1531
Missing cells (%)1.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.1 MiB
Average record size in memory130.0 B

Variable types

Numeric2
Categorical6
Text8

Dataset

Description전라남도 및 유관기관, 공공기관 등에서 발행하는 연차보고서, 통계 연보, 간행물 등 보유하고 있는 행정자료 정보 목록이다
Author전라남도
URLhttps://www.data.go.kr/data/15105775/fileData.do

Alerts

관리구분 has constant value ""Constant
이용제한구분 has constant value ""Constant
자료실명 has constant value ""Constant
구분 has constant value ""Constant
배가상태 is highly imbalanced (98.5%)Imbalance
원-복본구분 is highly imbalanced (89.3%)Imbalance
발행지 has 538 (6.2%) missing valuesMissing
가격 has 928 (10.6%) missing valuesMissing
가격 is highly skewed (γ1 = 43.1886101)Skewed
연번 has unique valuesUnique
가격 has 7755 (88.7%) zerosZeros

Reproduction

Analysis started2023-12-12 06:51:35.939185
Analysis finished2023-12-12 06:51:38.698289
Duration2.76 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct8745
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4373
Minimum1
Maximum8745
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size77.0 KiB
2023-12-12T15:51:38.781322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile438.2
Q12187
median4373
Q36559
95-th percentile8307.8
Maximum8745
Range8744
Interquartile range (IQR)4372

Descriptive statistics

Standard deviation2524.6084
Coefficient of variation (CV)0.57731726
Kurtosis-1.2
Mean4373
Median Absolute Deviation (MAD)2186
Skewness0
Sum38241885
Variance6373647.5
MonotonicityStrictly increasing
2023-12-12T15:51:38.923556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
5833 1
 
< 0.1%
5827 1
 
< 0.1%
5828 1
 
< 0.1%
5829 1
 
< 0.1%
5830 1
 
< 0.1%
5831 1
 
< 0.1%
5832 1
 
< 0.1%
5834 1
 
< 0.1%
5876 1
 
< 0.1%
Other values (8735) 8735
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
8745 1
< 0.1%
8744 1
< 0.1%
8743 1
< 0.1%
8742 1
< 0.1%
8741 1
< 0.1%
8740 1
< 0.1%
8739 1
< 0.1%
8738 1
< 0.1%
8737 1
< 0.1%
8736 1
< 0.1%

관리구분
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size68.4 KiB
강항지식정보센터
8745 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강항지식정보센터
2nd row강항지식정보센터
3rd row강항지식정보센터
4th row강항지식정보센터
5th row강항지식정보센터

Common Values

ValueCountFrequency (%)
강항지식정보센터 8745
100.0%

Length

2023-12-12T15:51:39.048939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:51:39.142354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
강항지식정보센터 8745
100.0%

배가상태
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size68.4 KiB
비치자료
8725 
특별대출자료
 
18
관외대출자료
 
2

Length

Max length6
Median length4
Mean length4.004574
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row비치자료
2nd row비치자료
3rd row비치자료
4th row비치자료
5th row비치자료

Common Values

ValueCountFrequency (%)
비치자료 8725
99.8%
특별대출자료 18
 
0.2%
관외대출자료 2
 
< 0.1%

Length

2023-12-12T15:51:39.252252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:51:39.359689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
비치자료 8725
99.8%
특별대출자료 18
 
0.2%
관외대출자료 2
 
< 0.1%

이용제한구분
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size68.4 KiB
일반
8745 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반
2nd row일반
3rd row일반
4th row일반
5th row일반

Common Values

ValueCountFrequency (%)
일반 8745
100.0%

Length

2023-12-12T15:51:39.463077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:51:39.558364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 8745
100.0%
Distinct8740
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size68.4 KiB
2023-12-12T15:51:39.746050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters104940
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8735 ?
Unique (%)99.9%

Sample

1st rowSB0000000000
2nd rowSB0000000001
3rd rowSB0000000002
4th rowSB0000000003
5th rowSB0000000004
ValueCountFrequency (%)
sb0000004233 2
 
< 0.1%
sb0000003213 2
 
< 0.1%
sb0000004213 2
 
< 0.1%
sb0000008626 2
 
< 0.1%
sb0000004627 2
 
< 0.1%
sb0000005832 1
 
< 0.1%
sb0000005841 1
 
< 0.1%
sb0000005825 1
 
< 0.1%
sb0000005837 1
 
< 0.1%
sb0000005836 1
 
< 0.1%
Other values (8730) 8730
99.8%
2023-12-12T15:51:40.052146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 56121
53.5%
S 8745
 
8.3%
B 8745
 
8.3%
2 3658
 
3.5%
1 3657
 
3.5%
3 3656
 
3.5%
4 3649
 
3.5%
6 3647
 
3.5%
5 3644
 
3.5%
7 3587
 
3.4%
Other values (2) 5831
 
5.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 87450
83.3%
Uppercase Letter 17490
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 56121
64.2%
2 3658
 
4.2%
1 3657
 
4.2%
3 3656
 
4.2%
4 3649
 
4.2%
6 3647
 
4.2%
5 3644
 
4.2%
7 3587
 
4.1%
8 3287
 
3.8%
9 2544
 
2.9%
Uppercase Letter
ValueCountFrequency (%)
S 8745
50.0%
B 8745
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 87450
83.3%
Latin 17490
 
16.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 56121
64.2%
2 3658
 
4.2%
1 3657
 
4.2%
3 3656
 
4.2%
4 3649
 
4.2%
6 3647
 
4.2%
5 3644
 
4.2%
7 3587
 
4.1%
8 3287
 
3.8%
9 2544
 
2.9%
Latin
ValueCountFrequency (%)
S 8745
50.0%
B 8745
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 104940
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 56121
53.5%
S 8745
 
8.3%
B 8745
 
8.3%
2 3658
 
3.5%
1 3657
 
3.5%
3 3656
 
3.5%
4 3649
 
3.5%
6 3647
 
3.5%
5 3644
 
3.5%
7 3587
 
3.4%
Other values (2) 5831
 
5.6%
Distinct3592
Distinct (%)41.1%
Missing0
Missing (%)0.0%
Memory size68.4 KiB
2023-12-12T15:51:40.282135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length23
Mean length10.440709
Min length3

Characters and Unicode

Total characters91304
Distinct characters188
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2786 ?
Unique (%)31.9%

Sample

1st row3421-국12ㄱ-제11집(2018)
2nd row32599-전292ㅍ
3rd row32599-전292ㄴ
4th row32599-전292ㅇ
5th row32599-전292ㅎ
ValueCountFrequency (%)
3190911-통14ㅇ 171
 
1.9%
520-통14ㄴ 60
 
0.7%
350-국95 60
 
0.7%
311195-전292ㅈ 57
 
0.6%
3190911-경74ㅇ 56
 
0.6%
310-충83ㅊ 55
 
0.6%
310-경52ㄱ 55
 
0.6%
311196-나76ㄴ 53
 
0.6%
31059-전292ㅈ 52
 
0.6%
320-충83ㅊ 52
 
0.6%
Other values (3600) 8167
92.4%
2023-12-12T15:51:40.675365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 12071
13.2%
3 11364
12.4%
- 10266
11.2%
0 7557
 
8.3%
5 7327
 
8.0%
9 6984
 
7.6%
2 5386
 
5.9%
6 3637
 
4.0%
7 3568
 
3.9%
4 3319
 
3.6%
Other values (178) 19825
21.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 63475
69.5%
Other Letter 16849
 
18.5%
Dash Punctuation 10266
 
11.2%
Uppercase Letter 368
 
0.4%
Math Symbol 125
 
0.1%
Space Separator 93
 
0.1%
Lowercase Letter 67
 
0.1%
Other Punctuation 52
 
0.1%
Close Punctuation 4
 
< 0.1%
Open Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1604
 
9.5%
1525
 
9.1%
1167
 
6.9%
1085
 
6.4%
962
 
5.7%
790
 
4.7%
739
 
4.4%
571
 
3.4%
498
 
3.0%
495
 
2.9%
Other values (137) 7413
44.0%
Uppercase Letter
ValueCountFrequency (%)
V 287
78.0%
O 15
 
4.1%
F 13
 
3.5%
K 12
 
3.3%
G 8
 
2.2%
E 6
 
1.6%
A 5
 
1.4%
I 5
 
1.4%
D 4
 
1.1%
N 3
 
0.8%
Other values (7) 10
 
2.7%
Decimal Number
ValueCountFrequency (%)
1 12071
19.0%
3 11364
17.9%
0 7557
11.9%
5 7327
11.5%
9 6984
11.0%
2 5386
8.5%
6 3637
 
5.7%
7 3568
 
5.6%
4 3319
 
5.2%
8 2262
 
3.6%
Lowercase Letter
ValueCountFrequency (%)
v 63
94.0%
s 1
 
1.5%
k 1
 
1.5%
a 1
 
1.5%
f 1
 
1.5%
Math Symbol
ValueCountFrequency (%)
= 124
99.2%
~ 1
 
0.8%
Other Punctuation
ValueCountFrequency (%)
, 51
98.1%
/ 1
 
1.9%
Dash Punctuation
ValueCountFrequency (%)
- 10266
100.0%
Space Separator
ValueCountFrequency (%)
93
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 74020
81.1%
Hangul 16849
 
18.5%
Latin 435
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1604
 
9.5%
1525
 
9.1%
1167
 
6.9%
1085
 
6.4%
962
 
5.7%
790
 
4.7%
739
 
4.4%
571
 
3.4%
498
 
3.0%
495
 
2.9%
Other values (137) 7413
44.0%
Latin
ValueCountFrequency (%)
V 287
66.0%
v 63
 
14.5%
O 15
 
3.4%
F 13
 
3.0%
K 12
 
2.8%
G 8
 
1.8%
E 6
 
1.4%
A 5
 
1.1%
I 5
 
1.1%
D 4
 
0.9%
Other values (12) 17
 
3.9%
Common
ValueCountFrequency (%)
1 12071
16.3%
3 11364
15.4%
- 10266
13.9%
0 7557
10.2%
5 7327
9.9%
9 6984
9.4%
2 5386
7.3%
6 3637
 
4.9%
7 3568
 
4.8%
4 3319
 
4.5%
Other values (9) 2541
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 74455
81.5%
Hangul 8757
 
9.6%
Compat Jamo 8092
 
8.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 12071
16.2%
3 11364
15.3%
- 10266
13.8%
0 7557
10.1%
5 7327
9.8%
9 6984
9.4%
2 5386
7.2%
6 3637
 
4.9%
7 3568
 
4.8%
4 3319
 
4.5%
Other values (31) 2976
 
4.0%
Compat Jamo
ValueCountFrequency (%)
1604
19.8%
1525
18.8%
1167
14.4%
1085
13.4%
790
9.8%
456
 
5.6%
364
 
4.5%
362
 
4.5%
297
 
3.7%
272
 
3.4%
Other values (6) 170
 
2.1%
Hangul
ValueCountFrequency (%)
962
 
11.0%
739
 
8.4%
571
 
6.5%
498
 
5.7%
495
 
5.7%
405
 
4.6%
292
 
3.3%
270
 
3.1%
181
 
2.1%
172
 
2.0%
Other values (121) 4172
47.6%

자료실명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size68.4 KiB
강항종합자료실
8745 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강항종합자료실
2nd row강항종합자료실
3rd row강항종합자료실
4th row강항종합자료실
5th row강항종합자료실

Common Values

ValueCountFrequency (%)
강항종합자료실 8745
100.0%

Length

2023-12-12T15:51:40.796415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:51:40.879364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
강항종합자료실 8745
100.0%

서명
Text

Distinct8445
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Memory size68.4 KiB
2023-12-12T15:51:41.069375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length166
Median length141
Mean length19.466438
Min length4

Characters and Unicode

Total characters170234
Distinct characters805
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8243 ?
Unique (%)94.3%

Sample

1st row(국가인권위원회)결정례집 인권정책, 침해구제 차별시정 제11집(2018)
2nd row포도 생산비 절감 경영매뉴얼
3rd row녹차 생산비 절감 경영 매뉴얼
4th row오리 생산비 절감 경영 매뉴얼
5th row한우 생산비 절감 경영 매뉴얼
ValueCountFrequency (%)
연구 932
 
3.3%
585
 
2.1%
위한 521
 
1.8%
관한 339
 
1.2%
방안 327
 
1.2%
통계연보 262
 
0.9%
중심으로 210
 
0.7%
개선방안 187
 
0.7%
178
 
0.6%
분석 172
 
0.6%
Other values (12140) 24567
86.9%
2023-12-12T15:51:41.495883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
19720
 
11.6%
1 5310
 
3.1%
4917
 
2.9%
0 4604
 
2.7%
( 4581
 
2.7%
) 4579
 
2.7%
9 4480
 
2.6%
2 4305
 
2.5%
4301
 
2.5%
3905
 
2.3%
Other values (795) 109532
64.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 110691
65.0%
Decimal Number 24606
 
14.5%
Space Separator 19720
 
11.6%
Open Punctuation 4583
 
2.7%
Close Punctuation 4581
 
2.7%
Lowercase Letter 2876
 
1.7%
Uppercase Letter 1240
 
0.7%
Dash Punctuation 1087
 
0.6%
Other Punctuation 413
 
0.2%
Letter Number 173
 
0.1%
Other values (4) 264
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4917
 
4.4%
4301
 
3.9%
3905
 
3.5%
3637
 
3.3%
2698
 
2.4%
2268
 
2.0%
2105
 
1.9%
2069
 
1.9%
2049
 
1.9%
2022
 
1.8%
Other values (700) 80720
72.9%
Lowercase Letter
ValueCountFrequency (%)
e 327
11.4%
o 281
9.8%
n 266
9.2%
i 263
9.1%
t 260
9.0%
a 239
 
8.3%
r 194
 
6.7%
s 175
 
6.1%
c 117
 
4.1%
l 111
 
3.9%
Other values (15) 643
22.4%
Uppercase Letter
ValueCountFrequency (%)
A 151
12.2%
I 147
11.9%
O 125
10.1%
R 90
 
7.3%
K 90
 
7.3%
E 74
 
6.0%
C 73
 
5.9%
D 66
 
5.3%
T 65
 
5.2%
N 60
 
4.8%
Other values (14) 299
24.1%
Other Punctuation
ValueCountFrequency (%)
, 267
64.6%
· 45
 
10.9%
/ 33
 
8.0%
' 27
 
6.5%
" 14
 
3.4%
& 13
 
3.1%
? 7
 
1.7%
# 2
 
0.5%
@ 2
 
0.5%
* 1
 
0.2%
Other values (2) 2
 
0.5%
Decimal Number
ValueCountFrequency (%)
1 5310
21.6%
0 4604
18.7%
9 4480
18.2%
2 4305
17.5%
8 1475
 
6.0%
7 1189
 
4.8%
6 955
 
3.9%
5 853
 
3.5%
3 797
 
3.2%
4 638
 
2.6%
Letter Number
ValueCountFrequency (%)
55
31.8%
53
30.6%
29
16.8%
12
 
6.9%
8
 
4.6%
7
 
4.0%
3
 
1.7%
2
 
1.2%
2
 
1.2%
2
 
1.2%
Math Symbol
ValueCountFrequency (%)
= 96
56.8%
~ 71
42.0%
× 1
 
0.6%
1
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 4581
> 99.9%
2
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 4579
> 99.9%
2
 
< 0.1%
Modifier Symbol
ValueCountFrequency (%)
˙ 14
66.7%
` 7
33.3%
Space Separator
ValueCountFrequency (%)
19720
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1087
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 73
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 110475
64.9%
Common 55254
32.5%
Latin 4289
 
2.5%
Han 216
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4917
 
4.5%
4301
 
3.9%
3905
 
3.5%
3637
 
3.3%
2698
 
2.4%
2268
 
2.1%
2105
 
1.9%
2069
 
1.9%
2049
 
1.9%
2022
 
1.8%
Other values (641) 80504
72.9%
Latin
ValueCountFrequency (%)
e 327
 
7.6%
o 281
 
6.6%
n 266
 
6.2%
i 263
 
6.1%
t 260
 
6.1%
a 239
 
5.6%
r 194
 
4.5%
s 175
 
4.1%
A 151
 
3.5%
I 147
 
3.4%
Other values (49) 1986
46.3%
Han
ValueCountFrequency (%)
15
 
6.9%
15
 
6.9%
15
 
6.9%
15
 
6.9%
15
 
6.9%
15
 
6.9%
10
 
4.6%
8
 
3.7%
5
 
2.3%
5
 
2.3%
Other values (49) 98
45.4%
Common
ValueCountFrequency (%)
19720
35.7%
1 5310
 
9.6%
0 4604
 
8.3%
( 4581
 
8.3%
) 4579
 
8.3%
9 4480
 
8.1%
2 4305
 
7.8%
8 1475
 
2.7%
7 1189
 
2.2%
- 1087
 
2.0%
Other values (26) 3924
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 110462
64.9%
ASCII 59303
34.8%
CJK 213
 
0.1%
Number Forms 173
 
0.1%
None 52
 
< 0.1%
Modifier Letters 14
 
< 0.1%
Compat Jamo 13
 
< 0.1%
CJK Compat Ideographs 3
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
19720
33.3%
1 5310
 
9.0%
0 4604
 
7.8%
( 4581
 
7.7%
) 4579
 
7.7%
9 4480
 
7.6%
2 4305
 
7.3%
8 1475
 
2.5%
7 1189
 
2.0%
- 1087
 
1.8%
Other values (67) 7973
13.4%
Hangul
ValueCountFrequency (%)
4917
 
4.5%
4301
 
3.9%
3905
 
3.5%
3637
 
3.3%
2698
 
2.4%
2268
 
2.1%
2105
 
1.9%
2069
 
1.9%
2049
 
1.9%
2022
 
1.8%
Other values (636) 80491
72.9%
Number Forms
ValueCountFrequency (%)
55
31.8%
53
30.6%
29
16.8%
12
 
6.9%
8
 
4.6%
7
 
4.0%
3
 
1.7%
2
 
1.2%
2
 
1.2%
2
 
1.2%
None
ValueCountFrequency (%)
· 45
86.5%
2
 
3.8%
2
 
3.8%
× 1
 
1.9%
1
 
1.9%
1
 
1.9%
CJK
ValueCountFrequency (%)
15
 
7.0%
15
 
7.0%
15
 
7.0%
15
 
7.0%
15
 
7.0%
15
 
7.0%
10
 
4.7%
8
 
3.8%
5
 
2.3%
5
 
2.3%
Other values (48) 95
44.6%
Modifier Letters
ValueCountFrequency (%)
˙ 14
100.0%
Compat Jamo
ValueCountFrequency (%)
8
61.5%
2
 
15.4%
1
 
7.7%
1
 
7.7%
1
 
7.7%
CJK Compat Ideographs
ValueCountFrequency (%)
3
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Distinct2311
Distinct (%)26.4%
Missing6
Missing (%)0.1%
Memory size68.4 KiB
2023-12-12T15:51:41.707257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length103
Median length92
Mean length6.8091315
Min length2

Characters and Unicode

Total characters59505
Distinct characters442
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1763 ?
Unique (%)20.2%

Sample

1st row국가인권위원회 편
2nd row전라남도 농업기술원
3rd row전라남도 농업기술원
4th row전라남도 농업기술원
5th row전라남도 농업기술원
ValueCountFrequency (%)
통계청 728
 
5.8%
전라남도 348
 
2.8%
한국지방행정연구원 297
 
2.4%
경제기획원 279
 
2.2%
한국행정연구원 198
 
1.6%
국회예산정책처 161
 
1.3%
한국행정학회 153
 
1.2%
124
 
1.0%
내무부 123
 
1.0%
헌법재판소 119
 
1.0%
Other values (2814) 9993
79.8%
2023-12-12T15:51:42.101037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5680
 
9.5%
, 2975
 
5.0%
1844
 
3.1%
1633
 
2.7%
1600
 
2.7%
1282
 
2.2%
1152
 
1.9%
1141
 
1.9%
1112
 
1.9%
1076
 
1.8%
Other values (432) 40010
67.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 50192
84.3%
Space Separator 5680
 
9.5%
Other Punctuation 3030
 
5.1%
Lowercase Letter 307
 
0.5%
Uppercase Letter 188
 
0.3%
Decimal Number 57
 
0.1%
Open Punctuation 25
 
< 0.1%
Close Punctuation 24
 
< 0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1844
 
3.7%
1633
 
3.3%
1600
 
3.2%
1282
 
2.6%
1152
 
2.3%
1141
 
2.3%
1112
 
2.2%
1076
 
2.1%
1035
 
2.1%
1027
 
2.0%
Other values (374) 37290
74.3%
Lowercase Letter
ValueCountFrequency (%)
i 36
11.7%
o 35
11.4%
e 33
10.7%
n 31
10.1%
a 27
8.8%
t 19
 
6.2%
l 17
 
5.5%
h 15
 
4.9%
r 15
 
4.9%
s 13
 
4.2%
Other values (13) 66
21.5%
Uppercase Letter
ValueCountFrequency (%)
A 39
20.7%
O 33
17.6%
F 30
16.0%
E 9
 
4.8%
I 8
 
4.3%
N 7
 
3.7%
K 7
 
3.7%
D 7
 
3.7%
T 7
 
3.7%
C 6
 
3.2%
Other values (12) 35
18.6%
Other Punctuation
ValueCountFrequency (%)
, 2975
98.2%
· 29
 
1.0%
/ 17
 
0.6%
\ 9
 
0.3%
Decimal Number
ValueCountFrequency (%)
0 46
80.7%
1 9
 
15.8%
2 1
 
1.8%
5 1
 
1.8%
Open Punctuation
ValueCountFrequency (%)
( 24
96.0%
{ 1
 
4.0%
Space Separator
ValueCountFrequency (%)
5680
100.0%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 50107
84.2%
Common 8818
 
14.8%
Latin 495
 
0.8%
Han 85
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1844
 
3.7%
1633
 
3.3%
1600
 
3.2%
1282
 
2.6%
1152
 
2.3%
1141
 
2.3%
1112
 
2.2%
1076
 
2.1%
1035
 
2.1%
1027
 
2.0%
Other values (340) 37205
74.3%
Latin
ValueCountFrequency (%)
A 39
 
7.9%
i 36
 
7.3%
o 35
 
7.1%
e 33
 
6.7%
O 33
 
6.7%
n 31
 
6.3%
F 30
 
6.1%
a 27
 
5.5%
t 19
 
3.8%
l 17
 
3.4%
Other values (35) 195
39.4%
Han
ValueCountFrequency (%)
8
 
9.4%
7
 
8.2%
6
 
7.1%
6
 
7.1%
6
 
7.1%
6
 
7.1%
5
 
5.9%
5
 
5.9%
3
 
3.5%
調 3
 
3.5%
Other values (24) 30
35.3%
Common
ValueCountFrequency (%)
5680
64.4%
, 2975
33.7%
0 46
 
0.5%
· 29
 
0.3%
) 24
 
0.3%
( 24
 
0.3%
/ 17
 
0.2%
\ 9
 
0.1%
1 9
 
0.1%
- 2
 
< 0.1%
Other values (3) 3
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 50101
84.2%
ASCII 9284
 
15.6%
CJK 85
 
0.1%
None 29
 
< 0.1%
Compat Jamo 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5680
61.2%
, 2975
32.0%
0 46
 
0.5%
A 39
 
0.4%
i 36
 
0.4%
o 35
 
0.4%
e 33
 
0.4%
O 33
 
0.4%
n 31
 
0.3%
F 30
 
0.3%
Other values (47) 346
 
3.7%
Hangul
ValueCountFrequency (%)
1844
 
3.7%
1633
 
3.3%
1600
 
3.2%
1282
 
2.6%
1152
 
2.3%
1141
 
2.3%
1112
 
2.2%
1076
 
2.1%
1035
 
2.1%
1027
 
2.0%
Other values (334) 37199
74.2%
None
ValueCountFrequency (%)
· 29
100.0%
CJK
ValueCountFrequency (%)
8
 
9.4%
7
 
8.2%
6
 
7.1%
6
 
7.1%
6
 
7.1%
6
 
7.1%
5
 
5.9%
5
 
5.9%
3
 
3.5%
調 3
 
3.5%
Other values (24) 30
35.3%
Compat Jamo
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Distinct558
Distinct (%)6.4%
Missing0
Missing (%)0.0%
Memory size68.4 KiB
2023-12-12T15:51:42.388875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length23
Mean length5.4830189
Min length2

Characters and Unicode

Total characters47949
Distinct characters310
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique218 ?
Unique (%)2.5%

Sample

1st row국가인권위원회
2nd row전라남도 농업기술원
3rd row전라남도 농업기술원
4th row전라남도 농업기술원
5th row전라남도 농업기술원
ValueCountFrequency (%)
통계청 702
 
7.7%
한국행정연구원 570
 
6.2%
전라남도 328
 
3.6%
경제기획원 276
 
3.0%
한국노동연구원 254
 
2.8%
국토연구원 208
 
2.3%
없음 201
 
2.2%
한국문화관광연구원 198
 
2.2%
건축도시공간연구소 192
 
2.1%
국회예산정책처 167
 
1.8%
Other values (548) 6047
66.1%
2023-12-12T15:51:42.823135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2895
 
6.0%
2605
 
5.4%
2477
 
5.2%
2393
 
5.0%
1830
 
3.8%
1517
 
3.2%
1239
 
2.6%
1114
 
2.3%
995
 
2.1%
993
 
2.1%
Other values (300) 29891
62.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47293
98.6%
Space Separator 403
 
0.8%
Uppercase Letter 97
 
0.2%
Other Punctuation 78
 
0.2%
Close Punctuation 24
 
0.1%
Open Punctuation 24
 
0.1%
Decimal Number 23
 
< 0.1%
Lowercase Letter 5
 
< 0.1%
Other Symbol 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2895
 
6.1%
2605
 
5.5%
2477
 
5.2%
2393
 
5.1%
1830
 
3.9%
1517
 
3.2%
1239
 
2.6%
1114
 
2.4%
995
 
2.1%
993
 
2.1%
Other values (271) 29235
61.8%
Uppercase Letter
ValueCountFrequency (%)
A 17
17.5%
F 16
16.5%
O 16
16.5%
I 16
16.5%
K 14
14.4%
L 10
10.3%
D 4
 
4.1%
E 3
 
3.1%
N 1
 
1.0%
Decimal Number
ValueCountFrequency (%)
0 7
30.4%
2 6
26.1%
1 5
21.7%
7 3
13.0%
9 1
 
4.3%
3 1
 
4.3%
Lowercase Letter
ValueCountFrequency (%)
k 1
20.0%
o 1
20.0%
r 1
20.0%
e 1
20.0%
a 1
20.0%
Other Punctuation
ValueCountFrequency (%)
, 58
74.4%
/ 16
 
20.5%
& 3
 
3.8%
· 1
 
1.3%
Space Separator
ValueCountFrequency (%)
403
100.0%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%
Open Punctuation
ValueCountFrequency (%)
( 24
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47177
98.4%
Common 553
 
1.2%
Han 117
 
0.2%
Latin 102
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2895
 
6.1%
2605
 
5.5%
2477
 
5.3%
2393
 
5.1%
1830
 
3.9%
1517
 
3.2%
1239
 
2.6%
1114
 
2.4%
995
 
2.1%
993
 
2.1%
Other values (246) 29119
61.7%
Han
ValueCountFrequency (%)
13
 
11.1%
9
 
7.7%
9
 
7.7%
9
 
7.7%
9
 
7.7%
7
 
6.0%
7
 
6.0%
4
 
3.4%
4
 
3.4%
4
 
3.4%
Other values (16) 42
35.9%
Common
ValueCountFrequency (%)
403
72.9%
, 58
 
10.5%
) 24
 
4.3%
( 24
 
4.3%
/ 16
 
2.9%
0 7
 
1.3%
2 6
 
1.1%
1 5
 
0.9%
7 3
 
0.5%
& 3
 
0.5%
Other values (4) 4
 
0.7%
Latin
ValueCountFrequency (%)
A 17
16.7%
F 16
15.7%
O 16
15.7%
I 16
15.7%
K 14
13.7%
L 10
9.8%
D 4
 
3.9%
E 3
 
2.9%
k 1
 
1.0%
o 1
 
1.0%
Other values (4) 4
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 47175
98.4%
ASCII 654
 
1.4%
CJK 117
 
0.2%
None 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2895
 
6.1%
2605
 
5.5%
2477
 
5.3%
2393
 
5.1%
1830
 
3.9%
1517
 
3.2%
1239
 
2.6%
1114
 
2.4%
995
 
2.1%
993
 
2.1%
Other values (244) 29117
61.7%
ASCII
ValueCountFrequency (%)
403
61.6%
, 58
 
8.9%
) 24
 
3.7%
( 24
 
3.7%
A 17
 
2.6%
F 16
 
2.4%
/ 16
 
2.4%
O 16
 
2.4%
I 16
 
2.4%
K 14
 
2.1%
Other values (17) 50
 
7.6%
CJK
ValueCountFrequency (%)
13
 
11.1%
9
 
7.7%
9
 
7.7%
9
 
7.7%
9
 
7.7%
7
 
6.0%
7
 
6.0%
4
 
3.4%
4
 
3.4%
4
 
3.4%
Other values (16) 42
35.9%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
1
50.0%
· 1
50.0%
Distinct226
Distinct (%)2.6%
Missing49
Missing (%)0.6%
Memory size68.4 KiB
2023-12-12T15:51:43.122153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length4
Mean length4.3460212
Min length3

Characters and Unicode

Total characters37793
Distinct characters28
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique82 ?
Unique (%)0.9%

Sample

1st row2019
2nd row2018
3rd row2018
4th row2018
5th row2018
ValueCountFrequency (%)
2020 580
 
6.6%
2019 498
 
5.6%
2021 463
 
5.2%
2018 355
 
4.0%
2022 342
 
3.9%
1997 254
 
2.9%
1996 209
 
2.4%
2012 208
 
2.4%
2002 201
 
2.3%
2003 198
 
2.2%
Other values (212) 5520
62.5%
2023-12-12T15:51:43.590087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 8726
23.1%
0 8432
22.3%
1 7664
20.3%
9 6141
16.2%
8 1799
 
4.8%
3 1355
 
3.6%
7 1342
 
3.6%
6 1051
 
2.8%
5 577
 
1.5%
4 546
 
1.4%
Other values (18) 160
 
0.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 37633
99.6%
Space Separator 132
 
0.3%
Other Letter 25
 
0.1%
Math Symbol 1
 
< 0.1%
Uppercase Letter 1
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
16.0%
3
12.0%
3
12.0%
2
8.0%
2
8.0%
2
8.0%
2
8.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
Other values (4) 4
16.0%
Decimal Number
ValueCountFrequency (%)
2 8726
23.2%
0 8432
22.4%
1 7664
20.4%
9 6141
16.3%
8 1799
 
4.8%
3 1355
 
3.6%
7 1342
 
3.6%
6 1051
 
2.8%
5 577
 
1.5%
4 546
 
1.5%
Space Separator
ValueCountFrequency (%)
132
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%
Uppercase Letter
ValueCountFrequency (%)
E 1
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 37767
99.9%
Hangul 25
 
0.1%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
16.0%
3
12.0%
3
12.0%
2
8.0%
2
8.0%
2
8.0%
2
8.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
Other values (4) 4
16.0%
Common
ValueCountFrequency (%)
2 8726
23.1%
0 8432
22.3%
1 7664
20.3%
9 6141
16.3%
8 1799
 
4.8%
3 1355
 
3.6%
7 1342
 
3.6%
6 1051
 
2.8%
5 577
 
1.5%
4 546
 
1.4%
Other values (3) 134
 
0.4%
Latin
ValueCountFrequency (%)
E 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 37768
99.9%
Hangul 25
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 8726
23.1%
0 8432
22.3%
1 7664
20.3%
9 6141
16.3%
8 1799
 
4.8%
3 1355
 
3.6%
7 1342
 
3.6%
6 1051
 
2.8%
5 577
 
1.5%
4 546
 
1.4%
Other values (4) 135
 
0.4%
Hangul
ValueCountFrequency (%)
4
16.0%
3
12.0%
3
12.0%
2
8.0%
2
8.0%
2
8.0%
2
8.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
Other values (4) 4
16.0%

발행지
Text

MISSING 

Distinct187
Distinct (%)2.3%
Missing538
Missing (%)6.2%
Memory size68.4 KiB
2023-12-12T15:51:43.931761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length16
Mean length4.2094553
Min length1

Characters and Unicode

Total characters34547
Distinct characters136
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique72 ?
Unique (%)0.9%

Sample

1st row서울
2nd row나주
3rd row나주
4th row나주
5th row나주
ValueCountFrequency (%)
서울특별시 1829
21.2%
서울 1611
18.7%
세종특별자치시 844
 
9.8%
대전광역시 510
 
5.9%
전라남도 327
 
3.8%
경기도 180
 
2.1%
무안군 179
 
2.1%
서울시 166
 
1.9%
나주시 113
 
1.3%
울산광역시 107
 
1.2%
Other values (136) 2768
32.1%
2023-12-12T15:51:44.405414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4386
 
12.7%
3756
 
10.9%
3628
 
10.5%
2741
 
7.9%
2738
 
7.9%
1281
 
3.7%
1141
 
3.3%
1132
 
3.3%
1053
 
3.0%
896
 
2.6%
Other values (126) 11795
34.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 33915
98.2%
Space Separator 471
 
1.4%
Close Punctuation 39
 
0.1%
Open Punctuation 39
 
0.1%
Decimal Number 39
 
0.1%
Other Punctuation 38
 
0.1%
Dash Punctuation 4
 
< 0.1%
Modifier Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4386
 
12.9%
3756
 
11.1%
3628
 
10.7%
2741
 
8.1%
2738
 
8.1%
1281
 
3.8%
1141
 
3.4%
1132
 
3.3%
1053
 
3.1%
896
 
2.6%
Other values (108) 11163
32.9%
Decimal Number
ValueCountFrequency (%)
2 9
23.1%
1 7
17.9%
6 5
12.8%
8 4
10.3%
5 4
10.3%
3 3
 
7.7%
0 3
 
7.7%
9 2
 
5.1%
7 2
 
5.1%
Other Punctuation
ValueCountFrequency (%)
, 32
84.2%
* 3
 
7.9%
? 2
 
5.3%
/ 1
 
2.6%
Space Separator
ValueCountFrequency (%)
471
100.0%
Close Punctuation
ValueCountFrequency (%)
) 39
100.0%
Open Punctuation
ValueCountFrequency (%)
( 39
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 33915
98.2%
Common 632
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4386
 
12.9%
3756
 
11.1%
3628
 
10.7%
2741
 
8.1%
2738
 
8.1%
1281
 
3.8%
1141
 
3.4%
1132
 
3.3%
1053
 
3.1%
896
 
2.6%
Other values (108) 11163
32.9%
Common
ValueCountFrequency (%)
471
74.5%
) 39
 
6.2%
( 39
 
6.2%
, 32
 
5.1%
2 9
 
1.4%
1 7
 
1.1%
6 5
 
0.8%
8 4
 
0.6%
5 4
 
0.6%
- 4
 
0.6%
Other values (8) 18
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 33915
98.2%
ASCII 632
 
1.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4386
 
12.9%
3756
 
11.1%
3628
 
10.7%
2741
 
8.1%
2738
 
8.1%
1281
 
3.8%
1141
 
3.4%
1132
 
3.3%
1053
 
3.1%
896
 
2.6%
Other values (108) 11163
32.9%
ASCII
ValueCountFrequency (%)
471
74.5%
) 39
 
6.2%
( 39
 
6.2%
, 32
 
5.1%
2 9
 
1.4%
1 7
 
1.1%
6 5
 
0.8%
8 4
 
0.6%
5 4
 
0.6%
- 4
 
0.6%
Other values (8) 18
 
2.8%

원-복본구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size68.4 KiB
원본
8621 
복본
 
124

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row원본
2nd row원본
3rd row원본
4th row원본
5th row원본

Common Values

ValueCountFrequency (%)
원본 8621
98.6%
복본 124
 
1.4%

Length

2023-12-12T15:51:44.598674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:51:44.699824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
원본 8621
98.6%
복본 124
 
1.4%

가격
Real number (ℝ)

MISSING  SKEWED  ZEROS 

Distinct26
Distinct (%)0.3%
Missing928
Missing (%)10.6%
Infinite0
Infinite (%)0.0%
Mean91.037866
Minimum0
Maximum100000
Zeros7755
Zeros (%)88.7%
Negative0
Negative (%)0.0%
Memory size77.0 KiB
2023-12-12T15:51:44.811130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum100000
Range100000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1840.7301
Coefficient of variation (CV)20.21939
Kurtosis2247.1292
Mean91.037866
Median Absolute Deviation (MAD)0
Skewness43.18861
Sum711643
Variance3388287.4
MonotonicityNot monotonic
2023-12-12T15:51:44.956161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
0 7755
88.7%
12000 10
 
0.1%
6000 8
 
0.1%
10000 6
 
0.1%
8000 6
 
0.1%
15000 3
 
< 0.1%
8500 3
 
< 0.1%
1 3
 
< 0.1%
4500 2
 
< 0.1%
18000 2
 
< 0.1%
Other values (16) 19
 
0.2%
(Missing) 928
 
10.6%
ValueCountFrequency (%)
0 7755
88.7%
1 3
 
< 0.1%
2 1
 
< 0.1%
3 1
 
< 0.1%
9 1
 
< 0.1%
26 1
 
< 0.1%
1300 1
 
< 0.1%
2500 1
 
< 0.1%
3000 2
 
< 0.1%
3500 1
 
< 0.1%
ValueCountFrequency (%)
100000 2
 
< 0.1%
35000 1
 
< 0.1%
23000 1
 
< 0.1%
18000 2
 
< 0.1%
15000 3
 
< 0.1%
14000 1
 
< 0.1%
12000 10
0.1%
11000 1
 
< 0.1%
10000 6
0.1%
8500 3
 
< 0.1%

구분
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size68.4 KiB
8745 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
8745
100.0%

Length

2023-12-12T15:51:45.382385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:51:45.488162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
8745
100.0%
Distinct3118
Distinct (%)35.7%
Missing10
Missing (%)0.1%
Memory size68.4 KiB
2023-12-12T15:51:45.908950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length3
Mean length5.8710933
Min length2

Characters and Unicode

Total characters51284
Distinct characters64
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1815 ?
Unique (%)20.8%

Sample

1st row1082 페이지,23 센치미터
2nd row59,19_26
3rd row53
4th row49
5th row41
ValueCountFrequency (%)
센치미터 52
 
0.6%
0페이지,0센치미터 48
 
0.5%
1000 41
 
0.5%
289 25
 
0.3%
193 23
 
0.3%
233 23
 
0.3%
198 22
 
0.2%
260 22
 
0.2%
174 22
 
0.2%
200 21
 
0.2%
Other values (3106) 8591
96.6%
2023-12-12T15:51:46.495181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 6716
 
13.1%
1 4482
 
8.7%
3 3631
 
7.1%
5 3071
 
6.0%
6 3008
 
5.9%
, 2909
 
5.7%
4 2764
 
5.4%
2309
 
4.5%
2309
 
4.5%
2309
 
4.5%
Other values (54) 17776
34.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 31964
62.3%
Other Letter 16087
31.4%
Other Punctuation 2913
 
5.7%
Space Separator 179
 
0.3%
Lowercase Letter 117
 
0.2%
Close Punctuation 8
 
< 0.1%
Open Punctuation 8
 
< 0.1%
Math Symbol 4
 
< 0.1%
Letter Number 2
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2309
14.4%
2309
14.4%
2309
14.4%
2309
14.4%
2214
13.8%
2213
13.8%
2213
13.8%
47
 
0.3%
37
 
0.2%
34
 
0.2%
Other values (28) 93
 
0.6%
Decimal Number
ValueCountFrequency (%)
2 6716
21.0%
1 4482
14.0%
3 3631
11.4%
5 3071
9.6%
6 3008
9.4%
4 2764
8.6%
0 2181
 
6.8%
7 2181
 
6.8%
9 1971
 
6.2%
8 1959
 
6.1%
Other Punctuation
ValueCountFrequency (%)
, 2909
99.9%
* 2
 
0.1%
· 1
 
< 0.1%
1
 
< 0.1%
Lowercase Letter
ValueCountFrequency (%)
x 55
47.0%
i 40
34.2%
v 22
 
18.8%
Math Symbol
ValueCountFrequency (%)
× 3
75.0%
+ 1
 
25.0%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
179
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 35078
68.4%
Hangul 16085
31.4%
Latin 119
 
0.2%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2309
14.4%
2309
14.4%
2309
14.4%
2309
14.4%
2214
13.8%
2213
13.8%
2213
13.8%
47
 
0.3%
37
 
0.2%
34
 
0.2%
Other values (26) 91
 
0.6%
Common
ValueCountFrequency (%)
2 6716
19.1%
1 4482
12.8%
3 3631
10.4%
5 3071
8.8%
6 3008
8.6%
, 2909
8.3%
4 2764
7.9%
0 2181
 
6.2%
7 2181
 
6.2%
9 1971
 
5.6%
Other values (11) 2164
 
6.2%
Latin
ValueCountFrequency (%)
x 55
46.2%
i 40
33.6%
v 22
 
18.5%
1
 
0.8%
1
 
0.8%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 35190
68.6%
Hangul 16085
31.4%
None 5
 
< 0.1%
CJK 2
 
< 0.1%
Number Forms 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 6716
19.1%
1 4482
12.7%
3 3631
10.3%
5 3071
8.7%
6 3008
8.5%
, 2909
8.3%
4 2764
7.9%
0 2181
 
6.2%
7 2181
 
6.2%
9 1971
 
5.6%
Other values (11) 2276
 
6.5%
Hangul
ValueCountFrequency (%)
2309
14.4%
2309
14.4%
2309
14.4%
2309
14.4%
2214
13.8%
2213
13.8%
2213
13.8%
47
 
0.3%
37
 
0.2%
34
 
0.2%
Other values (26) 91
 
0.6%
None
ValueCountFrequency (%)
× 3
60.0%
· 1
 
20.0%
1
 
20.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%

Interactions

2023-12-12T15:51:38.024847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:51:37.858897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:51:38.115287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:51:37.933424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:51:46.589747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번배가상태원-복본구분가격
연번1.0000.1000.1920.128
배가상태0.1001.0000.0000.000
원-복본구분0.1920.0001.0000.090
가격0.1280.0000.0901.000
2023-12-12T15:51:46.682762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
원-복본구분배가상태
원-복본구분1.0000.000
배가상태0.0001.000
2023-12-12T15:51:46.803759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번가격배가상태원-복본구분
연번1.0000.0140.0600.147
가격0.0141.0000.0000.110
배가상태0.0600.0001.0000.000
원-복본구분0.1470.1100.0001.000

Missing values

2023-12-12T15:51:38.278703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:51:38.471870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T15:51:38.617282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번관리구분배가상태이용제한구분등록번호청구기호자료실명서명저작자발행자발행년발행지원-복본구분가격구분형태사항
01강항지식정보센터비치자료일반SB00000000003421-국12ㄱ-제11집(2018)강항종합자료실(국가인권위원회)결정례집 인권정책, 침해구제 차별시정 제11집(2018)국가인권위원회 편국가인권위원회2019서울원본01082 페이지,23 센치미터
12강항지식정보센터비치자료일반SB000000000132599-전292ㅍ강항종합자료실포도 생산비 절감 경영매뉴얼전라남도 농업기술원전라남도 농업기술원2018나주원본059,19_26
23강항지식정보센터비치자료일반SB000000000232599-전292ㄴ강항종합자료실녹차 생산비 절감 경영 매뉴얼전라남도 농업기술원전라남도 농업기술원2018나주원본053
34강항지식정보센터비치자료일반SB000000000332599-전292ㅇ강항종합자료실오리 생산비 절감 경영 매뉴얼전라남도 농업기술원전라남도 농업기술원2018나주원본049
45강항지식정보센터비치자료일반SB000000000432599-전292ㅎ강항종합자료실한우 생산비 절감 경영 매뉴얼전라남도 농업기술원전라남도 농업기술원2018나주원본041
56강항지식정보센터비치자료일반SB000000000532599-전292ㄱ강항종합자료실고구마 생산비 절감 경영매뉴얼전라남도 농업기술원전라남도 농업기술원2018나주원본043
67강항지식정보센터비치자료일반SB000000000632599-전292ㅆ강항종합자료실쌀 생산비 절감 경영 매뉴얼전라남도 농업기술원전라남도 농업기술원2018나주원본053
78강항지식정보센터비치자료일반SB000000000732599-전292ㄷ강항종합자료실단감 생산비 절감 경영 매뉴얼전라남도 농업기술원전라남도 농업기술원2018나주원본065
89강항지식정보센터비치자료일반SB000000000832599-전292ㅅ강항종합자료실시설호박 생산비 절감 경영 매뉴얼전라남도 농업기술원전라남도 농업기술원2018나주원본046
910강항지식정보센터비치자료일반SB000000000932599-전292ㅇ강항종합자료실유자 생산비 절감 경영 매뉴얼전라남도 농업기술원전라남도 농업기술원2018나주원본048
연번관리구분배가상태이용제한구분등록번호청구기호자료실명서명저작자발행자발행년발행지원-복본구분가격구분형태사항
87358736강항지식정보센터비치자료일반SB00000087383644-대13ㅂ=2강항종합자료실범죄분석통권 제145호, 2012년대검찰청대검찰청2012 10 15서울복본<NA>689페이지,26센치미터
87368737강항지식정보센터비치자료일반SB00000087393672-대14ㅂ강항종합자료실(2015)범죄분석대검찰청대검찰청2015서울특별시원본0923
87378738강항지식정보센터비치자료일반SB00000087403672-대14ㅂ강항종합자료실(2016)범죄분석대검찰청대검찰청2016서울특별시원본0903
87388739강항지식정보센터비치자료일반SB00000087413672-대14ㅂ강항종합자료실(2017)범죄분석대검찰청대검찰청2017서울특별시원본0902
87398740강항지식정보센터비치자료일반SB00000087343672-대13ㅂ강항종합자료실범죄분석(2006)대검찰청대검찰청20061020서울원본<NA>589페이지,26센치미터
87408741강항지식정보센터비치자료일반SB0000008705540-서57ㅈ강항종합자료실전국 한옥분포 현황조사 - 대구 및 나주편서수정, 옥채원건축도시공간연구소201212서울원본<NA>374,247
87418742강항지식정보센터비치자료일반SB0000008706540-유16강항종합자료실2012 한옥산업 현황조사유광흠, 신민종건축도시공간연구소201212서울원본<NA>217,247
87428743강항지식정보센터비치자료일반SB000000871333147-건817ㅎ강항종합자료실한옥문화의 세계화를 위한 인문학적 가치 발굴 연구(2)건축도시공간연구소 국가한옥센터건축도시공간연구소2013 12 31경기도원본<NA>181페이지,21센치미터
87438744강항지식정보센터비치자료일반SB00000087313672-대13ㅂ-2002강항종합자료실범죄분석대검찰청대종파이오2002없음원본0531페이지,26센치미터
87448745강항지식정보센터비치자료일반SB00000087333672-대13 ㅂ강항종합자료실범죄분석-범죄개관-대검찰청없음대종파이오2004없음원본0559페이지,26센치미터