Overview

Dataset statistics

Number of variables11
Number of observations1652
Missing cells1481
Missing cells (%)8.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory145.3 KiB
Average record size in memory90.1 B

Variable types

Numeric2
Categorical4
Text5

Dataset

Description전라남도 진도군 옥외광고물현황 공공데이터는 진도군 옥외광고물의 광고물종류, 규격, 표시장소 등 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15063722/fileData.do

Alerts

순번 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 순번 and 1 other fieldsHigh correlation
광고물종류 is highly overall correlated with 구분High correlation
수량 is highly imbalanced (88.2%)Imbalance
표시층 has 1217 (73.7%) missing valuesMissing
표시장소_도로명 has 263 (15.9%) missing valuesMissing
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 20:29:20.915626
Analysis finished2023-12-12 20:29:23.325110
Duration2.41 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1652
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean826.5
Minimum1
Maximum1652
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.6 KiB
2023-12-13T05:29:23.409869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile83.55
Q1413.75
median826.5
Q31239.25
95-th percentile1569.45
Maximum1652
Range1651
Interquartile range (IQR)825.5

Descriptive statistics

Standard deviation477.03564
Coefficient of variation (CV)0.57717561
Kurtosis-1.2
Mean826.5
Median Absolute Deviation (MAD)413
Skewness0
Sum1365378
Variance227563
MonotonicityStrictly increasing
2023-12-13T05:29:23.568763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
1100 1
 
0.1%
1110 1
 
0.1%
1109 1
 
0.1%
1108 1
 
0.1%
1107 1
 
0.1%
1106 1
 
0.1%
1105 1
 
0.1%
1104 1
 
0.1%
1103 1
 
0.1%
Other values (1642) 1642
99.4%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1652 1
0.1%
1651 1
0.1%
1650 1
0.1%
1649 1
0.1%
1648 1
0.1%
1647 1
0.1%
1646 1
0.1%
1645 1
0.1%
1644 1
0.1%
1643 1
0.1%

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.0 KiB
신고
935 
허가
717 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row신고
2nd row신고
3rd row신고
4th row신고
5th row허가

Common Values

ValueCountFrequency (%)
신고 935
56.6%
허가 717
43.4%

Length

2023-12-13T05:29:23.718842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:29:23.845566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신고 935
56.6%
허가 717
43.4%

광고물종류
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size13.0 KiB
벽면이용간판(가로형)
768 
돌출간판
306 
지주이용 간판
287 
가로형간판_입체형
174 
현수막게시틀
 
43
Other values (9)
 
74

Length

Max length12
Median length11
Mean length8.4219128
Min length2

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row가로형간판_입체형
2nd row지주이용 간판
3rd row지주이용 간판
4th row벽면이용간판(가로형)
5th row창문이용 광고물

Common Values

ValueCountFrequency (%)
벽면이용간판(가로형) 768
46.5%
돌출간판 306
 
18.5%
지주이용 간판 287
 
17.4%
가로형간판_입체형 174
 
10.5%
현수막게시틀 43
 
2.6%
옥상간판 42
 
2.5%
벽면이용간판(세로형) 11
 
0.7%
선전탑 6
 
0.4%
창문이용 광고물 4
 
0.2%
아치광고물 4
 
0.2%
Other values (4) 7
 
0.4%

Length

2023-12-13T05:29:23.977914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
벽면이용간판(가로형 768
39.4%
돌출간판 306
 
15.7%
지주이용 287
 
14.7%
간판 287
 
14.7%
가로형간판_입체형 174
 
8.9%
현수막게시틀 43
 
2.2%
옥상간판 42
 
2.2%
벽면이용간판(세로형 11
 
0.6%
광고물 7
 
0.4%
선전탑 6
 
0.3%
Other values (7) 17
 
0.9%

규격
Text

Distinct879
Distinct (%)53.2%
Missing0
Missing (%)0.0%
Memory size13.0 KiB
2023-12-13T05:29:24.438663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length50
Mean length9.1900726
Min length3

Characters and Unicode

Total characters15182
Distinct characters163
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique671 ?
Unique (%)40.6%

Sample

1st row8*2.8 : 입체형간판 면적 산출 : 8M*2.8M*0.7=15.68
2nd row1*2 1*2 높이:2
3rd row2*0.9 높이:3
4th row7*1
5th row1.6*0.4 : 1층 전광류 사용
ValueCountFrequency (%)
231
 
8.8%
4*0.5 66
 
2.5%
녹색디자인거리 66
 
2.5%
간판정비사업 57
 
2.2%
적합 52
 
2.0%
1*1 50
 
1.9%
높이:1 50
 
1.9%
녹색디자인 33
 
1.3%
조성사업 33
 
1.3%
6*0.7 33
 
1.3%
Other values (877) 1947
74.4%
2023-12-13T05:29:25.118557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1836
12.1%
* 1766
11.6%
1754
11.6%
. 1684
11.1%
1 1224
 
8.1%
5 791
 
5.2%
2 635
 
4.2%
8 564
 
3.7%
3 536
 
3.5%
4 457
 
3.0%
Other values (153) 3935
25.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7140
47.0%
Other Punctuation 3839
25.3%
Other Letter 2362
 
15.6%
Space Separator 1754
 
11.6%
Lowercase Letter 23
 
0.2%
Uppercase Letter 23
 
0.2%
Close Punctuation 19
 
0.1%
Open Punctuation 19
 
0.1%
Other Symbol 2
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
175
 
7.4%
168
 
7.1%
116
 
4.9%
113
 
4.8%
109
 
4.6%
103
 
4.4%
102
 
4.3%
101
 
4.3%
101
 
4.3%
101
 
4.3%
Other values (128) 1173
49.7%
Decimal Number
ValueCountFrequency (%)
0 1836
25.7%
1 1224
17.1%
5 791
11.1%
2 635
 
8.9%
8 564
 
7.9%
3 536
 
7.5%
4 457
 
6.4%
6 447
 
6.3%
7 405
 
5.7%
9 245
 
3.4%
Other Punctuation
ValueCountFrequency (%)
* 1766
46.0%
. 1684
43.9%
: 388
 
10.1%
/ 1
 
< 0.1%
Lowercase Letter
ValueCountFrequency (%)
m 19
82.6%
x 4
 
17.4%
Uppercase Letter
ValueCountFrequency (%)
M 15
65.2%
X 8
34.8%
Close Punctuation
ValueCountFrequency (%)
) 12
63.2%
] 7
36.8%
Open Punctuation
ValueCountFrequency (%)
( 12
63.2%
[ 7
36.8%
Space Separator
ValueCountFrequency (%)
1754
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Math Symbol
ValueCountFrequency (%)
= 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12774
84.1%
Hangul 2362
 
15.6%
Latin 46
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
175
 
7.4%
168
 
7.1%
116
 
4.9%
113
 
4.8%
109
 
4.6%
103
 
4.4%
102
 
4.3%
101
 
4.3%
101
 
4.3%
101
 
4.3%
Other values (128) 1173
49.7%
Common
ValueCountFrequency (%)
0 1836
14.4%
* 1766
13.8%
1754
13.7%
. 1684
13.2%
1 1224
9.6%
5 791
6.2%
2 635
 
5.0%
8 564
 
4.4%
3 536
 
4.2%
4 457
 
3.6%
Other values (11) 1527
12.0%
Latin
ValueCountFrequency (%)
m 19
41.3%
M 15
32.6%
X 8
17.4%
x 4
 
8.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12818
84.4%
Hangul 2362
 
15.6%
CJK Compat 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1836
14.3%
* 1766
13.8%
1754
13.7%
. 1684
13.1%
1 1224
9.5%
5 791
6.2%
2 635
 
5.0%
8 564
 
4.4%
3 536
 
4.2%
4 457
 
3.6%
Other values (14) 1571
12.3%
Hangul
ValueCountFrequency (%)
175
 
7.4%
168
 
7.1%
116
 
4.9%
113
 
4.8%
109
 
4.6%
103
 
4.4%
102
 
4.3%
101
 
4.3%
101
 
4.3%
101
 
4.3%
Other values (128) 1173
49.7%
CJK Compat
ValueCountFrequency (%)
2
100.0%
Distinct809
Distinct (%)49.0%
Missing0
Missing (%)0.0%
Memory size13.0 KiB
2023-12-13T05:29:25.584492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length37
Mean length7.3196126
Min length3

Characters and Unicode

Total characters12092
Distinct characters28
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique586 ?
Unique (%)35.5%

Sample

1st row8*2.8
2nd row1*2 1*2 높이:2
3rd row2*0.9 높이:3
4th row7*1
5th row1.6*0.4
ValueCountFrequency (%)
4*0.5 66
 
3.4%
높이:1 50
 
2.6%
1*1 50
 
2.6%
6*0.7 33
 
1.7%
5*1 21
 
1.1%
0.3*1 20
 
1.0%
높이:5 20
 
1.0%
90*600 19
 
1.0%
7*1 19
 
1.0%
4*1 18
 
0.9%
Other values (776) 1619
83.7%
2023-12-13T05:29:26.189882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1821
15.1%
* 1759
14.5%
. 1664
13.8%
1 1205
10.0%
1065
8.8%
5 784
6.5%
2 621
 
5.1%
8 560
 
4.6%
3 531
 
4.4%
4 450
 
3.7%
Other values (18) 1632
13.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7050
58.3%
Other Punctuation 3591
29.7%
Space Separator 1065
 
8.8%
Other Letter 347
 
2.9%
Uppercase Letter 21
 
0.2%
Lowercase Letter 12
 
0.1%
Other Symbol 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1821
25.8%
1 1205
17.1%
5 784
11.1%
2 621
 
8.8%
8 560
 
7.9%
3 531
 
7.5%
4 450
 
6.4%
6 444
 
6.3%
7 400
 
5.7%
9 234
 
3.3%
Other Letter
ValueCountFrequency (%)
168
48.4%
168
48.4%
5
 
1.4%
2
 
0.6%
1
 
0.3%
1
 
0.3%
1
 
0.3%
1
 
0.3%
Other Punctuation
ValueCountFrequency (%)
* 1759
49.0%
. 1664
46.3%
: 168
 
4.7%
Uppercase Letter
ValueCountFrequency (%)
M 13
61.9%
X 8
38.1%
Space Separator
ValueCountFrequency (%)
1065
100.0%
Lowercase Letter
ValueCountFrequency (%)
m 12
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 11712
96.9%
Hangul 347
 
2.9%
Latin 33
 
0.3%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1821
15.5%
* 1759
15.0%
. 1664
14.2%
1 1205
10.3%
1065
9.1%
5 784
6.7%
2 621
 
5.3%
8 560
 
4.8%
3 531
 
4.5%
4 450
 
3.8%
Other values (7) 1252
10.7%
Hangul
ValueCountFrequency (%)
168
48.4%
168
48.4%
5
 
1.4%
2
 
0.6%
1
 
0.3%
1
 
0.3%
1
 
0.3%
1
 
0.3%
Latin
ValueCountFrequency (%)
M 13
39.4%
m 12
36.4%
X 8
24.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 11743
97.1%
Hangul 347
 
2.9%
CJK Compat 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1821
15.5%
* 1759
15.0%
. 1664
14.2%
1 1205
10.3%
1065
9.1%
5 784
6.7%
2 621
 
5.3%
8 560
 
4.8%
3 531
 
4.5%
4 450
 
3.8%
Other values (9) 1283
10.9%
Hangul
ValueCountFrequency (%)
168
48.4%
168
48.4%
5
 
1.4%
2
 
0.6%
1
 
0.3%
1
 
0.3%
1
 
0.3%
1
 
0.3%
CJK Compat
ValueCountFrequency (%)
2
100.0%

수량
Categorical

IMBALANCE 

Distinct10
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size13.0 KiB
1
1567 
2
 
53
3
 
11
4
 
6
7
 
4
Other values (5)
 
11

Length

Max length3
Median length1
Mean length1.003632
Min length1

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 1567
94.9%
2 53
 
3.2%
3 11
 
0.7%
4 6
 
0.4%
7 4
 
0.2%
5 4
 
0.2%
1개 3
 
0.2%
6 2
 
0.1%
34 1
 
0.1%
10개 1
 
0.1%

Length

2023-12-13T05:29:26.442295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:29:26.618353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 1567
94.9%
2 53
 
3.2%
3 11
 
0.7%
4 6
 
0.4%
7 4
 
0.2%
5 4
 
0.2%
1개 3
 
0.2%
6 2
 
0.1%
34 1
 
0.1%
10개 1
 
0.1%

조명
Categorical

Distinct5
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size13.0 KiB
비조명
1070 
기타재료
439 
일반전기
 
68
전광(간접조명이용)
 
52
네온사인
 
23

Length

Max length10
Median length3
Mean length3.5411622
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row비조명
2nd row비조명
3rd row비조명
4th row비조명
5th row전광(간접조명이용)

Common Values

ValueCountFrequency (%)
비조명 1070
64.8%
기타재료 439
26.6%
일반전기 68
 
4.1%
전광(간접조명이용) 52
 
3.1%
네온사인 23
 
1.4%

Length

2023-12-13T05:29:26.791760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:29:26.941008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
비조명 1070
64.8%
기타재료 439
26.6%
일반전기 68
 
4.1%
전광(간접조명이용 52
 
3.1%
네온사인 23
 
1.4%

표시층
Real number (ℝ)

MISSING 

Distinct7
Distinct (%)1.6%
Missing1217
Missing (%)73.7%
Infinite0
Infinite (%)0.0%
Mean1.3333333
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.6 KiB
2023-12-13T05:29:27.085517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q31
95-th percentile3
Maximum7
Range6
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.7384544
Coefficient of variation (CV)0.5538408
Kurtosis15.092353
Mean1.3333333
Median Absolute Deviation (MAD)0
Skewness3.312307
Sum580
Variance0.5453149
MonotonicityNot monotonic
2023-12-13T05:29:27.215940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
1 332
 
20.1%
2 78
 
4.7%
3 15
 
0.9%
4 6
 
0.4%
5 2
 
0.1%
7 1
 
0.1%
6 1
 
0.1%
(Missing) 1217
73.7%
ValueCountFrequency (%)
1 332
20.1%
2 78
 
4.7%
3 15
 
0.9%
4 6
 
0.4%
5 2
 
0.1%
6 1
 
0.1%
7 1
 
0.1%
ValueCountFrequency (%)
7 1
 
0.1%
6 1
 
0.1%
5 2
 
0.1%
4 6
 
0.4%
3 15
 
0.9%
2 78
 
4.7%
1 332
20.1%
Distinct815
Distinct (%)49.3%
Missing0
Missing (%)0.0%
Memory size13.0 KiB
2023-12-13T05:29:27.650113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length36
Mean length23.628329
Min length15

Characters and Unicode

Total characters39034
Distinct characters161
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique509 ?
Unique (%)30.8%

Sample

1st row전라남도 진도군 군내면 둔전리 1782 선진농협 군내지점
2nd row전라남도 진도군 진도읍 교동리 560 한국농촌공사진도지사
3rd row전라남도 진도군 의신면 금갑리 554-1
4th row전라남도 진도군 진도읍 남동리
5th row전라남도 진도군 진도읍 남동리 779-2
ValueCountFrequency (%)
전라남도 1652
20.0%
진도군 1649
20.0%
진도읍 1229
14.9%
남동리 584
 
7.1%
성내리 203
 
2.5%
쌍정리 175
 
2.1%
동외리 127
 
1.5%
임회면 112
 
1.4%
의신면 111
 
1.3%
군내면 96
 
1.2%
Other values (822) 2303
27.9%
2023-12-13T05:29:28.274978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9993
25.6%
4549
11.7%
2924
 
7.5%
2236
 
5.7%
1816
 
4.7%
1686
 
4.3%
1654
 
4.2%
1622
 
4.2%
- 1242
 
3.2%
1229
 
3.1%
Other values (151) 10083
25.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21762
55.8%
Space Separator 9993
25.6%
Decimal Number 6029
 
15.4%
Dash Punctuation 1242
 
3.2%
Uppercase Letter 6
 
< 0.1%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4549
20.9%
2924
13.4%
2236
10.3%
1816
 
8.3%
1686
 
7.7%
1654
 
7.6%
1622
 
7.5%
1229
 
5.6%
820
 
3.8%
419
 
1.9%
Other values (133) 2807
12.9%
Decimal Number
ValueCountFrequency (%)
1 1180
19.6%
5 756
12.5%
4 692
11.5%
2 675
11.2%
7 630
10.4%
3 594
9.9%
6 485
8.0%
0 358
 
5.9%
9 348
 
5.8%
8 311
 
5.2%
Uppercase Letter
ValueCountFrequency (%)
G 2
33.3%
L 2
33.3%
B 1
16.7%
P 1
16.7%
Space Separator
ValueCountFrequency (%)
9993
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1242
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21762
55.8%
Common 17266
44.2%
Latin 6
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4549
20.9%
2924
13.4%
2236
10.3%
1816
 
8.3%
1686
 
7.7%
1654
 
7.6%
1622
 
7.5%
1229
 
5.6%
820
 
3.8%
419
 
1.9%
Other values (133) 2807
12.9%
Common
ValueCountFrequency (%)
9993
57.9%
- 1242
 
7.2%
1 1180
 
6.8%
5 756
 
4.4%
4 692
 
4.0%
2 675
 
3.9%
7 630
 
3.6%
3 594
 
3.4%
6 485
 
2.8%
0 358
 
2.1%
Other values (4) 661
 
3.8%
Latin
ValueCountFrequency (%)
G 2
33.3%
L 2
33.3%
B 1
16.7%
P 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21762
55.8%
ASCII 17272
44.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9993
57.9%
- 1242
 
7.2%
1 1180
 
6.8%
5 756
 
4.4%
4 692
 
4.0%
2 675
 
3.9%
7 630
 
3.6%
3 594
 
3.4%
6 485
 
2.8%
0 358
 
2.1%
Other values (8) 667
 
3.9%
Hangul
ValueCountFrequency (%)
4549
20.9%
2924
13.4%
2236
10.3%
1816
 
8.3%
1686
 
7.7%
1654
 
7.6%
1622
 
7.5%
1229
 
5.6%
820
 
3.8%
419
 
1.9%
Other values (133) 2807
12.9%
Distinct703
Distinct (%)50.6%
Missing263
Missing (%)15.9%
Memory size13.0 KiB
2023-12-13T05:29:28.653573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length35
Mean length21.090713
Min length18

Characters and Unicode

Total characters29295
Distinct characters230
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique416 ?
Unique (%)29.9%

Sample

1st row전라남도 진도군 군내면 진도대로 7954 선진농협 군내지점
2nd row전라남도 진도군 진도읍 교동4길 25 교동회관
3rd row전라남도 진도군 의신면 진도대로 3074
4th row전라남도 진도군 진도읍 남문길 5 진도공용버스터미널
5th row전라남도 진도군 진도읍 남문길 4
ValueCountFrequency (%)
전라남도 1389
19.6%
진도군 1386
19.6%
진도읍 1018
14.4%
남문길 237
 
3.3%
옥주길 134
 
1.9%
남동1길 112
 
1.6%
임회면 106
 
1.5%
진도대로 106
 
1.5%
의신면 98
 
1.4%
쌍정2길 85
 
1.2%
Other values (538) 2415
34.1%
2023-12-13T05:29:29.210083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5732
19.6%
3938
13.4%
2533
 
8.6%
1804
 
6.2%
1528
 
5.2%
1402
 
4.8%
1398
 
4.8%
1205
 
4.1%
1018
 
3.5%
1 953
 
3.3%
Other values (220) 7784
26.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 19119
65.3%
Space Separator 5732
 
19.6%
Decimal Number 3815
 
13.0%
Dash Punctuation 434
 
1.5%
Open Punctuation 94
 
0.3%
Close Punctuation 94
 
0.3%
Uppercase Letter 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3938
20.6%
2533
13.2%
1804
9.4%
1528
 
8.0%
1402
 
7.3%
1398
 
7.3%
1205
 
6.3%
1018
 
5.3%
369
 
1.9%
362
 
1.9%
Other values (202) 3562
18.6%
Decimal Number
ValueCountFrequency (%)
1 953
25.0%
2 603
15.8%
3 505
13.2%
4 459
12.0%
5 318
 
8.3%
6 261
 
6.8%
7 204
 
5.3%
8 202
 
5.3%
0 163
 
4.3%
9 147
 
3.9%
Uppercase Letter
ValueCountFrequency (%)
G 3
42.9%
L 2
28.6%
P 1
 
14.3%
S 1
 
14.3%
Space Separator
ValueCountFrequency (%)
5732
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 434
100.0%
Open Punctuation
ValueCountFrequency (%)
( 94
100.0%
Close Punctuation
ValueCountFrequency (%)
) 94
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 19119
65.3%
Common 10169
34.7%
Latin 7
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3938
20.6%
2533
13.2%
1804
9.4%
1528
 
8.0%
1402
 
7.3%
1398
 
7.3%
1205
 
6.3%
1018
 
5.3%
369
 
1.9%
362
 
1.9%
Other values (202) 3562
18.6%
Common
ValueCountFrequency (%)
5732
56.4%
1 953
 
9.4%
2 603
 
5.9%
3 505
 
5.0%
4 459
 
4.5%
- 434
 
4.3%
5 318
 
3.1%
6 261
 
2.6%
7 204
 
2.0%
8 202
 
2.0%
Other values (4) 498
 
4.9%
Latin
ValueCountFrequency (%)
G 3
42.9%
L 2
28.6%
P 1
 
14.3%
S 1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 19119
65.3%
ASCII 10176
34.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5732
56.3%
1 953
 
9.4%
2 603
 
5.9%
3 505
 
5.0%
4 459
 
4.5%
- 434
 
4.3%
5 318
 
3.1%
6 261
 
2.6%
7 204
 
2.0%
8 202
 
2.0%
Other values (8) 505
 
5.0%
Hangul
ValueCountFrequency (%)
3938
20.6%
2533
13.2%
1804
9.4%
1528
 
8.0%
1402
 
7.3%
1398
 
7.3%
1205
 
6.3%
1018
 
5.3%
369
 
1.9%
362
 
1.9%
Other values (202) 3562
18.6%
Distinct1390
Distinct (%)84.2%
Missing1
Missing (%)0.1%
Memory size13.0 KiB
2023-12-13T05:29:29.584859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length53
Mean length7.4609328
Min length1

Characters and Unicode

Total characters12318
Distinct characters665
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1225 ?
Unique (%)74.2%

Sample

1st row선진농협 자재센터
2nd row교동회관 information
3rd row학수동다매재 법무사허의천사무소
4th row오렌케이크
5th row이바돔감자탕
ValueCountFrequency (%)
gs칼텍스 21
 
0.9%
미용실(싸인볼 20
 
0.8%
진도 13
 
0.5%
lg 13
 
0.5%
미니스톱 11
 
0.5%
진도점 10
 
0.4%
진도군 9
 
0.4%
lg정유 8
 
0.3%
에스케이(sk 7
 
0.3%
교보생명 7
 
0.3%
Other values (1820) 2275
95.0%
2023-12-13T05:29:30.106661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
784
 
6.4%
284
 
2.3%
282
 
2.3%
) 175
 
1.4%
( 173
 
1.4%
159
 
1.3%
155
 
1.3%
151
 
1.2%
150
 
1.2%
139
 
1.1%
Other values (655) 9866
80.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9954
80.8%
Space Separator 784
 
6.4%
Uppercase Letter 564
 
4.6%
Lowercase Letter 319
 
2.6%
Decimal Number 257
 
2.1%
Close Punctuation 175
 
1.4%
Open Punctuation 173
 
1.4%
Other Punctuation 51
 
0.4%
Dash Punctuation 33
 
0.3%
Other Symbol 4
 
< 0.1%
Other values (3) 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
284
 
2.9%
282
 
2.8%
159
 
1.6%
155
 
1.6%
151
 
1.5%
150
 
1.5%
139
 
1.4%
125
 
1.3%
124
 
1.2%
122
 
1.2%
Other values (578) 8263
83.0%
Uppercase Letter
ValueCountFrequency (%)
S 74
13.1%
G 67
11.9%
O 56
 
9.9%
L 55
 
9.8%
K 34
 
6.0%
E 29
 
5.1%
I 28
 
5.0%
T 27
 
4.8%
C 21
 
3.7%
H 20
 
3.5%
Other values (16) 153
27.1%
Lowercase Letter
ValueCountFrequency (%)
l 30
 
9.4%
a 29
 
9.1%
n 27
 
8.5%
c 26
 
8.2%
e 25
 
7.8%
i 24
 
7.5%
o 23
 
7.2%
k 18
 
5.6%
s 17
 
5.3%
p 11
 
3.4%
Other values (14) 89
27.9%
Decimal Number
ValueCountFrequency (%)
5 46
17.9%
2 42
16.3%
0 38
14.8%
4 31
12.1%
1 31
12.1%
3 24
9.3%
8 14
 
5.4%
7 13
 
5.1%
6 13
 
5.1%
9 5
 
1.9%
Other Punctuation
ValueCountFrequency (%)
. 28
54.9%
& 8
 
15.7%
/ 4
 
7.8%
% 2
 
3.9%
" 2
 
3.9%
! 2
 
3.9%
' 2
 
3.9%
: 2
 
3.9%
· 1
 
2.0%
Space Separator
ValueCountFrequency (%)
784
100.0%
Close Punctuation
ValueCountFrequency (%)
) 175
100.0%
Open Punctuation
ValueCountFrequency (%)
( 173
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 33
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9953
80.8%
Common 1481
 
12.0%
Latin 883
 
7.2%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
284
 
2.9%
282
 
2.8%
159
 
1.6%
155
 
1.6%
151
 
1.5%
150
 
1.5%
139
 
1.4%
125
 
1.3%
124
 
1.2%
122
 
1.2%
Other values (577) 8262
83.0%
Latin
ValueCountFrequency (%)
S 74
 
8.4%
G 67
 
7.6%
O 56
 
6.3%
L 55
 
6.2%
K 34
 
3.9%
l 30
 
3.4%
E 29
 
3.3%
a 29
 
3.3%
I 28
 
3.2%
n 27
 
3.1%
Other values (40) 454
51.4%
Common
ValueCountFrequency (%)
784
52.9%
) 175
 
11.8%
( 173
 
11.7%
5 46
 
3.1%
2 42
 
2.8%
0 38
 
2.6%
- 33
 
2.2%
4 31
 
2.1%
1 31
 
2.1%
. 28
 
1.9%
Other values (17) 100
 
6.8%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9951
80.8%
ASCII 2359
 
19.2%
Geometric Shapes 4
 
< 0.1%
Compat Jamo 2
 
< 0.1%
None 1
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
784
33.2%
) 175
 
7.4%
( 173
 
7.3%
S 74
 
3.1%
G 67
 
2.8%
O 56
 
2.4%
L 55
 
2.3%
5 46
 
1.9%
2 42
 
1.8%
0 38
 
1.6%
Other values (65) 849
36.0%
Hangul
ValueCountFrequency (%)
284
 
2.9%
282
 
2.8%
159
 
1.6%
155
 
1.6%
151
 
1.5%
150
 
1.5%
139
 
1.4%
125
 
1.3%
124
 
1.2%
122
 
1.2%
Other values (575) 8260
83.0%
Geometric Shapes
ValueCountFrequency (%)
4
100.0%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%
None
ValueCountFrequency (%)
· 1
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-13T05:29:22.355745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:29:22.126489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:29:22.451803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:29:22.258876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:29:30.252938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번구분광고물종류수량조명표시층
순번1.0000.7080.6560.2840.6430.306
구분0.7081.0000.8330.1210.2370.379
광고물종류0.6560.8331.0000.6710.5240.527
수량0.2840.1210.6711.0000.1260.000
조명0.6430.2370.5240.1261.0000.000
표시층0.3060.3790.5270.0000.0001.000
2023-12-13T05:29:30.385636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분조명수량광고물종류
구분1.0000.2890.0920.678
조명0.2891.0000.0520.305
수량0.0920.0521.0000.346
광고물종류0.6780.3050.3461.000
2023-12-13T05:29:30.502047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번표시층구분광고물종류수량조명
순번1.0000.0320.5520.3340.0900.320
표시층0.0321.0000.4040.3110.0000.000
구분0.5520.4041.0000.6780.0920.289
광고물종류0.3340.3110.6781.0000.3460.305
수량0.0900.0000.0920.3461.0000.052
조명0.3200.0000.2890.3050.0521.000

Missing values

2023-12-13T05:29:22.621110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:29:22.815436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T05:29:23.265188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순번구분광고물종류규격광고물규격수량조명표시층표시장소_지번표시장소_도로명표시내용
01신고가로형간판_입체형8*2.8 : 입체형간판 면적 산출 : 8M*2.8M*0.7=15.688*2.81비조명2전라남도 진도군 군내면 둔전리 1782 선진농협 군내지점전라남도 진도군 군내면 진도대로 7954 선진농협 군내지점선진농협 자재센터
12신고지주이용 간판1*2 1*2 높이:21*2 1*2 높이:21비조명1전라남도 진도군 진도읍 교동리 560 한국농촌공사진도지사전라남도 진도군 진도읍 교동4길 25 교동회관교동회관 information
23신고지주이용 간판2*0.9 높이:32*0.9 높이:31비조명1전라남도 진도군 의신면 금갑리 554-1전라남도 진도군 의신면 진도대로 3074학수동다매재 법무사허의천사무소
34신고벽면이용간판(가로형)7*17*11비조명1전라남도 진도군 진도읍 남동리전라남도 진도군 진도읍 남문길 5 진도공용버스터미널오렌케이크
45허가창문이용 광고물1.6*0.4 : 1층 전광류 사용1.6*0.41전광(간접조명이용)<NA>전라남도 진도군 진도읍 남동리 779-2전라남도 진도군 진도읍 남문길 4이바돔감자탕
56허가창문이용 광고물1.32*0.481.32*0.481기타재료1전라남도 진도군 진도읍 남동리전라남도 진도군 진도읍 남문길 42-6암소마을 삼겹살 한우특수부위 김치찌게
67허가벽면이용간판(가로형)18*0.718*0.71기타재료1전라남도 진도군 군내면 둔전리 1844-9전라남도 진도군 군내면 진도대로 7955SK 에너지
78허가지주이용 간판2.6*2.5 높이:8.52.6*2.5 높이:8.51비조명1전라남도 진도군 진도읍 남동리 745-11전라남도 진도군 진도읍 남동1길 31sk에너지
89허가벽면이용간판(가로형)18*0.7 : 코끼리 주유소18*0.71기타재료1전라남도 진도군 진도읍 남동리 745-11전라남도 진도군 진도읍 남동1길 31SK 에너지
910허가지주이용 간판2.6*2.5 높이:102.6*2.5 높이:101네온사인<NA>전라남도 진도군 고군면 고성리 310-13전라남도 진도군 고군면 오일시1길 51SK주유소(진일)
순번구분광고물종류규격광고물규격수량조명표시층표시장소_지번표시장소_도로명표시내용
16421643허가돌출간판65*18065*1801비조명<NA>전라남도 진도군 진도읍 남동리 745-50전라남도 진도군 진도읍 남동3길 11-9남양다방 휴게음식점
16431644허가돌출간판65*20065*2001비조명<NA>전라남도 진도군 진도읍 쌍정리 34-1전라남도 진도군 진도읍 쌍정2길 37우리포장마차 음식점
16441645허가돌출간판60*1560*151비조명<NA>전라남도 진도군 진도읍 남동리 559전라남도 진도군 진도읍 남동4길 7-38국제당구장
16451646허가돌출간판70*10070*1001비조명<NA>전라남도 진도군 진도읍 교동리 594-7전라남도 진도군 진도읍 교동3길 19채형진출장밴드
16461647허가돌출간판70*20070*2001비조명<NA>전라남도 진도군 진도읍 교동리 594-7전라남도 진도군 진도읍 교동3길 19풍차다방
16471648허가지주이용 간판100*90100*901비조명<NA>전라남도 진도군 진도읍 교동리 136-1전라남도 진도군 진도읍 서문길 35-3북상교회
16481649허가지주이용 간판70*40070*4001비조명<NA>전라남도 진도군 지산면 보전리 1135전라남도 진도군 지산면 참전복로 394-11등대회집
16491650신고벽면이용간판(가로형)1.0 * 4.01.0 * 4.01비조명1전라남도 진도군 조도면 창유리 289전라남도 진도군 조도면 유토길 4-10조도교회예배당
16501651신고지주이용 간판1.0 * 4.01.0 * 4.01비조명<NA>전라남도 진도군 조도면 창유리 576전라남도 진도군 조도면 창유1길 34중앙당구장
16511652신고벽면이용간판(가로형)0.6 * 1.50.6 * 1.51비조명1전라남도 진도군 조도면 창유리 648-3전라남도 진도군 조도면 창유1길 11조도 개인용달