Overview

Dataset statistics

Number of variables5
Number of observations151
Missing cells151
Missing cells (%)20.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.3 KiB
Average record size in memory42.9 B

Variable types

Numeric1
Text3
Unsupported1

Dataset

Description하동군 담배소매인 현황(2021.05.27기준)입니다.
Author경상남도 하동군
URLhttps://www.data.go.kr/data/15081776/fileData.do

Alerts

비고 has 151 (100.0%) missing valuesMissing
연번 has unique valuesUnique
비고 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-11 23:47:17.054511
Analysis finished2023-12-11 23:47:17.611717
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct151
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean76
Minimum1
Maximum151
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2023-12-12T08:47:17.709904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile8.5
Q138.5
median76
Q3113.5
95-th percentile143.5
Maximum151
Range150
Interquartile range (IQR)75

Descriptive statistics

Standard deviation43.734045
Coefficient of variation (CV)0.57544796
Kurtosis-1.2
Mean76
Median Absolute Deviation (MAD)38
Skewness0
Sum11476
Variance1912.6667
MonotonicityStrictly increasing
2023-12-12T08:47:17.839449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.7%
105 1
 
0.7%
98 1
 
0.7%
99 1
 
0.7%
100 1
 
0.7%
101 1
 
0.7%
102 1
 
0.7%
103 1
 
0.7%
104 1
 
0.7%
106 1
 
0.7%
Other values (141) 141
93.4%
ValueCountFrequency (%)
1 1
0.7%
2 1
0.7%
3 1
0.7%
4 1
0.7%
5 1
0.7%
6 1
0.7%
7 1
0.7%
8 1
0.7%
9 1
0.7%
10 1
0.7%
ValueCountFrequency (%)
151 1
0.7%
150 1
0.7%
149 1
0.7%
148 1
0.7%
147 1
0.7%
146 1
0.7%
145 1
0.7%
144 1
0.7%
143 1
0.7%
142 1
0.7%
Distinct69
Distinct (%)45.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-12T08:47:18.096490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length2
Mean length2.6556291
Min length2

Characters and Unicode

Total characters401
Distinct characters101
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique38 ?
Unique (%)25.2%

Sample

1st row*민
2nd row*윤
3rd row*민
4th row*정
5th row*숙
ValueCountFrequency (%)
10
 
6.5%
10
 
6.5%
7
 
4.5%
6
 
3.9%
5
 
3.2%
5
 
3.2%
5
 
3.2%
4
 
2.6%
4
 
2.6%
4
 
2.6%
Other values (62) 94
61.0%
2023-12-12T08:47:18.534583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 139
34.7%
10
 
2.5%
10
 
2.5%
9
 
2.2%
9
 
2.2%
8
 
2.0%
7
 
1.7%
7
 
1.7%
6
 
1.5%
6
 
1.5%
Other values (91) 190
47.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 257
64.1%
Other Punctuation 139
34.7%
Space Separator 3
 
0.7%
Close Punctuation 1
 
0.2%
Open Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
3.9%
10
 
3.9%
9
 
3.5%
9
 
3.5%
8
 
3.1%
7
 
2.7%
7
 
2.7%
6
 
2.3%
6
 
2.3%
6
 
2.3%
Other values (87) 179
69.6%
Other Punctuation
ValueCountFrequency (%)
* 139
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 257
64.1%
Common 144
35.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
 
3.9%
10
 
3.9%
9
 
3.5%
9
 
3.5%
8
 
3.1%
7
 
2.7%
7
 
2.7%
6
 
2.3%
6
 
2.3%
6
 
2.3%
Other values (87) 179
69.6%
Common
ValueCountFrequency (%)
* 139
96.5%
3
 
2.1%
) 1
 
0.7%
( 1
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 257
64.1%
ASCII 144
35.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 139
96.5%
3
 
2.1%
) 1
 
0.7%
( 1
 
0.7%
Hangul
ValueCountFrequency (%)
10
 
3.9%
10
 
3.9%
9
 
3.5%
9
 
3.5%
8
 
3.1%
7
 
2.7%
7
 
2.7%
6
 
2.3%
6
 
2.3%
6
 
2.3%
Other values (87) 179
69.6%
Distinct132
Distinct (%)87.4%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-12T08:47:18.817435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length5.602649
Min length1

Characters and Unicode

Total characters846
Distinct characters222
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique127 ?
Unique (%)84.1%

Sample

1st row세븐일레븐하동화개점
2nd row미인2호점
3rd row지에스25하동전도점
4th row풀마트
5th row이마트24알(R)하동읍내점
ValueCountFrequency (%)
없음 16
 
9.3%
씨유 5
 
2.9%
gs25 3
 
1.7%
빅마트 2
 
1.2%
대성마트 2
 
1.2%
향원슈퍼 2
 
1.2%
중앙상회 2
 
1.2%
하동점 2
 
1.2%
풀마트 2
 
1.2%
이덕명 1
 
0.6%
Other values (135) 135
78.5%
2023-12-12T08:47:19.261935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42
 
5.0%
28
 
3.3%
28
 
3.3%
25
 
3.0%
23
 
2.7%
22
 
2.6%
16
 
1.9%
16
 
1.9%
15
 
1.8%
15
 
1.8%
Other values (212) 616
72.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 762
90.1%
Decimal Number 35
 
4.1%
Space Separator 23
 
2.7%
Uppercase Letter 20
 
2.4%
Dash Punctuation 2
 
0.2%
Open Punctuation 2
 
0.2%
Close Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
5.5%
28
 
3.7%
28
 
3.7%
25
 
3.3%
22
 
2.9%
16
 
2.1%
16
 
2.1%
15
 
2.0%
15
 
2.0%
14
 
1.8%
Other values (194) 541
71.0%
Decimal Number
ValueCountFrequency (%)
2 14
40.0%
5 10
28.6%
1 3
 
8.6%
4 3
 
8.6%
3 2
 
5.7%
8 2
 
5.7%
6 1
 
2.9%
Uppercase Letter
ValueCountFrequency (%)
G 7
35.0%
S 6
30.0%
C 2
 
10.0%
R 2
 
10.0%
L 1
 
5.0%
U 1
 
5.0%
I 1
 
5.0%
Space Separator
ValueCountFrequency (%)
23
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 762
90.1%
Common 64
 
7.6%
Latin 20
 
2.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
5.5%
28
 
3.7%
28
 
3.7%
25
 
3.3%
22
 
2.9%
16
 
2.1%
16
 
2.1%
15
 
2.0%
15
 
2.0%
14
 
1.8%
Other values (194) 541
71.0%
Common
ValueCountFrequency (%)
23
35.9%
2 14
21.9%
5 10
15.6%
1 3
 
4.7%
4 3
 
4.7%
- 2
 
3.1%
3 2
 
3.1%
8 2
 
3.1%
( 2
 
3.1%
) 2
 
3.1%
Latin
ValueCountFrequency (%)
G 7
35.0%
S 6
30.0%
C 2
 
10.0%
R 2
 
10.0%
L 1
 
5.0%
U 1
 
5.0%
I 1
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 762
90.1%
ASCII 84
 
9.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
42
 
5.5%
28
 
3.7%
28
 
3.7%
25
 
3.3%
22
 
2.9%
16
 
2.1%
16
 
2.1%
15
 
2.0%
15
 
2.0%
14
 
1.8%
Other values (194) 541
71.0%
ASCII
ValueCountFrequency (%)
23
27.4%
2 14
16.7%
5 10
11.9%
G 7
 
8.3%
S 6
 
7.1%
1 3
 
3.6%
4 3
 
3.6%
- 2
 
2.4%
C 2
 
2.4%
3 2
 
2.4%
Other values (8) 12
14.3%
Distinct148
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-12T08:47:19.715814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length34
Mean length22.205298
Min length18

Characters and Unicode

Total characters3353
Distinct characters159
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique145 ?
Unique (%)96.0%

Sample

1st row경상남도 하동군 화개면 화개로 18-4. 태양다방
2nd row경상남도 하동군 악양면 평사리길 34
3rd row경상남도 하동군 금남면 섬진강대로 984
4th row경상남도 하동군 진교면 진교중앙길 3. 풀할인마트
5th row경상남도 하동군 하동읍 시장1길 25
ValueCountFrequency (%)
경상남도 151
19.3%
하동군 151
19.3%
하동읍 35
 
4.5%
진교면 20
 
2.6%
화개면 20
 
2.6%
금남면 18
 
2.3%
옥종면 16
 
2.0%
경서대로 15
 
1.9%
악양면 12
 
1.5%
섬진강대로 12
 
1.5%
Other values (221) 333
42.5%
2023-12-12T08:47:20.608659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
633
18.9%
200
 
6.0%
192
 
5.7%
173
 
5.2%
171
 
5.1%
155
 
4.6%
154
 
4.6%
152
 
4.5%
116
 
3.5%
93
 
2.8%
Other values (149) 1314
39.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2229
66.5%
Space Separator 633
 
18.9%
Decimal Number 425
 
12.7%
Dash Punctuation 27
 
0.8%
Other Punctuation 17
 
0.5%
Close Punctuation 11
 
0.3%
Open Punctuation 11
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
200
 
9.0%
192
 
8.6%
173
 
7.8%
171
 
7.7%
155
 
7.0%
154
 
6.9%
152
 
6.8%
116
 
5.2%
93
 
4.2%
58
 
2.6%
Other values (134) 765
34.3%
Decimal Number
ValueCountFrequency (%)
1 84
19.8%
2 58
13.6%
4 44
10.4%
3 41
9.6%
5 39
9.2%
6 36
8.5%
7 34
8.0%
8 31
 
7.3%
9 30
 
7.1%
0 28
 
6.6%
Space Separator
ValueCountFrequency (%)
633
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 27
100.0%
Other Punctuation
ValueCountFrequency (%)
. 17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2229
66.5%
Common 1124
33.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
200
 
9.0%
192
 
8.6%
173
 
7.8%
171
 
7.7%
155
 
7.0%
154
 
6.9%
152
 
6.8%
116
 
5.2%
93
 
4.2%
58
 
2.6%
Other values (134) 765
34.3%
Common
ValueCountFrequency (%)
633
56.3%
1 84
 
7.5%
2 58
 
5.2%
4 44
 
3.9%
3 41
 
3.6%
5 39
 
3.5%
6 36
 
3.2%
7 34
 
3.0%
8 31
 
2.8%
9 30
 
2.7%
Other values (5) 94
 
8.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2229
66.5%
ASCII 1124
33.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
633
56.3%
1 84
 
7.5%
2 58
 
5.2%
4 44
 
3.9%
3 41
 
3.6%
5 39
 
3.5%
6 36
 
3.2%
7 34
 
3.0%
8 31
 
2.8%
9 30
 
2.7%
Other values (5) 94
 
8.4%
Hangul
ValueCountFrequency (%)
200
 
9.0%
192
 
8.6%
173
 
7.8%
171
 
7.7%
155
 
7.0%
154
 
6.9%
152
 
6.8%
116
 
5.2%
93
 
4.2%
58
 
2.6%
Other values (134) 765
34.3%

비고
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing151
Missing (%)100.0%
Memory size1.5 KiB

Interactions

2023-12-12T08:47:17.347812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:47:20.807092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번대표자명
연번1.0000.532
대표자명0.5321.000

Missing values

2023-12-12T08:47:17.481966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:47:17.574281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번대표자명업소명업소도로명주소비고
01*민세븐일레븐하동화개점경상남도 하동군 화개면 화개로 18-4. 태양다방<NA>
12*윤미인2호점경상남도 하동군 악양면 평사리길 34<NA>
23*민지에스25하동전도점경상남도 하동군 금남면 섬진강대로 984<NA>
34*정풀마트경상남도 하동군 진교면 진교중앙길 3. 풀할인마트<NA>
45*숙이마트24알(R)하동읍내점경상남도 하동군 하동읍 시장1길 25<NA>
56*진지에스25하동제일점경상남도 하동군 금남면 경충로 451. 제일남사휴게실<NA>
67*철너뱅이한밭식당경상남도 하동군 하동읍 군청로 32<NA>
78*옥대치편의점경상남도 하동군 금남면 한재길 2-1<NA>
89*남씨유하동녹차마을점경상남도 하동군 하동읍 경서대로 243-2<NA>
910*경도둑골관광농원경상남도 하동군 옥종면 호계천로 210<NA>
연번대표자명업소명업소도로명주소비고
141142*희없음경상남도 하동군 화개면 화개로 305<NA>
142143*자없음경상남도 하동군 화개면 화개로 979<NA>
143144*심김순심경상남도 하동군 금남면 노량해안길 16-4<NA>
144145*화정진화경상남도 하동군 악양면 악양서로 395<NA>
145146*선경상남도 하동군 청암면 새터길 6-4<NA>
146147*종없음경상남도 하동군 진교면 술상길 41<NA>
147148*득없음경상남도 하동군 진교면 민다리길 30-2<NA>
148149*인없음경상남도 하동군 하동읍 군청로 185-5<NA>
149150*남없음경상남도 하동군 하동읍 남당길 42-9<NA>
150151*자없음경상남도 하동군 횡천면 경서대로 1147<NA>