Overview

Dataset statistics

Number of variables4
Number of observations874
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory28.3 KiB
Average record size in memory33.2 B

Variable types

Numeric1
Text2
Categorical1

Dataset

Description경상남도 하동군에 있는 통신방문판매업 현황 (연번, 법인또는상호, 소재지면, 소재지주소 등)의 정보를 제공하고 있습니다
URLhttps://www.data.go.kr/data/15085540/fileData.do

Alerts

분류 is highly imbalanced (88.8%)Imbalance
번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 04:17:50.778417
Analysis finished2023-12-12 04:17:51.873338
Duration1.09 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct874
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean437.5
Minimum1
Maximum874
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.8 KiB
2023-12-12T13:17:51.987289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile44.65
Q1219.25
median437.5
Q3655.75
95-th percentile830.35
Maximum874
Range873
Interquartile range (IQR)436.5

Descriptive statistics

Standard deviation252.44636
Coefficient of variation (CV)0.57702026
Kurtosis-1.2
Mean437.5
Median Absolute Deviation (MAD)218.5
Skewness0
Sum382375
Variance63729.167
MonotonicityStrictly increasing
2023-12-12T13:17:52.198469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
588 1
 
0.1%
577 1
 
0.1%
578 1
 
0.1%
579 1
 
0.1%
580 1
 
0.1%
581 1
 
0.1%
582 1
 
0.1%
583 1
 
0.1%
584 1
 
0.1%
Other values (864) 864
98.9%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
874 1
0.1%
873 1
0.1%
872 1
0.1%
871 1
0.1%
870 1
0.1%
869 1
0.1%
868 1
0.1%
867 1
0.1%
866 1
0.1%
865 1
0.1%
Distinct847
Distinct (%)96.9%
Missing0
Missing (%)0.0%
Memory size7.0 KiB
2023-12-12T13:17:52.635104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length21
Mean length6.2643021
Min length2

Characters and Unicode

Total characters5475
Distinct characters548
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique821 ?
Unique (%)93.9%

Sample

1st row청화산조
2nd row삼순농장
3rd row하동군청
4th row오즈
5th row야란
ValueCountFrequency (%)
주식회사 27
 
2.5%
농업회사법인 24
 
2.3%
지리산 12
 
1.1%
농원 6
 
0.6%
청학동 6
 
0.6%
섬진강 6
 
0.6%
하동 6
 
0.6%
영농조합법인 5
 
0.5%
화개장터 3
 
0.3%
농장 3
 
0.3%
Other values (918) 963
90.8%
2023-12-12T13:17:53.276076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
270
 
4.9%
189
 
3.5%
158
 
2.9%
147
 
2.7%
145
 
2.6%
125
 
2.3%
115
 
2.1%
103
 
1.9%
98
 
1.8%
91
 
1.7%
Other values (538) 4034
73.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5005
91.4%
Space Separator 189
 
3.5%
Lowercase Letter 94
 
1.7%
Uppercase Letter 80
 
1.5%
Close Punctuation 47
 
0.9%
Open Punctuation 46
 
0.8%
Decimal Number 6
 
0.1%
Other Punctuation 4
 
0.1%
Other Symbol 2
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
270
 
5.4%
158
 
3.2%
147
 
2.9%
145
 
2.9%
125
 
2.5%
115
 
2.3%
103
 
2.1%
98
 
2.0%
91
 
1.8%
83
 
1.7%
Other values (480) 3670
73.3%
Uppercase Letter
ValueCountFrequency (%)
O 9
11.2%
A 8
 
10.0%
N 7
 
8.8%
E 7
 
8.8%
C 5
 
6.2%
L 5
 
6.2%
D 5
 
6.2%
M 4
 
5.0%
I 4
 
5.0%
R 4
 
5.0%
Other values (13) 22
27.5%
Lowercase Letter
ValueCountFrequency (%)
e 11
11.7%
o 11
11.7%
t 9
9.6%
s 7
 
7.4%
l 7
 
7.4%
i 6
 
6.4%
a 6
 
6.4%
n 6
 
6.4%
r 5
 
5.3%
m 5
 
5.3%
Other values (12) 21
22.3%
Decimal Number
ValueCountFrequency (%)
2 3
50.0%
3 1
 
16.7%
0 1
 
16.7%
4 1
 
16.7%
Other Punctuation
ValueCountFrequency (%)
' 2
50.0%
. 1
25.0%
& 1
25.0%
Space Separator
ValueCountFrequency (%)
189
100.0%
Close Punctuation
ValueCountFrequency (%)
) 47
100.0%
Open Punctuation
ValueCountFrequency (%)
( 46
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5007
91.5%
Common 294
 
5.4%
Latin 174
 
3.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
270
 
5.4%
158
 
3.2%
147
 
2.9%
145
 
2.9%
125
 
2.5%
115
 
2.3%
103
 
2.1%
98
 
2.0%
91
 
1.8%
83
 
1.7%
Other values (481) 3672
73.3%
Latin
ValueCountFrequency (%)
e 11
 
6.3%
o 11
 
6.3%
O 9
 
5.2%
t 9
 
5.2%
A 8
 
4.6%
N 7
 
4.0%
s 7
 
4.0%
E 7
 
4.0%
l 7
 
4.0%
i 6
 
3.4%
Other values (35) 92
52.9%
Common
ValueCountFrequency (%)
189
64.3%
) 47
 
16.0%
( 46
 
15.6%
2 3
 
1.0%
' 2
 
0.7%
- 1
 
0.3%
3 1
 
0.3%
. 1
 
0.3%
& 1
 
0.3%
0 1
 
0.3%
Other values (2) 2
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5005
91.4%
ASCII 468
 
8.5%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
270
 
5.4%
158
 
3.2%
147
 
2.9%
145
 
2.9%
125
 
2.5%
115
 
2.3%
103
 
2.1%
98
 
2.0%
91
 
1.8%
83
 
1.7%
Other values (480) 3670
73.3%
ASCII
ValueCountFrequency (%)
189
40.4%
) 47
 
10.0%
( 46
 
9.8%
e 11
 
2.4%
o 11
 
2.4%
O 9
 
1.9%
t 9
 
1.9%
A 8
 
1.7%
N 7
 
1.5%
s 7
 
1.5%
Other values (47) 124
26.5%
None
ValueCountFrequency (%)
2
100.0%
Distinct800
Distinct (%)91.5%
Missing0
Missing (%)0.0%
Memory size7.0 KiB
2023-12-12T13:17:53.781643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length40
Mean length22.122426
Min length10

Characters and Unicode

Total characters19335
Distinct characters296
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique734 ?
Unique (%)84.0%

Sample

1st row경상남도 하동군 악양면 노전길 41-16
2nd row경상남도 하동군 진교면 평당길 13
3rd row경상남도 하동군 하동읍 군청로 23, 하동군청
4th row경상남도 하동군 화개면 쌍계로 404
5th row경상남도 하동군 적량면 중도길 90-17
ValueCountFrequency (%)
경상남도 832
18.6%
하동군 832
18.6%
화개면 204
 
4.6%
하동읍 146
 
3.3%
악양면 143
 
3.2%
옥종면 69
 
1.5%
화개로 62
 
1.4%
적량면 55
 
1.2%
청암면 47
 
1.0%
금남면 44
 
1.0%
Other values (1072) 2046
45.7%
2023-12-12T13:17:54.389857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3606
18.7%
1083
 
5.6%
1008
 
5.2%
886
 
4.6%
871
 
4.5%
863
 
4.5%
857
 
4.4%
844
 
4.4%
727
 
3.8%
1 666
 
3.4%
Other values (286) 7924
41.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12187
63.0%
Space Separator 3606
 
18.7%
Decimal Number 2942
 
15.2%
Dash Punctuation 402
 
2.1%
Other Punctuation 122
 
0.6%
Close Punctuation 35
 
0.2%
Open Punctuation 35
 
0.2%
Uppercase Letter 5
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1083
 
8.9%
1008
 
8.3%
886
 
7.3%
871
 
7.1%
863
 
7.1%
857
 
7.0%
844
 
6.9%
727
 
6.0%
519
 
4.3%
314
 
2.6%
Other values (264) 4215
34.6%
Decimal Number
ValueCountFrequency (%)
1 666
22.6%
2 414
14.1%
3 310
10.5%
4 300
10.2%
5 251
 
8.5%
6 229
 
7.8%
0 204
 
6.9%
7 193
 
6.6%
9 189
 
6.4%
8 186
 
6.3%
Uppercase Letter
ValueCountFrequency (%)
R 1
20.0%
C 1
20.0%
D 1
20.0%
T 1
20.0%
K 1
20.0%
Other Punctuation
ValueCountFrequency (%)
121
99.2%
@ 1
 
0.8%
Space Separator
ValueCountFrequency (%)
3606
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 402
100.0%
Close Punctuation
ValueCountFrequency (%)
) 35
100.0%
Open Punctuation
ValueCountFrequency (%)
( 35
100.0%
Lowercase Letter
ValueCountFrequency (%)
g 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12187
63.0%
Common 7142
36.9%
Latin 6
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1083
 
8.9%
1008
 
8.3%
886
 
7.3%
871
 
7.1%
863
 
7.1%
857
 
7.0%
844
 
6.9%
727
 
6.0%
519
 
4.3%
314
 
2.6%
Other values (264) 4215
34.6%
Common
ValueCountFrequency (%)
3606
50.5%
1 666
 
9.3%
2 414
 
5.8%
- 402
 
5.6%
3 310
 
4.3%
4 300
 
4.2%
5 251
 
3.5%
6 229
 
3.2%
0 204
 
2.9%
7 193
 
2.7%
Other values (6) 567
 
7.9%
Latin
ValueCountFrequency (%)
R 1
16.7%
g 1
16.7%
C 1
16.7%
D 1
16.7%
T 1
16.7%
K 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12187
63.0%
ASCII 7027
36.3%
None 121
 
0.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3606
51.3%
1 666
 
9.5%
2 414
 
5.9%
- 402
 
5.7%
3 310
 
4.4%
4 300
 
4.3%
5 251
 
3.6%
6 229
 
3.3%
0 204
 
2.9%
7 193
 
2.7%
Other values (11) 452
 
6.4%
Hangul
ValueCountFrequency (%)
1083
 
8.9%
1008
 
8.3%
886
 
7.3%
871
 
7.1%
863
 
7.1%
857
 
7.0%
844
 
6.9%
727
 
6.0%
519
 
4.3%
314
 
2.6%
Other values (264) 4215
34.6%
None
ValueCountFrequency (%)
121
100.0%

분류
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size7.0 KiB
통신판매업
861 
방문판매업
 
13

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row통신판매업
2nd row통신판매업
3rd row통신판매업
4th row통신판매업
5th row통신판매업

Common Values

ValueCountFrequency (%)
통신판매업 861
98.5%
방문판매업 13
 
1.5%

Length

2023-12-12T13:17:54.560705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:17:54.689645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
통신판매업 861
98.5%
방문판매업 13
 
1.5%

Interactions

2023-12-12T13:17:51.536220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:17:54.767496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호분류
번호1.0000.461
분류0.4611.000
2023-12-12T13:17:54.910978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호분류
번호1.0000.353
분류0.3531.000

Missing values

2023-12-12T13:17:51.692167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:17:51.818018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호법인또는상호소재지주소분류
01청화산조경상남도 하동군 악양면 노전길 41-16통신판매업
12삼순농장경상남도 하동군 진교면 평당길 13통신판매업
23하동군청경상남도 하동군 하동읍 군청로 23, 하동군청통신판매업
34오즈경상남도 하동군 화개면 쌍계로 404통신판매업
45야란경상남도 하동군 적량면 중도길 90-17통신판매업
56끄판몰경상남도 하동군 옥종면 옥종중앙길 30-9통신판매업
67위컴퍼니경상남도 하동군 옥종면 옥종중앙길 30-9통신판매업
78농업회사법인 도재명차 주식회사경상남도 하동군 화개면 목압길 39-2통신판매업
89다웰빙카페경상남도 하동군 하동읍 오룡정길 24, 1층통신판매업
910녹차하동주식회사농업회사법인경상남도 하동군 악양면 입석길 90-73통신판매업
번호법인또는상호소재지주소분류
864865기아하남대리점경상남도 하동군 하동읍 군청로 48방문판매업
865866지리산청학농업협동조합경상남도 하동군 횡천면 문화1길 5, 횡천농협방문판매업
866867하동녹즙경상남도 하동군 하동읍 경서대로 141, 1층방문판매업
867868옥종농업협동조합경상남도 하동군 옥종면 주포중앙길 36방문판매업
868869금오농업협동조합경상남도 하동군 양보면 진양로 739방문판매업
869870화개악양농업협동조합경상남도 하동군 화개면 화개로 26-1방문판매업
870871하동축산업협동조합경상남도 하동군 하동읍 시장2길 17, 하동축협방문판매업
871872롯데우유(푸르밀)경상남도 하동군 하동읍 경서대로 290방문판매업
872873남양우유하동가정대리점경상남도 하동군 하동읍 경서대로 227-1방문판매업
873874월드국제결혼정보사경상남도 하동군 진교면 민다리안길 97 (진교@ 제상가1층 제102-1호)방문판매업