Overview

Dataset statistics

Number of variables14
Number of observations3103
Missing cells2668
Missing cells (%)6.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory348.6 KiB
Average record size in memory115.0 B

Variable types

Text6
Categorical2
Numeric3
Boolean2
DateTime1

Dataset

Description공공데이터 제공 표준데이터 속성정보(허용값, 표현형식/단위 등)는 [공공데이터 제공 표준] 전문을 참고하시기 바랍니다.(공공데이터포털>정보공유>자료실)
Author지방자치단체
URLhttps://www.data.go.kr/data/15114137/standard.do

Alerts

위도 is highly overall correlated with 경도High correlation
경도 is highly overall correlated with 위도High correlation
운반용차량보유여부 is highly overall correlated with 저장시설보유여부High correlation
저장시설보유여부 is highly overall correlated with 운반용차량보유여부High correlation
업종명 is highly imbalanced (62.3%)Imbalance
영업상태명 is highly imbalanced (85.5%)Imbalance
소재지지번주소 has 1313 (42.3%) missing valuesMissing
전화번호 has 1335 (43.0%) missing valuesMissing

Reproduction

Analysis started2024-04-13 13:07:48.555133
Analysis finished2024-04-13 13:07:56.380137
Duration7.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct2928
Distinct (%)94.4%
Missing0
Missing (%)0.0%
Memory size24.4 KiB
2024-04-13T22:07:57.169054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length6.7837577
Min length1

Characters and Unicode

Total characters21050
Distinct characters526
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2782 ?
Unique (%)89.7%

Sample

1st row율곡가스
2nd row합천대병가스
3rd row주식회사 에스에이팜
4th row주식회사 파로스
5th row(주) 렘넌트
ValueCountFrequency (%)
주식회사 330
 
9.3%
유한회사 9
 
0.3%
7
 
0.2%
일반종합도매 6
 
0.2%
제이에스팜 6
 
0.2%
드림팜 6
 
0.2%
약업사 6
 
0.2%
감초당약업사 5
 
0.1%
제이팜 4
 
0.1%
한약도매 4
 
0.1%
Other values (2944) 3149
89.2%
2024-04-13T22:07:58.593780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1432
 
6.8%
1046
 
5.0%
) 1012
 
4.8%
( 1009
 
4.8%
781
 
3.7%
729
 
3.5%
639
 
3.0%
630
 
3.0%
579
 
2.8%
500
 
2.4%
Other values (516) 12693
60.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 18285
86.9%
Close Punctuation 1012
 
4.8%
Open Punctuation 1009
 
4.8%
Space Separator 429
 
2.0%
Other Symbol 172
 
0.8%
Uppercase Letter 110
 
0.5%
Lowercase Letter 11
 
0.1%
Other Punctuation 10
 
< 0.1%
Decimal Number 10
 
< 0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1432
 
7.8%
1046
 
5.7%
781
 
4.3%
729
 
4.0%
639
 
3.5%
630
 
3.4%
579
 
3.2%
500
 
2.7%
470
 
2.6%
441
 
2.4%
Other values (472) 11038
60.4%
Uppercase Letter
ValueCountFrequency (%)
S 18
16.4%
M 13
11.8%
K 12
10.9%
P 9
 
8.2%
B 6
 
5.5%
G 6
 
5.5%
C 5
 
4.5%
R 5
 
4.5%
H 5
 
4.5%
A 4
 
3.6%
Other values (13) 27
24.5%
Lowercase Letter
ValueCountFrequency (%)
k 2
18.2%
a 2
18.2%
u 1
9.1%
m 1
9.1%
r 1
9.1%
y 1
9.1%
p 1
9.1%
h 1
9.1%
b 1
9.1%
Decimal Number
ValueCountFrequency (%)
8 4
40.0%
1 3
30.0%
2 2
20.0%
4 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
. 6
60.0%
& 2
 
20.0%
, 2
 
20.0%
Close Punctuation
ValueCountFrequency (%)
) 1012
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1009
100.0%
Space Separator
ValueCountFrequency (%)
429
100.0%
Other Symbol
ValueCountFrequency (%)
172
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 18457
87.7%
Common 2472
 
11.7%
Latin 121
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1432
 
7.8%
1046
 
5.7%
781
 
4.2%
729
 
3.9%
639
 
3.5%
630
 
3.4%
579
 
3.1%
500
 
2.7%
470
 
2.5%
441
 
2.4%
Other values (473) 11210
60.7%
Latin
ValueCountFrequency (%)
S 18
14.9%
M 13
 
10.7%
K 12
 
9.9%
P 9
 
7.4%
B 6
 
5.0%
G 6
 
5.0%
C 5
 
4.1%
R 5
 
4.1%
H 5
 
4.1%
A 4
 
3.3%
Other values (22) 38
31.4%
Common
ValueCountFrequency (%)
) 1012
40.9%
( 1009
40.8%
429
17.4%
. 6
 
0.2%
8 4
 
0.2%
1 3
 
0.1%
2 2
 
0.1%
- 2
 
0.1%
& 2
 
0.1%
, 2
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 18285
86.9%
ASCII 2593
 
12.3%
None 172
 
0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1432
 
7.8%
1046
 
5.7%
781
 
4.3%
729
 
4.0%
639
 
3.5%
630
 
3.4%
579
 
3.2%
500
 
2.7%
470
 
2.6%
441
 
2.4%
Other values (472) 11038
60.4%
ASCII
ValueCountFrequency (%)
) 1012
39.0%
( 1009
38.9%
429
16.5%
S 18
 
0.7%
M 13
 
0.5%
K 12
 
0.5%
P 9
 
0.3%
B 6
 
0.2%
. 6
 
0.2%
G 6
 
0.2%
Other values (33) 73
 
2.8%
None
ValueCountFrequency (%)
172
100.0%

업종명
Categorical

IMBALANCE 

Distinct27
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size24.4 KiB
일반종합도매
2083 
한약도매
432 
기타
 
201
기타(의료용 고압가스)
 
137
원료의약품도매
 
64
Other values (22)
 
186

Length

Max length13
Median length6
Mean length5.918466
Min length2

Unique

Unique11 ?
Unique (%)0.4%

Sample

1st row기타
2nd row기타
3rd row일반종합도매
4th row일반종합도매
5th row일반종합도매

Common Values

ValueCountFrequency (%)
일반종합도매 2083
67.1%
한약도매 432
 
13.9%
기타 201
 
6.5%
기타(의료용 고압가스) 137
 
4.4%
원료의약품도매 64
 
2.1%
(기타)의료용 고압가스 54
 
1.7%
의료용 고압가스 36
 
1.2%
기타(의료용고압가스) 18
 
0.6%
시약도매 17
 
0.5%
수입의약품도매 15
 
0.5%
Other values (17) 46
 
1.5%

Length

2024-04-13T22:07:58.826946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일반종합도매 2083
62.5%
한약도매 432
 
13.0%
고압가스 227
 
6.8%
기타 201
 
6.0%
기타(의료용 137
 
4.1%
원료의약품도매 64
 
1.9%
기타)의료용 54
 
1.6%
의료용 40
 
1.2%
기타(의료용고압가스 18
 
0.5%
시약도매 17
 
0.5%
Other values (19) 62
 
1.9%
Distinct2907
Distinct (%)94.3%
Missing20
Missing (%)0.6%
Memory size24.4 KiB
2024-04-13T22:07:59.931093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length48
Mean length29.750892
Min length15

Characters and Unicode

Total characters91722
Distinct characters533
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2811 ?
Unique (%)91.2%

Sample

1st row경상남도 합천군 율곡면 임북길 221
2nd row경상남도 합천군 대양면 대야로 572-18
3rd row강원도 강릉시 경강로 2023, 3층 2호 (명주동)
4th row강원도 강릉시 사임당로 131-1, 2동 108호 (홍제동)
5th row강원도 강릉시 옥계면 옥계로 1028-12, 1층 104호 (혜진아파트)
ValueCountFrequency (%)
서울특별시 566
 
3.0%
부산광역시 458
 
2.5%
경기도 416
 
2.2%
대구광역시 340
 
1.8%
2층 313
 
1.7%
경상남도 263
 
1.4%
3층 256
 
1.4%
대전광역시 226
 
1.2%
중구 217
 
1.2%
강남구 165
 
0.9%
Other values (4871) 15471
82.8%
2024-04-13T22:08:01.731643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15608
 
17.0%
1 3286
 
3.6%
2995
 
3.3%
2947
 
3.2%
2889
 
3.1%
2653
 
2.9%
( 2328
 
2.5%
) 2328
 
2.5%
2 2317
 
2.5%
, 2132
 
2.3%
Other values (523) 52239
57.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 53443
58.3%
Space Separator 15608
 
17.0%
Decimal Number 14976
 
16.3%
Open Punctuation 2328
 
2.5%
Close Punctuation 2328
 
2.5%
Other Punctuation 2136
 
2.3%
Dash Punctuation 623
 
0.7%
Uppercase Letter 238
 
0.3%
Lowercase Letter 29
 
< 0.1%
Letter Number 6
 
< 0.1%
Other values (2) 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2995
 
5.6%
2947
 
5.5%
2889
 
5.4%
2653
 
5.0%
1439
 
2.7%
1434
 
2.7%
1410
 
2.6%
1408
 
2.6%
1400
 
2.6%
1302
 
2.4%
Other values (467) 33566
62.8%
Uppercase Letter
ValueCountFrequency (%)
B 55
23.1%
A 37
15.5%
K 15
 
6.3%
C 13
 
5.5%
S 13
 
5.5%
E 11
 
4.6%
D 8
 
3.4%
N 8
 
3.4%
W 8
 
3.4%
M 8
 
3.4%
Other values (13) 62
26.1%
Lowercase Letter
ValueCountFrequency (%)
e 6
20.7%
n 4
13.8%
r 4
13.8%
c 4
13.8%
o 3
10.3%
t 3
10.3%
d 1
 
3.4%
a 1
 
3.4%
w 1
 
3.4%
i 1
 
3.4%
Decimal Number
ValueCountFrequency (%)
1 3286
21.9%
2 2317
15.5%
3 1829
12.2%
4 1452
9.7%
0 1446
9.7%
5 1218
 
8.1%
6 1019
 
6.8%
8 864
 
5.8%
7 845
 
5.6%
9 700
 
4.7%
Other Punctuation
ValueCountFrequency (%)
, 2132
99.8%
. 2
 
0.1%
· 2
 
0.1%
Letter Number
ValueCountFrequency (%)
4
66.7%
1
 
16.7%
1
 
16.7%
Space Separator
ValueCountFrequency (%)
15608
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2328
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2328
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 623
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 53445
58.3%
Common 38004
41.4%
Latin 273
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2995
 
5.6%
2947
 
5.5%
2889
 
5.4%
2653
 
5.0%
1439
 
2.7%
1434
 
2.7%
1410
 
2.6%
1408
 
2.6%
1400
 
2.6%
1302
 
2.4%
Other values (468) 33568
62.8%
Latin
ValueCountFrequency (%)
B 55
20.1%
A 37
 
13.6%
K 15
 
5.5%
C 13
 
4.8%
S 13
 
4.8%
E 11
 
4.0%
D 8
 
2.9%
N 8
 
2.9%
W 8
 
2.9%
M 8
 
2.9%
Other values (27) 97
35.5%
Common
ValueCountFrequency (%)
15608
41.1%
1 3286
 
8.6%
( 2328
 
6.1%
) 2328
 
6.1%
2 2317
 
6.1%
, 2132
 
5.6%
3 1829
 
4.8%
4 1452
 
3.8%
0 1446
 
3.8%
5 1218
 
3.2%
Other values (8) 4060
 
10.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 53443
58.3%
ASCII 38269
41.7%
Number Forms 6
 
< 0.1%
None 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
15608
40.8%
1 3286
 
8.6%
( 2328
 
6.1%
) 2328
 
6.1%
2 2317
 
6.1%
, 2132
 
5.6%
3 1829
 
4.8%
4 1452
 
3.8%
0 1446
 
3.8%
5 1218
 
3.2%
Other values (41) 4325
 
11.3%
Hangul
ValueCountFrequency (%)
2995
 
5.6%
2947
 
5.5%
2889
 
5.4%
2653
 
5.0%
1439
 
2.7%
1434
 
2.7%
1410
 
2.6%
1408
 
2.6%
1400
 
2.6%
1302
 
2.4%
Other values (467) 33566
62.8%
Number Forms
ValueCountFrequency (%)
4
66.7%
1
 
16.7%
1
 
16.7%
None
ValueCountFrequency (%)
2
50.0%
· 2
50.0%

소재지지번주소
Text

MISSING 

Distinct1583
Distinct (%)88.4%
Missing1313
Missing (%)42.3%
Memory size24.4 KiB
2024-04-13T22:08:02.954865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length46
Mean length23.193855
Min length14

Characters and Unicode

Total characters41517
Distinct characters401
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1482 ?
Unique (%)82.8%

Sample

1st row서울특별시 용산구 한남동 99-2
2nd row서울특별시 용산구 한남동 631-7
3rd row서울특별시 용산구 원효로1가 44-7
4th row서울특별시 용산구 한남동 657-88
5th row서울특별시 용산구 한강로2가 191
ValueCountFrequency (%)
경기도 301
 
3.3%
서울특별시 278
 
3.1%
경상남도 233
 
2.6%
부산광역시 217
 
2.4%
강남구 166
 
1.8%
대구광역시 151
 
1.7%
창원시 132
 
1.5%
대전광역시 129
 
1.4%
수원시 109
 
1.2%
용인시 99
 
1.1%
Other values (2702) 7240
80.0%
2024-04-13T22:08:04.591657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7281
 
17.5%
1729
 
4.2%
1 1687
 
4.1%
1681
 
4.0%
1457
 
3.5%
- 1126
 
2.7%
2 1025
 
2.5%
1007
 
2.4%
3 885
 
2.1%
4 786
 
1.9%
Other values (391) 22853
55.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 24848
59.9%
Decimal Number 8084
 
19.5%
Space Separator 7281
 
17.5%
Dash Punctuation 1126
 
2.7%
Uppercase Letter 71
 
0.2%
Other Punctuation 61
 
0.1%
Lowercase Letter 17
 
< 0.1%
Open Punctuation 12
 
< 0.1%
Close Punctuation 12
 
< 0.1%
Letter Number 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1729
 
7.0%
1681
 
6.8%
1457
 
5.9%
1007
 
4.1%
677
 
2.7%
676
 
2.7%
674
 
2.7%
607
 
2.4%
602
 
2.4%
586
 
2.4%
Other values (343) 15152
61.0%
Uppercase Letter
ValueCountFrequency (%)
B 10
14.1%
K 7
9.9%
A 7
9.9%
I 6
8.5%
S 5
 
7.0%
T 5
 
7.0%
W 5
 
7.0%
N 4
 
5.6%
C 4
 
5.6%
M 4
 
5.6%
Other values (9) 14
19.7%
Lowercase Letter
ValueCountFrequency (%)
o 3
17.6%
r 2
11.8%
c 2
11.8%
e 2
11.8%
n 2
11.8%
w 1
 
5.9%
i 1
 
5.9%
d 1
 
5.9%
a 1
 
5.9%
t 1
 
5.9%
Decimal Number
ValueCountFrequency (%)
1 1687
20.9%
2 1025
12.7%
3 885
10.9%
4 786
9.7%
5 740
9.2%
7 669
 
8.3%
6 632
 
7.8%
0 599
 
7.4%
9 536
 
6.6%
8 525
 
6.5%
Letter Number
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Space Separator
ValueCountFrequency (%)
7281
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1126
100.0%
Other Punctuation
ValueCountFrequency (%)
, 61
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 24849
59.9%
Common 16576
39.9%
Latin 92
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1729
 
7.0%
1681
 
6.8%
1457
 
5.9%
1007
 
4.1%
677
 
2.7%
676
 
2.7%
674
 
2.7%
607
 
2.4%
602
 
2.4%
586
 
2.4%
Other values (344) 15153
61.0%
Latin
ValueCountFrequency (%)
B 10
 
10.9%
K 7
 
7.6%
A 7
 
7.6%
I 6
 
6.5%
S 5
 
5.4%
T 5
 
5.4%
W 5
 
5.4%
N 4
 
4.3%
C 4
 
4.3%
M 4
 
4.3%
Other values (22) 35
38.0%
Common
ValueCountFrequency (%)
7281
43.9%
1 1687
 
10.2%
- 1126
 
6.8%
2 1025
 
6.2%
3 885
 
5.3%
4 786
 
4.7%
5 740
 
4.5%
7 669
 
4.0%
6 632
 
3.8%
0 599
 
3.6%
Other values (5) 1146
 
6.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 24848
59.9%
ASCII 16664
40.1%
Number Forms 4
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7281
43.7%
1 1687
 
10.1%
- 1126
 
6.8%
2 1025
 
6.2%
3 885
 
5.3%
4 786
 
4.7%
5 740
 
4.4%
7 669
 
4.0%
6 632
 
3.8%
0 599
 
3.6%
Other values (35) 1234
 
7.4%
Hangul
ValueCountFrequency (%)
1729
 
7.0%
1681
 
6.8%
1457
 
5.9%
1007
 
4.1%
677
 
2.7%
676
 
2.7%
674
 
2.7%
607
 
2.4%
602
 
2.4%
586
 
2.4%
Other values (343) 15152
61.0%
Number Forms
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
None
ValueCountFrequency (%)
1
100.0%

위도
Real number (ℝ)

HIGH CORRELATION 

Distinct2478
Distinct (%)79.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean36.397016
Minimum33.263624
Maximum37.969299
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size27.4 KiB
2024-04-13T22:08:05.006789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum33.263624
5-th percentile35.146198
Q135.261559
median36.318169
Q337.47329
95-th percentile37.690736
Maximum37.969299
Range4.7056742
Interquartile range (IQR)2.2117316

Descriptive statistics

Standard deviation0.97364278
Coefficient of variation (CV)0.026750621
Kurtosis-1.4997118
Mean36.397016
Median Absolute Deviation (MAD)1.0802569
Skewness-0.031005165
Sum112939.94
Variance0.94798026
MonotonicityNot monotonic
2024-04-13T22:08:05.427419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
35.94269411 21
 
0.7%
37.2533641 20
 
0.6%
35.20438965 19
 
0.6%
35.2130694252 18
 
0.6%
35.198761 17
 
0.5%
37.51479584 17
 
0.5%
37.26402893 15
 
0.5%
37.731970749 15
 
0.5%
36.10191794 14
 
0.5%
35.216031 13
 
0.4%
Other values (2468) 2934
94.6%
ValueCountFrequency (%)
33.26362428 1
< 0.1%
33.26368292 1
< 0.1%
33.291957 1
< 0.1%
34.73008451 1
< 0.1%
34.73886851 1
< 0.1%
34.73970653 1
< 0.1%
34.73973293 1
< 0.1%
34.74123604 1
< 0.1%
34.7416917 1
< 0.1%
34.74365767 1
< 0.1%
ValueCountFrequency (%)
37.9692985273 1
< 0.1%
37.948169797 1
< 0.1%
37.9207086983 1
< 0.1%
37.910095 1
< 0.1%
37.9046309139 1
< 0.1%
37.8918682095 1
< 0.1%
37.8819331453 1
< 0.1%
37.881015 1
< 0.1%
37.8796694745 1
< 0.1%
37.8775300654 1
< 0.1%

경도
Real number (ℝ)

HIGH CORRELATION 

Distinct2485
Distinct (%)80.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean127.80778
Minimum126.39874
Maximum129.27161
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size27.4 KiB
2024-04-13T22:08:05.815478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.39874
5-th percentile126.77977
Q1127.02635
median127.4508
Q3128.63975
95-th percentile129.10391
Maximum129.27161
Range2.8728701
Interquartile range (IQR)1.6133977

Descriptive statistics

Standard deviation0.86984135
Coefficient of variation (CV)0.0068058558
Kurtosis-1.5798414
Mean127.80778
Median Absolute Deviation (MAD)0.60315211
Skewness0.28851004
Sum396587.55
Variance0.75662397
MonotonicityNot monotonic
2024-04-13T22:08:06.237442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
128.9314918 21
 
0.7%
127.1081733 20
 
0.6%
129.1121914 19
 
0.6%
129.0732824712 18
 
0.6%
129.0894160313 17
 
0.5%
126.8874812 17
 
0.5%
127.0217791 15
 
0.5%
127.0804028947 15
 
0.5%
127.4983403 14
 
0.5%
128.984046 13
 
0.4%
Other values (2475) 2934
94.6%
ValueCountFrequency (%)
126.398739502 1
< 0.1%
126.4552218164 1
< 0.1%
126.4574157177 1
< 0.1%
126.459942 1
< 0.1%
126.464614 1
< 0.1%
126.4742915905 1
< 0.1%
126.5304367717 1
< 0.1%
126.546842 1
< 0.1%
126.5589881824 1
< 0.1%
126.5589882 1
< 0.1%
ValueCountFrequency (%)
129.2716096 1
< 0.1%
129.2331169 1
< 0.1%
129.2305086 1
< 0.1%
129.218182 1
< 0.1%
129.2167059 1
< 0.1%
129.2151297 1
< 0.1%
129.2141416 2
0.1%
129.2138168 1
< 0.1%
129.2127375 1
< 0.1%
129.2098916 1
< 0.1%

영업상태명
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size24.4 KiB
영업
2947 
폐업
 
106
영업중
 
48
(주)한의유통
 
1
증평제일삼화가스
 
1

Length

Max length8
Median length2
Mean length2.0190139
Min length2

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row영업
2nd row영업
3rd row영업
4th row영업
5th row영업

Common Values

ValueCountFrequency (%)
영업 2947
95.0%
폐업 106
 
3.4%
영업중 48
 
1.5%
(주)한의유통 1
 
< 0.1%
증평제일삼화가스 1
 
< 0.1%

Length

2024-04-13T22:08:06.669100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-13T22:08:07.014639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업 2947
95.0%
폐업 106
 
3.4%
영업중 48
 
1.5%
주)한의유통 1
 
< 0.1%
증평제일삼화가스 1
 
< 0.1%

운반용차량보유여부
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
True
2043 
False
1060 
ValueCountFrequency (%)
True 2043
65.8%
False 1060
34.2%
2024-04-13T22:08:07.315649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

저장시설보유여부
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
True
1918 
False
1185 
ValueCountFrequency (%)
True 1918
61.8%
False 1185
38.2%
2024-04-13T22:08:07.587287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

전화번호
Text

MISSING 

Distinct1684
Distinct (%)95.2%
Missing1335
Missing (%)43.0%
Memory size24.4 KiB
2024-04-13T22:08:08.402938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.934955
Min length9

Characters and Unicode

Total characters21101
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1616 ?
Unique (%)91.4%

Sample

1st row055-933-9944
2nd row055-933-6678
3rd row055-963-7117
4th row055-963-8814
5th row055-962-3104
ValueCountFrequency (%)
053-000-0000 15
 
0.8%
051-000-0000 4
 
0.2%
000-0000-0000 3
 
0.2%
063-842-6060 2
 
0.1%
063-291-3029 2
 
0.1%
054-972-6727 2
 
0.1%
02-549-7451 2
 
0.1%
063-432-5285 2
 
0.1%
063-856-1215 2
 
0.1%
063-856-2320 2
 
0.1%
Other values (1674) 1732
98.0%
2024-04-13T22:08:09.602354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 3525
16.7%
0 3278
15.5%
5 2306
10.9%
3 2125
10.1%
2 2105
10.0%
1 1850
8.8%
4 1380
 
6.5%
7 1347
 
6.4%
6 1324
 
6.3%
8 1091
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 17576
83.3%
Dash Punctuation 3525
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3278
18.7%
5 2306
13.1%
3 2125
12.1%
2 2105
12.0%
1 1850
10.5%
4 1380
7.9%
7 1347
7.7%
6 1324
7.5%
8 1091
 
6.2%
9 770
 
4.4%
Dash Punctuation
ValueCountFrequency (%)
- 3525
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 21101
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 3525
16.7%
0 3278
15.5%
5 2306
10.9%
3 2125
10.1%
2 2105
10.0%
1 1850
8.8%
4 1380
 
6.5%
7 1347
 
6.4%
6 1324
 
6.3%
8 1091
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 21101
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 3525
16.7%
0 3278
15.5%
5 2306
10.9%
3 2125
10.1%
2 2105
10.0%
1 1850
8.8%
4 1380
 
6.5%
7 1347
 
6.4%
6 1324
 
6.3%
8 1091
 
5.2%
Distinct120
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Memory size24.4 KiB
2024-04-13T22:08:10.569100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length10.197873
Min length4

Characters and Unicode

Total characters31644
Distinct characters124
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)0.4%

Sample

1st row경상남도 합천군청
2nd row경상남도 합천군청
3rd row강원도 강릉시보건소
4th row강원도 강릉시보건소
5th row강원도 강릉시보건소
ValueCountFrequency (%)
부산광역시 458
 
7.1%
서울특별시 449
 
7.0%
경기도 387
 
6.0%
대구광역시 340
 
5.3%
보건소 293
 
4.6%
대전광역시 229
 
3.6%
강남구 166
 
2.6%
경상남도 131
 
2.0%
강원특별자치도 129
 
2.0%
영등포구 119
 
1.9%
Other values (131) 3717
57.9%
2024-04-13T22:08:11.810052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3315
 
10.5%
2577
 
8.1%
2279
 
7.2%
1578
 
5.0%
1573
 
5.0%
1532
 
4.8%
1360
 
4.3%
1200
 
3.8%
1179
 
3.7%
1011
 
3.2%
Other values (114) 14040
44.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 28327
89.5%
Space Separator 3315
 
10.5%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2577
 
9.1%
2279
 
8.0%
1578
 
5.6%
1573
 
5.6%
1532
 
5.4%
1360
 
4.8%
1200
 
4.2%
1179
 
4.2%
1011
 
3.6%
862
 
3.0%
Other values (111) 13176
46.5%
Space Separator
ValueCountFrequency (%)
3315
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 28327
89.5%
Common 3317
 
10.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2577
 
9.1%
2279
 
8.0%
1578
 
5.6%
1573
 
5.6%
1532
 
5.4%
1360
 
4.8%
1200
 
4.2%
1179
 
4.2%
1011
 
3.6%
862
 
3.0%
Other values (111) 13176
46.5%
Common
ValueCountFrequency (%)
3315
99.9%
) 1
 
< 0.1%
( 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 28327
89.5%
ASCII 3317
 
10.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3315
99.9%
) 1
 
< 0.1%
( 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
2577
 
9.1%
2279
 
8.0%
1578
 
5.6%
1573
 
5.6%
1532
 
5.4%
1360
 
4.8%
1200
 
4.2%
1179
 
4.2%
1011
 
3.6%
862
 
3.0%
Other values (111) 13176
46.5%
Distinct45
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size24.4 KiB
Minimum2023-05-31 00:00:00
Maximum2024-04-04 00:00:00
2024-04-13T22:08:12.044311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T22:08:12.290872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)

제공기관코드
Real number (ℝ)

Distinct107
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3933149.2
Minimum3000000
Maximum6520000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size27.4 KiB
2024-04-13T22:08:12.537164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3000000
5-th percentile3150000
Q13330000
median3640000
Q34390000
95-th percentile5670000
Maximum6520000
Range3520000
Interquartile range (IQR)1060000

Descriptive statistics

Standard deviation797318.91
Coefficient of variation (CV)0.20271769
Kurtosis-0.20108841
Mean3933149.2
Median Absolute Deviation (MAD)420000
Skewness1.0241093
Sum1.2204562 × 1010
Variance6.3571744 × 1011
MonotonicityNot monotonic
2024-04-13T22:08:12.797904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3220000 166
 
5.3%
5670000 132
 
4.3%
3180000 119
 
3.8%
3740000 111
 
3.6%
3210000 107
 
3.4%
3650000 103
 
3.3%
4050000 99
 
3.2%
3410000 97
 
3.1%
5350000 96
 
3.1%
3300000 87
 
2.8%
Other values (97) 1986
64.0%
ValueCountFrequency (%)
3000000 26
 
0.8%
3010000 19
 
0.6%
3020000 10
 
0.3%
3060000 19
 
0.6%
3070000 11
 
0.4%
3100000 15
 
0.5%
3110000 10
 
0.3%
3150000 54
1.7%
3180000 119
3.8%
3200000 10
 
0.3%
ValueCountFrequency (%)
6520000 3
 
0.1%
5710000 66
2.1%
5700000 3
 
0.1%
5680000 16
 
0.5%
5670000 132
4.3%
5590000 20
 
0.6%
5570000 2
 
0.1%
5480000 2
 
0.1%
5470000 4
 
0.1%
5460000 4
 
0.1%
Distinct107
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size24.4 KiB
2024-04-13T22:08:13.735189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length8.5310989
Min length7

Characters and Unicode

Total characters26472
Distinct characters92
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)0.2%

Sample

1st row경상남도 합천군
2nd row경상남도 합천군
3rd row강원도 강릉시
4th row강원도 강릉시
5th row강원도 강릉시
ValueCountFrequency (%)
서울특별시 566
 
9.1%
부산광역시 458
 
7.4%
경기도 424
 
6.8%
대구광역시 340
 
5.5%
경상남도 263
 
4.2%
대전광역시 229
 
3.7%
중구 221
 
3.6%
강남구 166
 
2.7%
경상북도 135
 
2.2%
창원시 132
 
2.1%
Other values (97) 3272
52.7%
2024-04-13T22:08:14.860122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3103
 
11.7%
2921
 
11.0%
2069
 
7.8%
1360
 
5.1%
1342
 
5.1%
1200
 
4.5%
895
 
3.4%
867
 
3.3%
814
 
3.1%
731
 
2.8%
Other values (82) 11170
42.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 23369
88.3%
Space Separator 3103
 
11.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2921
 
12.5%
2069
 
8.9%
1360
 
5.8%
1342
 
5.7%
1200
 
5.1%
895
 
3.8%
867
 
3.7%
814
 
3.5%
731
 
3.1%
731
 
3.1%
Other values (81) 10439
44.7%
Space Separator
ValueCountFrequency (%)
3103
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 23369
88.3%
Common 3103
 
11.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2921
 
12.5%
2069
 
8.9%
1360
 
5.8%
1342
 
5.7%
1200
 
5.1%
895
 
3.8%
867
 
3.7%
814
 
3.5%
731
 
3.1%
731
 
3.1%
Other values (81) 10439
44.7%
Common
ValueCountFrequency (%)
3103
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 23369
88.3%
ASCII 3103
 
11.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3103
100.0%
Hangul
ValueCountFrequency (%)
2921
 
12.5%
2069
 
8.9%
1360
 
5.8%
1342
 
5.7%
1200
 
5.1%
895
 
3.8%
867
 
3.7%
814
 
3.5%
731
 
3.1%
731
 
3.1%
Other values (81) 10439
44.7%

Interactions

2024-04-13T22:07:54.322453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T22:07:52.751033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T22:07:53.528263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T22:07:54.579196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T22:07:53.004858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T22:07:53.793748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T22:07:54.845911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T22:07:53.258840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T22:07:54.048417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-13T22:08:15.024374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명위도경도영업상태명운반용차량보유여부저장시설보유여부데이터기준일자제공기관코드
업종명1.0000.5440.5570.4750.2770.3920.8000.689
위도0.5441.0000.7830.2840.3270.3430.9270.808
경도0.5570.7831.0000.4680.4060.2820.9290.759
영업상태명0.4750.2840.4681.0000.0070.0350.6840.287
운반용차량보유여부0.2770.3270.4060.0071.0000.8920.6420.304
저장시설보유여부0.3920.3430.2820.0350.8921.0000.5230.235
데이터기준일자0.8000.9270.9290.6840.6420.5231.0000.954
제공기관코드0.6890.8080.7590.2870.3040.2350.9541.000
2024-04-13T22:08:15.225164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명운반용차량보유여부저장시설보유여부영업상태명
업종명1.0000.2370.3360.251
운반용차량보유여부0.2371.0000.7020.009
저장시설보유여부0.3360.7021.0000.043
영업상태명0.2510.0090.0431.000
2024-04-13T22:08:15.566775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위도경도제공기관코드업종명영업상태명운반용차량보유여부저장시설보유여부
위도1.000-0.619-0.1890.2510.1780.2450.257
경도-0.6191.0000.1050.2360.2120.3120.216
제공기관코드-0.1890.1051.0000.3030.1700.3030.234
업종명0.2510.2360.3031.0000.2510.2370.336
영업상태명0.1780.2120.1700.2511.0000.0090.043
운반용차량보유여부0.2450.3120.3030.2370.0091.0000.702
저장시설보유여부0.2570.2160.2340.3360.0430.7021.000

Missing values

2024-04-13T22:07:55.246135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-13T22:07:55.828634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-13T22:07:56.217369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시설명업종명소재지도로명주소소재지지번주소위도경도영업상태명운반용차량보유여부저장시설보유여부전화번호관리기관명데이터기준일자제공기관코드제공기관명
0율곡가스기타경상남도 합천군 율곡면 임북길 221<NA>35.578266128.183917영업YY055-933-9944경상남도 합천군청2023-06-095480000경상남도 합천군
1합천대병가스기타경상남도 합천군 대양면 대야로 572-18<NA>35.54005128.170497영업YY055-933-6678경상남도 합천군청2023-06-095480000경상남도 합천군
2주식회사 에스에이팜일반종합도매강원도 강릉시 경강로 2023, 3층 2호 (명주동)<NA>37.749982128.890629영업YN<NA>강원도 강릉시보건소2023-05-314200000강원도 강릉시
3주식회사 파로스일반종합도매강원도 강릉시 사임당로 131-1, 2동 108호 (홍제동)<NA>37.756373128.865135영업YN<NA>강원도 강릉시보건소2023-05-314200000강원도 강릉시
4(주) 렘넌트일반종합도매강원도 강릉시 옥계면 옥계로 1028-12, 1층 104호 (혜진아파트)<NA>37.606882129.032242영업YN<NA>강원도 강릉시보건소2023-05-314200000강원도 강릉시
5은성메디팜 주식회사일반종합도매강원도 강릉시 경포로 84, 2층 (교동)<NA>37.772725128.883246영업YN<NA>강원도 강릉시보건소2023-05-314200000강원도 강릉시
6정약품일반종합도매강원도 강릉시 성덕포남로200번길 6, 2층 (포남동)<NA>37.769468128.907925영업YY<NA>강원도 강릉시보건소2023-05-314200000강원도 강릉시
7(주)관동가스산업기타강원도 강릉시 주문진읍 신리천로 256<NA>37.876207128.805219영업YY<NA>강원도 강릉시보건소2023-05-314200000강원도 강릉시
8주식회사 보리약품일반종합도매강원도 강릉시 임영로176번길 14, 1층 (임당동)<NA>37.756639128.89322영업YY<NA>강원도 강릉시보건소2023-05-314200000강원도 강릉시
9(주)대일가스기타강원도 강릉시 강동면 아래장작골길 68 (대일가스)<NA>37.716552128.965736영업YY<NA>강원도 강릉시보건소2023-05-314200000강원도 강릉시
시설명업종명소재지도로명주소소재지지번주소위도경도영업상태명운반용차량보유여부저장시설보유여부전화번호관리기관명데이터기준일자제공기관코드제공기관명
3093(주)은평바이오일반종합도매서울특별시 은평구 갈현로4길 42, 지층 (신사동)<NA>37.600098126.914223영업YN<NA>서울시 은평구보건소2023-11-273110000서울특별시 은평구
3094(주)비아다빈치 서울지점일반종합도매서울특별시 은평구 통일로 1034, 226-2호 (진관동, 은평 스카이뷰 자이)<NA>37.635281126.917656영업YN<NA>서울시 은평구보건소2023-11-273110000서울특별시 은평구
3095한산바이오팜(주)일반종합도매서울특별시 은평구 불광로 90, 제상가동 1층 147호 (불광동, 대호프라자아파트)<NA>37.615069126.933001영업YY<NA>서울시 은평구보건소2023-11-273110000서울특별시 은평구
3096에스앤제이팜(주)일반종합도매서울특별시 은평구 갈현로 141, 2층 (구산동)<NA>37.610604126.910971영업YY<NA>서울시 은평구보건소2023-11-273110000서울특별시 은평구
3097서울리프한약도매서울특별시 은평구 연서로 93-1 (구산동)<NA>37.607658126.915026영업YY<NA>서울시 은평구보건소2023-11-273110000서울특별시 은평구
3098㈜청원양행일반종합도매서울특별시 은평구 서오릉로 277, 2층,3층 (구산동)<NA>37.618783126.905626영업YY<NA>서울시 은평구보건소2023-11-273110000서울특별시 은평구
3099㈜제이에스팜일반종합도매서울특별시 은평구 은평터널로 134, 4층 (신사동)<NA>37.591041126.905582영업YY<NA>서울시 은평구보건소2023-11-273110000서울특별시 은평구
3100성은메디칼약품(주)일반종합도매서울특별시 은평구 은평로 161, 3층 (응암동)<NA>37.601353126.92559영업YN<NA>서울시 은평구보건소2023-11-273110000서울특별시 은평구
3101㈜원풍약품상사원료의약품도매서울특별시 은평구 가좌로7길 9 (응암동)<NA>37.586572126.920182영업YY<NA>서울시 은평구보건소2023-11-273110000서울특별시 은평구
3102현대고수가스기타전라북도 고창군 고수면 부곡길 107전라북도 고창군 고수면 부곡길 10735.404634126.673555영업YY063-564-3651전라북도 고창군2023-07-204781000전북특별자치도 고창군