Overview

Dataset statistics

Number of variables9
Number of observations5716
Missing cells37
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory413.2 KiB
Average record size in memory74.0 B

Variable types

Numeric1
Categorical4
Text4

Dataset

Description경상남도 관할 대기배출사업장 현황에 대한 데이터로 업체명, 주소지, 업종, 종수, 전화번호 등의 항목을 제공합니다.
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3083438

Alerts

지역 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
전화번호 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
관할지자체 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
연번 is highly overall correlated with 관할지자체 and 2 other fieldsHigh correlation
전화번호 is highly imbalanced (94.6%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-11 00:11:38.309462
Analysis finished2023-12-11 00:11:40.065594
Duration1.76 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct5716
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2859.4673
Minimum1
Maximum5717
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size50.4 KiB
2023-12-11T09:11:40.133307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile287.75
Q11430.75
median2859.5
Q34288.25
95-th percentile5431.25
Maximum5717
Range5716
Interquartile range (IQR)2857.5

Descriptive statistics

Standard deviation1650.2662
Coefficient of variation (CV)0.57712366
Kurtosis-1.1998655
Mean2859.4673
Median Absolute Deviation (MAD)1429
Skewness-0.00010761274
Sum16344715
Variance2723378.6
MonotonicityStrictly increasing
2023-12-11T09:11:40.342540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
3800 1
 
< 0.1%
3820 1
 
< 0.1%
3819 1
 
< 0.1%
3818 1
 
< 0.1%
3817 1
 
< 0.1%
3816 1
 
< 0.1%
3815 1
 
< 0.1%
3814 1
 
< 0.1%
3813 1
 
< 0.1%
Other values (5706) 5706
99.8%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
5717 1
< 0.1%
5716 1
< 0.1%
5715 1
< 0.1%
5714 1
< 0.1%
5713 1
< 0.1%
5712 1
< 0.1%
5711 1
< 0.1%
5710 1
< 0.1%
5709 1
< 0.1%
5708 1
< 0.1%

관할지자체
Categorical

HIGH CORRELATION 

Distinct20
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size44.8 KiB
김해시
2016 
창원시
744 
양산시
696 
함안군
528 
진주시
277 
Other values (15)
1455 

Length

Max length11
Median length3
Mean length3.1201889
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경상남도
2nd row경상남도
3rd row경상남도
4th row경상남도
5th row경상남도

Common Values

ValueCountFrequency (%)
김해시 2016
35.3%
창원시 744
 
13.0%
양산시 696
 
12.2%
함안군 528
 
9.2%
진주시 277
 
4.8%
밀양시 224
 
3.9%
경상남도 207
 
3.6%
창녕군 197
 
3.4%
사천시 178
 
3.1%
거제시 76
 
1.3%
Other values (10) 573
 
10.0%

Length

2023-12-11T09:11:40.485695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
김해시 2016
35.3%
창원시 744
 
13.0%
양산시 696
 
12.2%
함안군 528
 
9.2%
진주시 277
 
4.8%
밀양시 224
 
3.9%
경상남도 207
 
3.6%
창녕군 197
 
3.4%
사천시 178
 
3.1%
거제시 76
 
1.3%
Other values (10) 573
 
10.0%

지역
Categorical

HIGH CORRELATION 

Distinct23
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size44.8 KiB
김해시
2022 
양산시
726 
함안군
560 
진주시
289 
밀양시
229 
Other values (18)
1890 

Length

Max length9
Median length3
Mean length3.6777467
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row거제시
2nd row거제시
3rd row창원시
4th row진주시
5th row진주시

Common Values

ValueCountFrequency (%)
김해시 2022
35.4%
양산시 726
 
12.7%
함안군 560
 
9.8%
진주시 289
 
5.1%
밀양시 229
 
4.0%
창원시 마산회원구 223
 
3.9%
창녕군 202
 
3.5%
사천시 194
 
3.4%
창원시 성산구 188
 
3.3%
창원시 의창구 178
 
3.1%
Other values (13) 905
15.8%

Length

2023-12-11T09:11:40.620679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
김해시 2022
31.0%
창원시 892
13.7%
양산시 726
 
11.1%
함안군 560
 
8.6%
진주시 289
 
4.4%
밀양시 229
 
3.5%
마산회원구 223
 
3.4%
창녕군 202
 
3.1%
사천시 194
 
3.0%
성산구 188
 
2.9%
Other values (13) 995
15.3%
Distinct5453
Distinct (%)95.5%
Missing5
Missing (%)0.1%
Memory size44.8 KiB
2023-12-11T09:11:41.213675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length64
Median length41
Mean length17.786202
Min length5

Characters and Unicode

Total characters101577
Distinct characters426
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5220 ?
Unique (%)91.4%

Sample

1st row거제시 거제대로 3370(아주동)
2nd row거제시 장평3로 80(장평동)
3rd row창원시 마산회원구 자유무역6길 157(봉암동)
4th row진주시 남강로1367번길 36(상대동)
5th row진주시 남강로1367번길 14(상대동)
ValueCountFrequency (%)
경상남도 1059
 
5.0%
창원시 891
 
4.2%
함안군 559
 
2.7%
한림면 497
 
2.4%
주촌면 313
 
1.5%
진주시 294
 
1.4%
성산구 259
 
1.2%
진례면 245
 
1.2%
마산회원구 226
 
1.1%
상동면 224
 
1.1%
Other values (4942) 16417
78.2%
2023-12-11T09:11:41.712959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16520
 
16.3%
1 4894
 
4.8%
4232
 
4.2%
2 3226
 
3.2%
3224
 
3.2%
3082
 
3.0%
3 2755
 
2.7%
4 2369
 
2.3%
5 2235
 
2.2%
- 2113
 
2.1%
Other values (416) 56927
56.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 56324
55.4%
Decimal Number 24190
23.8%
Space Separator 16520
 
16.3%
Dash Punctuation 2113
 
2.1%
Open Punctuation 1144
 
1.1%
Close Punctuation 1144
 
1.1%
Other Punctuation 94
 
0.1%
Uppercase Letter 41
 
< 0.1%
Other Symbol 4
 
< 0.1%
Connector Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4232
 
7.5%
3224
 
5.7%
3082
 
5.5%
1993
 
3.5%
1874
 
3.3%
1736
 
3.1%
1733
 
3.1%
1669
 
3.0%
1569
 
2.8%
1442
 
2.6%
Other values (377) 33770
60.0%
Uppercase Letter
ValueCountFrequency (%)
A 8
19.5%
C 6
14.6%
I 4
9.8%
B 4
9.8%
K 3
 
7.3%
D 3
 
7.3%
T 3
 
7.3%
L 2
 
4.9%
P 2
 
4.9%
G 2
 
4.9%
Other values (4) 4
9.8%
Decimal Number
ValueCountFrequency (%)
1 4894
20.2%
2 3226
13.3%
3 2755
11.4%
4 2369
9.8%
5 2235
9.2%
6 2049
8.5%
9 1826
 
7.5%
7 1786
 
7.4%
0 1577
 
6.5%
8 1473
 
6.1%
Other Punctuation
ValueCountFrequency (%)
, 77
81.9%
· 6
 
6.4%
: 4
 
4.3%
. 3
 
3.2%
" 2
 
2.1%
& 1
 
1.1%
* 1
 
1.1%
Open Punctuation
ValueCountFrequency (%)
( 1143
99.9%
[ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1143
99.9%
] 1
 
0.1%
Space Separator
ValueCountFrequency (%)
16520
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2113
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 56328
55.5%
Common 45208
44.5%
Latin 41
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4232
 
7.5%
3224
 
5.7%
3082
 
5.5%
1993
 
3.5%
1874
 
3.3%
1736
 
3.1%
1733
 
3.1%
1669
 
3.0%
1569
 
2.8%
1442
 
2.6%
Other values (378) 33774
60.0%
Common
ValueCountFrequency (%)
16520
36.5%
1 4894
 
10.8%
2 3226
 
7.1%
3 2755
 
6.1%
4 2369
 
5.2%
5 2235
 
4.9%
- 2113
 
4.7%
6 2049
 
4.5%
9 1826
 
4.0%
7 1786
 
4.0%
Other values (14) 5435
 
12.0%
Latin
ValueCountFrequency (%)
A 8
19.5%
C 6
14.6%
I 4
9.8%
B 4
9.8%
K 3
 
7.3%
D 3
 
7.3%
T 3
 
7.3%
L 2
 
4.9%
P 2
 
4.9%
G 2
 
4.9%
Other values (4) 4
9.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 56323
55.4%
ASCII 45243
44.5%
None 10
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
16520
36.5%
1 4894
 
10.8%
2 3226
 
7.1%
3 2755
 
6.1%
4 2369
 
5.2%
5 2235
 
4.9%
- 2113
 
4.7%
6 2049
 
4.5%
9 1826
 
4.0%
7 1786
 
3.9%
Other values (27) 5470
 
12.1%
Hangul
ValueCountFrequency (%)
4232
 
7.5%
3224
 
5.7%
3082
 
5.5%
1993
 
3.5%
1874
 
3.3%
1736
 
3.1%
1733
 
3.1%
1669
 
3.0%
1569
 
2.8%
1442
 
2.6%
Other values (376) 33769
60.0%
None
ValueCountFrequency (%)
· 6
60.0%
4
40.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct5518
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Memory size44.8 KiB
2023-12-11T09:11:41.925450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length68
Median length38
Mean length7.11669
Min length2

Characters and Unicode

Total characters40679
Distinct characters662
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5347 ?
Unique (%)93.5%

Sample

1st row대우조선해양(주)
2nd row삼성중공업(주)거제조선소
3rd row(주)동남
4th row남강제지(주)
5th row진주특종제지(주)
ValueCountFrequency (%)
주식회사 131
 
2.1%
2공장 33
 
0.5%
제2공장 23
 
0.4%
농업회사법인 16
 
0.3%
유한책임회사 10
 
0.2%
김해공장 9
 
0.1%
9
 
0.1%
한국가스공사 8
 
0.1%
3공장 8
 
0.1%
김해지점 8
 
0.1%
Other values (5684) 6122
96.0%
2023-12-11T09:11:42.277371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2369
 
5.8%
1458
 
3.6%
( 1363
 
3.4%
) 1362
 
3.3%
1191
 
2.9%
1077
 
2.6%
872
 
2.1%
857
 
2.1%
749
 
1.8%
702
 
1.7%
Other values (652) 28679
70.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 33639
82.7%
Other Symbol 2369
 
5.8%
Open Punctuation 1374
 
3.4%
Close Punctuation 1373
 
3.4%
Space Separator 702
 
1.7%
Uppercase Letter 591
 
1.5%
Decimal Number 328
 
0.8%
Other Punctuation 263
 
0.6%
Lowercase Letter 15
 
< 0.1%
Dash Punctuation 14
 
< 0.1%
Other values (3) 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1458
 
4.3%
1191
 
3.5%
1077
 
3.2%
872
 
2.6%
857
 
2.5%
749
 
2.2%
680
 
2.0%
640
 
1.9%
605
 
1.8%
576
 
1.7%
Other values (589) 24934
74.1%
Uppercase Letter
ValueCountFrequency (%)
S 71
 
12.0%
C 64
 
10.8%
E 46
 
7.8%
M 46
 
7.8%
T 39
 
6.6%
H 35
 
5.9%
P 31
 
5.2%
N 29
 
4.9%
R 27
 
4.6%
G 25
 
4.2%
Other values (14) 178
30.1%
Decimal Number
ValueCountFrequency (%)
2 168
51.2%
1 97
29.6%
3 35
 
10.7%
0 7
 
2.1%
5 7
 
2.1%
4 6
 
1.8%
6 3
 
0.9%
8 3
 
0.9%
7 1
 
0.3%
9 1
 
0.3%
Lowercase Letter
ValueCountFrequency (%)
n 3
20.0%
i 2
13.3%
o 2
13.3%
r 2
13.3%
g 2
13.3%
f 1
 
6.7%
t 1
 
6.7%
m 1
 
6.7%
s 1
 
6.7%
Other Punctuation
ValueCountFrequency (%)
. 189
71.9%
: 27
 
10.3%
& 25
 
9.5%
, 12
 
4.6%
/ 4
 
1.5%
* 3
 
1.1%
· 2
 
0.8%
1
 
0.4%
Math Symbol
ValueCountFrequency (%)
~ 7
77.8%
> 1
 
11.1%
< 1
 
11.1%
Open Punctuation
ValueCountFrequency (%)
( 1363
99.2%
[ 11
 
0.8%
Close Punctuation
ValueCountFrequency (%)
) 1362
99.2%
] 11
 
0.8%
Other Symbol
ValueCountFrequency (%)
2369
100.0%
Space Separator
ValueCountFrequency (%)
702
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 36008
88.5%
Common 4064
 
10.0%
Latin 607
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2369
 
6.6%
1458
 
4.0%
1191
 
3.3%
1077
 
3.0%
872
 
2.4%
857
 
2.4%
749
 
2.1%
680
 
1.9%
640
 
1.8%
605
 
1.7%
Other values (590) 25510
70.8%
Latin
ValueCountFrequency (%)
S 71
 
11.7%
C 64
 
10.5%
E 46
 
7.6%
M 46
 
7.6%
T 39
 
6.4%
H 35
 
5.8%
P 31
 
5.1%
N 29
 
4.8%
R 27
 
4.4%
G 25
 
4.1%
Other values (24) 194
32.0%
Common
ValueCountFrequency (%)
( 1363
33.5%
) 1362
33.5%
702
17.3%
. 189
 
4.7%
2 168
 
4.1%
1 97
 
2.4%
3 35
 
0.9%
: 27
 
0.7%
& 25
 
0.6%
- 14
 
0.3%
Other values (18) 82
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 33639
82.7%
ASCII 4667
 
11.5%
None 2372
 
5.8%
Number Forms 1
 
< 0.1%

Most frequent character per block

None
ValueCountFrequency (%)
2369
99.9%
· 2
 
0.1%
1
 
< 0.1%
Hangul
ValueCountFrequency (%)
1458
 
4.3%
1191
 
3.5%
1077
 
3.2%
872
 
2.6%
857
 
2.5%
749
 
2.2%
680
 
2.0%
640
 
1.9%
605
 
1.8%
576
 
1.7%
Other values (589) 24934
74.1%
ASCII
ValueCountFrequency (%)
( 1363
29.2%
) 1362
29.2%
702
15.0%
. 189
 
4.0%
2 168
 
3.6%
1 97
 
2.1%
S 71
 
1.5%
C 64
 
1.4%
E 46
 
1.0%
M 46
 
1.0%
Other values (49) 559
12.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct63
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size44.8 KiB
2023-12-11T09:11:42.495054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length4
Mean length4.0181945
Min length2

Characters and Unicode

Total characters22968
Distinct characters110
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)0.6%

Sample

1st row대표이사
2nd row대표이사
3rd row대표이사
4th row대표이사
5th row대표이사
ValueCountFrequency (%)
대표이사 5547
96.9%
조합장 20
 
0.3%
이사장 14
 
0.2%
김해시장 10
 
0.2%
부산경남지역본부장 10
 
0.2%
병원장 8
 
0.1%
진주시장 7
 
0.1%
창원시장 5
 
0.1%
사천시장 4
 
0.1%
창녕군수 4
 
0.1%
Other values (58) 96
 
1.7%
2023-12-11T09:11:42.844779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5571
24.3%
5561
24.2%
5553
24.2%
5549
24.2%
140
 
0.6%
46
 
0.2%
31
 
0.1%
27
 
0.1%
27
 
0.1%
27
 
0.1%
Other values (100) 436
 
1.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22955
99.9%
Space Separator 9
 
< 0.1%
Decimal Number 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5571
24.3%
5561
24.2%
5553
24.2%
5549
24.2%
140
 
0.6%
46
 
0.2%
31
 
0.1%
27
 
0.1%
27
 
0.1%
27
 
0.1%
Other values (95) 423
 
1.8%
Decimal Number
ValueCountFrequency (%)
2 1
25.0%
8 1
25.0%
7 1
25.0%
9 1
25.0%
Space Separator
ValueCountFrequency (%)
9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22955
99.9%
Common 13
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5571
24.3%
5561
24.2%
5553
24.2%
5549
24.2%
140
 
0.6%
46
 
0.2%
31
 
0.1%
27
 
0.1%
27
 
0.1%
27
 
0.1%
Other values (95) 423
 
1.8%
Common
ValueCountFrequency (%)
9
69.2%
2 1
 
7.7%
8 1
 
7.7%
7 1
 
7.7%
9 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22955
99.9%
ASCII 13
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5571
24.3%
5561
24.2%
5553
24.2%
5549
24.2%
140
 
0.6%
46
 
0.2%
31
 
0.1%
27
 
0.1%
27
 
0.1%
27
 
0.1%
Other values (95) 423
 
1.8%
ASCII
ValueCountFrequency (%)
9
69.2%
2 1
 
7.7%
8 1
 
7.7%
7 1
 
7.7%
9 1
 
7.7%

업종
Text

Distinct1996
Distinct (%)35.1%
Missing32
Missing (%)0.6%
Memory size44.8 KiB
2023-12-11T09:11:43.063323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length41
Mean length10.896728
Min length2

Characters and Unicode

Total characters61937
Distinct characters398
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1347 ?
Unique (%)23.7%

Sample

1st row선박건조 및 수리업
2nd row선박, 해양플랜트
3rd row비금속광물
4th row제지
5th row제지
ValueCountFrequency (%)
제조업 795
 
7.8%
714
 
7.0%
기타 285
 
2.8%
자동차 216
 
2.1%
수리업 192
 
1.9%
금속제품제조 124
 
1.2%
도장및기타피막처리업 108
 
1.1%
자동차종합수리업 107
 
1.1%
종합 103
 
1.0%
103
 
1.0%
Other values (2149) 7434
73.0%
2023-12-11T09:11:43.419748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5000
 
8.1%
4639
 
7.5%
4632
 
7.5%
4044
 
6.5%
2283
 
3.7%
2073
 
3.3%
1457
 
2.4%
1326
 
2.1%
1260
 
2.0%
1212
 
2.0%
Other values (388) 34011
54.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 53431
86.3%
Space Separator 4632
 
7.5%
Decimal Number 2447
 
4.0%
Open Punctuation 499
 
0.8%
Close Punctuation 497
 
0.8%
Other Punctuation 430
 
0.7%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5000
 
9.4%
4639
 
8.7%
4044
 
7.6%
2283
 
4.3%
2073
 
3.9%
1457
 
2.7%
1326
 
2.5%
1260
 
2.4%
1212
 
2.3%
1152
 
2.2%
Other values (368) 28985
54.2%
Decimal Number
ValueCountFrequency (%)
2 679
27.7%
1 525
21.5%
3 386
15.8%
9 306
12.5%
0 203
 
8.3%
5 121
 
4.9%
8 93
 
3.8%
4 87
 
3.6%
6 27
 
1.1%
7 20
 
0.8%
Other Punctuation
ValueCountFrequency (%)
, 406
94.4%
. 8
 
1.9%
· 7
 
1.6%
; 6
 
1.4%
/ 2
 
0.5%
: 1
 
0.2%
Space Separator
ValueCountFrequency (%)
4632
100.0%
Open Punctuation
ValueCountFrequency (%)
( 499
100.0%
Close Punctuation
ValueCountFrequency (%)
) 497
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 53431
86.3%
Common 8506
 
13.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5000
 
9.4%
4639
 
8.7%
4044
 
7.6%
2283
 
4.3%
2073
 
3.9%
1457
 
2.7%
1326
 
2.5%
1260
 
2.4%
1212
 
2.3%
1152
 
2.2%
Other values (368) 28985
54.2%
Common
ValueCountFrequency (%)
4632
54.5%
2 679
 
8.0%
1 525
 
6.2%
( 499
 
5.9%
) 497
 
5.8%
, 406
 
4.8%
3 386
 
4.5%
9 306
 
3.6%
0 203
 
2.4%
5 121
 
1.4%
Other values (10) 252
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 53431
86.3%
ASCII 8499
 
13.7%
None 7
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5000
 
9.4%
4639
 
8.7%
4044
 
7.6%
2283
 
4.3%
2073
 
3.9%
1457
 
2.7%
1326
 
2.5%
1260
 
2.4%
1212
 
2.3%
1152
 
2.2%
Other values (368) 28985
54.2%
ASCII
ValueCountFrequency (%)
4632
54.5%
2 679
 
8.0%
1 525
 
6.2%
( 499
 
5.9%
) 497
 
5.8%
, 406
 
4.8%
3 386
 
4.5%
9 306
 
3.6%
0 203
 
2.4%
5 121
 
1.4%
Other values (9) 245
 
2.9%
None
ValueCountFrequency (%)
· 7
100.0%

종별
Categorical

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size44.8 KiB
5
3060 
4
2113 
3
 
258
2
 
176
1
 
108

Length

Max length4
Median length1
Mean length1.0005248
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row1
2nd row1
3rd row2
4th row1
5th row2

Common Values

ValueCountFrequency (%)
5 3060
53.5%
4 2113
37.0%
3 258
 
4.5%
2 176
 
3.1%
1 108
 
1.9%
<NA> 1
 
< 0.1%

Length

2023-12-11T09:11:43.546839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:11:43.656314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5 3060
53.5%
4 2113
37.0%
3 258
 
4.5%
2 176
 
3.1%
1 108
 
1.9%
na 1
 
< 0.1%

전화번호
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct31
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size44.8 KiB
<NA>
5510 
055-256-7161
 
177
055-735-9188
 
1
055-851-7500
 
1
055-280-2721
 
1
Other values (26)
 
26

Length

Max length13
Median length4
Mean length4.2891882
Min length4

Unique

Unique29 ?
Unique (%)0.5%

Sample

1st row055-735-9188
2nd row055-630-5373
3rd row055-256-7161
4th row055-256-7161
5th row055-256-7161

Common Values

ValueCountFrequency (%)
<NA> 5510
96.4%
055-256-7161 177
 
3.1%
055-735-9188 1
 
< 0.1%
055-851-7500 1
 
< 0.1%
055-280-2721 1
 
< 0.1%
070-4761-0181 1
 
< 0.1%
070-8878-1541 1
 
< 0.1%
070-8460-3100 1
 
< 0.1%
070-4618-3407 1
 
< 0.1%
055-386-4177 1
 
< 0.1%
Other values (21) 21
 
0.4%

Length

2023-12-11T09:11:43.771089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 5510
96.4%
055-256-7161 177
 
3.1%
055-580-7313 1
 
< 0.1%
055-981-0600 1
 
< 0.1%
055-312-7345 1
 
< 0.1%
055-391-3161 1
 
< 0.1%
055-584-9181 1
 
< 0.1%
055-263-4222 1
 
< 0.1%
055-275-2911 1
 
< 0.1%
055-548-0700 1
 
< 0.1%
Other values (21) 21
 
0.4%

Interactions

2023-12-11T09:11:39.601177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:11:43.855519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번관할지자체지역대표자종별전화번호
연번1.0000.9710.9430.2590.376NaN
관할지자체0.9711.0000.9970.6210.608NaN
지역0.9430.9971.0000.6370.4240.716
대표자0.2590.6210.6371.0000.2690.877
종별0.3760.6080.4240.2691.0000.000
전화번호NaNNaN0.7160.8770.0001.000
2023-12-11T09:11:43.981514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역전화번호종별관할지자체
지역1.0000.2990.2240.959
전화번호0.2991.0000.0001.000
종별0.2240.0001.0000.308
관할지자체0.9591.0000.3081.000
2023-12-11T09:11:44.105106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번관할지자체지역종별전화번호
연번1.0000.7220.7370.1651.000
관할지자체0.7221.0000.9590.3081.000
지역0.7370.9591.0000.2240.299
종별0.1650.3080.2241.0000.000
전화번호1.0001.0000.2990.0001.000

Missing values

2023-12-11T09:11:39.742563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:11:39.884818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T09:11:40.002579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번관할지자체지역도로명주소업체명대표자업종종별전화번호
01경상남도거제시거제시 거제대로 3370(아주동)대우조선해양(주)대표이사선박건조 및 수리업1055-735-9188
12경상남도거제시거제시 장평3로 80(장평동)삼성중공업(주)거제조선소대표이사선박, 해양플랜트1055-630-5373
23경상남도창원시창원시 마산회원구 자유무역6길 157(봉암동)(주)동남대표이사비금속광물2055-256-7161
34경상남도진주시진주시 남강로1367번길 36(상대동)남강제지(주)대표이사제지1055-256-7161
45경상남도진주시진주시 남강로1367번길 14(상대동)진주특종제지(주)대표이사제지2055-256-7161
56경상남도진주시진주시 대신로 146(상평동)(주)신흥[상평공단]대표이사고무제품제조업2055-256-7161
67경상남도진주시진주시 도동로 57(상평동)이누스(주)진주지점대표이사비금속광물제조1055-256-7161
78경상남도진주시진주시 도동로 58(상평동)하이트진로산업㈜진주공장대표이사비금속광물1055-256-7161
89경상남도진주시진주시 남강로 1303(상평동)동일팩키지(주)진주공장대표이사제지1055-256-7161
910경상남도진주시진주시 남강로 1003(상평동)무림페이퍼(주)대표이사제지2055-256-7161
연번관할지자체지역도로명주소업체명대표자업종종별전화번호
57065708부산진해경제자유구역청창원시 진해구창원시 진해구 남의로21번길 69삼진엔텍대표이사코아제조5<NA>
57075709부산진해경제자유구역청창원시 진해구창원시 진해구 남의로21번길 71삼진산업대표이사코아제조5<NA>
57085710부산진해경제자유구역청창원시 진해구창원시 진해구 남영로544번길 57제우산업대표이사조립금속제품제조5<NA>
57095711부산진해경제자유구역청창원시 진해구창원시 진해구 남영로 15(남문동)한국쯔바키모토오토모티브㈜대표이사자동차엔진용품제조5<NA>
57105712부산진해경제자유구역청창원시 진해구창원시 진해구 남의로71번길 35㈜대웅셀테크대표이사주형및금형제조업5<NA>
57115713부산진해경제자유구역청창원시 진해구창원시 진해구 신항북로 320(용원동)세방(주)대표이사운수장비수선및세차또는세척시설5<NA>
57125714부산진해경제자유구역청창원시 진해구창원시 진해구 남의로 68(남양동)동양제강㈜진해사업소대표이사끈 및 로프제조업5<NA>
57135715부산진해경제자유구역청창원시 진해구창원시 진해구 웅천서로41번길 20㈜케이.에스.엠대표이사디지털적층성형기계제조업5<NA>
57145716부산진해경제자유구역청창원시 진해구창원시 진해구 남영로564번길 45우진산업대표이사자동차 엔진용 부품제조업5<NA>
57155717부산진해경제자유구역청창원시 진해구창원시 진해구 웅천동로43번길 81도코다카오카코리아㈜대표이사전자코일, 변성기 및 기타 전자유도자 제조업5<NA>