Overview

Dataset statistics

Number of variables: 9
Number of observations: 859
Missing cells: 79
Missing cells (%): 1.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 62.2 KiB
Average record size in memory: 74.2 B
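
The derived figures above can be cross-checked from the raw counts. The sketch below is only a consistency check under stated assumptions: the per-column missing counts are read from the individual variable sections of this report (column names as in the Sample table header), and 1 KiB is taken to be 1024 bytes.

```python
# Figures copied from this report; nothing here is recomputed from the raw data.
n_rows, n_cols = 859, 9

# Missing counts per column, as listed in the variable sections.
missing_by_column = {"전화번호": 18, "팩스번호": 56, "업종명": 1, "생산품": 4}

missing_cells = sum(missing_by_column.values())
missing_pct = 100 * missing_cells / (n_rows * n_cols)

# Assumption: 1 KiB = 1024 bytes. The reported total is itself rounded,
# so the average only matches the report approximately.
total_bytes = 62.2 * 1024
avg_record_bytes = total_bytes / n_rows

print(missing_cells)           # total missing cells
print(round(missing_pct, 1))   # missing cells as a percentage of all cells
print(avg_record_bytes)        # average bytes per row (approximate)
```

The four per-column counts add up to the 79 missing cells reported, and the percentage is taken over all 859 × 9 cells.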

Variable types

Numeric: 2
Text: 6
DateTime: 1

Dataset

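Description: 부산광역시사하구_공장등록_20230531 (Busan Metropolitan City, Saha-gu: factory registrations, 2023-05-31)
Author: 부산광역시 사하구 (Saha-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15100833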

Alerts

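데이터 기준일자 (data reference date) has a constant value [Constant]
전화번호 (phone number) has 18 (2.1%) missing values [Missing]
팩스번호 (fax number) has 56 (6.5%) missing values [Missing]
연번 (serial number) has unique values [Unique]
종업원수 (number of employees) has 16 (1.9%) zeros [Zeros]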

Reproduction

Analysis started: 2023-12-10 16:59:23.732589
Analysis finished: 2023-12-10 16:59:26.139607
Duration: 2.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE

Distinct: 859
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 430
Minimum: 1
Maximum: 859
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 7.7 KiB

Quantile statistics

Minimum: 1
5-th percentile: 43.9
Q1: 215.5
Median: 430
Q3: 644.5
95-th percentile: 816.1
Maximum: 859
Range: 858
Interquartile range (IQR): 429

Descriptive statistics

Standard deviation: 248.11624
Coefficient of variation (CV): 0.5770145
Kurtosis: -1.2
Mean: 430
Median Absolute Deviation (MAD): 215
Skewness: 0
Sum: 369370
Variance: 61561.667
Monotonicity: Strictly increasing
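
Because 연번 is unique, strictly increasing, and runs from 1 to 859, these statistics can be reproduced without the data file. The sketch below assumes the column is exactly the integers 1..859 and that percentiles use linear interpolation between closest ranks; both are assumptions, not something the report states.

```python
import statistics


def percentile(sorted_values, p):
    """Percentile p (0-100) using linear interpolation between closest ranks."""
    position = (len(sorted_values) - 1) * p / 100
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


# Assumption: 연번 is exactly the sequence 1, 2, ..., 859.
values = list(range(1, 860))

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation
cv = std / mean

print("sum:", sum(values))
print("mean:", mean, "median:", median, "MAD:", mad)
print("variance:", variance, "std:", std, "CV:", cv)
print("5th / Q1 / Q3 / 95th:",
      percentile(values, 5), percentile(values, 25),
      percentile(values, 75), percentile(values, 95))
```

Under those assumptions the results agree with the reported figures, which suggests the report uses the sample (n − 1) variance.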
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | 0.1%
566 | 1 | 0.1%
568 | 1 | 0.1%
569 | 1 | 0.1%
570 | 1 | 0.1%
571 | 1 | 0.1%
572 | 1 | 0.1%
573 | 1 | 0.1%
574 | 1 | 0.1%
575 | 1 | 0.1%
Other values (849) | 849 | 98.8%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.1%
2 | 1 | 0.1%
3 | 1 | 0.1%
4 | 1 | 0.1%
5 | 1 | 0.1%
6 | 1 | 0.1%
7 | 1 | 0.1%
8 | 1 | 0.1%
9 | 1 | 0.1%
10 | 1 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
859 | 1 | 0.1%
858 | 1 | 0.1%
857 | 1 | 0.1%
856 | 1 | 0.1%
855 | 1 | 0.1%
854 | 1 | 0.1%
853 | 1 | 0.1%
852 | 1 | 0.1%
851 | 1 | 0.1%
850 | 1 | 0.1%

회사명 (company name)
Text

Distinct: 829
Distinct (%): 96.5%
Missing: 0
Missing (%): 0.0%
Memory size: 6.8 KiB

Length

Max length: 31
Median length: 16
Mean length: 6.8405122
Min length: 2

Characters and Unicode

Total characters: 5876
Distinct characters: 393
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
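
As an illustration of that idea, the sketch below counts the Unicode general categories in a single value using Python's standard `unicodedata` module. It is not the profiling library's own implementation; `category_counts` is a hypothetical helper, and the input string is one of the sample values of this variable.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# One of the sample values listed for this variable.
sample = "(주)HN푸드"
counts = category_counts(sample)

# Two-letter codes: Lo = Other Letter, Lu = Uppercase Letter,
# Ps = Open Punctuation, Pe = Close Punctuation.
for category, count in sorted(counts.items()):
    print(category, count)
```

Summing such counts over every value of a column gives totals of the kind shown in the category tables of this report.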

Unique

Unique: 803
Unique (%): 93.5%

Sample

1st row: (사)골든장애인협회 사업단
2nd row: (사)하나복지회 피복사업장
3rd row: (주) 예맥통상
4th row: (주)HN푸드
5th row: (주)Y2산업
Value | Count | Frequency (%)
주식회사 | 52 | 5.5%
삼우티이에스(주 | 4 | 0.4%
사단법인 | 3 | 0.3%
주)바이넥스 | 3 | 0.3%
제2공장 | 3 | 0.3%
탱크테크(주 | 3 | 0.3%
큰바위얼굴f&b | 2 | 0.2%
부산전기 | 2 | 0.2%
주)신우농수산 | 2 | 0.2%
부산공장 | 2 | 0.2%
Other values (847) | 870 | 92.0%

Most occurring characters

ValueCountFrequency (%)
472
 
8.0%
( 419
 
7.1%
) 419
 
7.1%
144
 
2.5%
142
 
2.4%
138
 
2.3%
133
 
2.3%
104
 
1.8%
92
 
1.6%
89
 
1.5%
Other values (383) 3724
63.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4800
81.7%
Open Punctuation 419
 
7.1%
Close Punctuation 419
 
7.1%
Uppercase Letter 121
 
2.1%
Space Separator 88
 
1.5%
Other Punctuation 14
 
0.2%
Decimal Number 9
 
0.2%
Lowercase Letter 5
 
0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
472
 
9.8%
144
 
3.0%
142
 
3.0%
138
 
2.9%
133
 
2.8%
104
 
2.2%
92
 
1.9%
89
 
1.9%
78
 
1.6%
74
 
1.5%
Other values (347) 3334
69.5%
Uppercase Letter
ValueCountFrequency (%)
S 14
 
11.6%
G 11
 
9.1%
M 10
 
8.3%
E 10
 
8.3%
T 9
 
7.4%
N 8
 
6.6%
C 6
 
5.0%
F 6
 
5.0%
R 6
 
5.0%
A 6
 
5.0%
Other values (12) 35
28.9%
Lowercase Letter
ValueCountFrequency (%)
a 1
20.0%
g 1
20.0%
u 1
20.0%
l 1
20.0%
e 1
20.0%
Other Punctuation
ValueCountFrequency (%)
& 7
50.0%
. 6
42.9%
, 1
 
7.1%
Decimal Number
ValueCountFrequency (%)
2 8
88.9%
5 1
 
11.1%
Open Punctuation
ValueCountFrequency (%)
( 419
100.0%
Close Punctuation
ValueCountFrequency (%)
) 419
100.0%
Space Separator
ValueCountFrequency (%)
88
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4800
81.7%
Common 950
 
16.2%
Latin 126
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
472
 
9.8%
144
 
3.0%
142
 
3.0%
138
 
2.9%
133
 
2.8%
104
 
2.2%
92
 
1.9%
89
 
1.9%
78
 
1.6%
74
 
1.5%
Other values (347) 3334
69.5%
Latin
ValueCountFrequency (%)
S 14
 
11.1%
G 11
 
8.7%
M 10
 
7.9%
E 10
 
7.9%
T 9
 
7.1%
N 8
 
6.3%
C 6
 
4.8%
F 6
 
4.8%
R 6
 
4.8%
A 6
 
4.8%
Other values (17) 40
31.7%
Common
ValueCountFrequency (%)
( 419
44.1%
) 419
44.1%
88
 
9.3%
2 8
 
0.8%
& 7
 
0.7%
. 6
 
0.6%
, 1
 
0.1%
- 1
 
0.1%
5 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4800
81.7%
ASCII 1076
 
18.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
472
 
9.8%
144
 
3.0%
142
 
3.0%
138
 
2.9%
133
 
2.8%
104
 
2.2%
92
 
1.9%
89
 
1.9%
78
 
1.6%
74
 
1.5%
Other values (347) 3334
69.5%
ASCII
ValueCountFrequency (%)
( 419
38.9%
) 419
38.9%
88
 
8.2%
S 14
 
1.3%
G 11
 
1.0%
M 10
 
0.9%
E 10
 
0.9%
T 9
 
0.8%
2 8
 
0.7%
N 8
 
0.7%
Other values (26) 80
 
7.4%

주소 (address)
Text

Distinct: 771
Distinct (%): 89.8%
Missing: 0
Missing (%): 0.0%
Memory size: 6.8 KiB

Length

Max length: 61
Median length: 55
Mean length: 28.185099
Min length: 18

Characters and Unicode

Total characters: 24211
Distinct characters: 144
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 697
Unique (%): 81.1%

Sample

1st row: 부산광역시 사하구 보덕포1길 100, B동, 지하1층 (장림동)
2nd row: 부산광역시 사하구 다대로354번안길 20, 2층(장림동)
3rd row: 부산광역시 사하구 비봉로 56 (신평동)
4th row: 부산광역시 사하구 하신중앙로27번길 15 (장림동)
5th row: 부산광역시 사하구 장림로 100 (장림동)
ValueCountFrequency (%)
부산광역시 859
18.5%
사하구 859
18.5%
장림동 390
 
8.4%
구평동 192
 
4.1%
신평동 84
 
1.8%
66
 
1.4%
감천항로 40
 
0.9%
장평로 39
 
0.8%
감천동 38
 
0.8%
다대동 37
 
0.8%
Other values (578) 2045
44.0%

Most occurring characters

ValueCountFrequency (%)
3790
 
15.7%
1113
 
4.6%
1055
 
4.4%
917
 
3.8%
889
 
3.7%
874
 
3.6%
( 871
 
3.6%
) 871
 
3.6%
868
 
3.6%
860
 
3.6%
Other values (134) 12103
50.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14572
60.2%
Decimal Number 3791
 
15.7%
Space Separator 3790
 
15.7%
Open Punctuation 871
 
3.6%
Close Punctuation 871
 
3.6%
Other Punctuation 169
 
0.7%
Dash Punctuation 138
 
0.6%
Uppercase Letter 8
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1113
 
7.6%
1055
 
7.2%
917
 
6.3%
889
 
6.1%
874
 
6.0%
868
 
6.0%
860
 
5.9%
859
 
5.9%
859
 
5.9%
818
 
5.6%
Other values (115) 5460
37.5%
Decimal Number
ValueCountFrequency (%)
1 743
19.6%
2 491
13.0%
3 444
11.7%
4 397
10.5%
5 392
10.3%
0 360
9.5%
7 298
7.9%
6 283
 
7.5%
9 199
 
5.2%
8 184
 
4.9%
Uppercase Letter
ValueCountFrequency (%)
B 5
62.5%
A 2
 
25.0%
S 1
 
12.5%
Space Separator
ValueCountFrequency (%)
3790
100.0%
Open Punctuation
ValueCountFrequency (%)
( 871
100.0%
Close Punctuation
ValueCountFrequency (%)
) 871
100.0%
Other Punctuation
ValueCountFrequency (%)
, 169
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 138
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14572
60.2%
Common 9631
39.8%
Latin 8
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1113
 
7.6%
1055
 
7.2%
917
 
6.3%
889
 
6.1%
874
 
6.0%
868
 
6.0%
860
 
5.9%
859
 
5.9%
859
 
5.9%
818
 
5.6%
Other values (115) 5460
37.5%
Common
ValueCountFrequency (%)
3790
39.4%
( 871
 
9.0%
) 871
 
9.0%
1 743
 
7.7%
2 491
 
5.1%
3 444
 
4.6%
4 397
 
4.1%
5 392
 
4.1%
0 360
 
3.7%
7 298
 
3.1%
Other values (6) 974
 
10.1%
Latin
ValueCountFrequency (%)
B 5
62.5%
A 2
 
25.0%
S 1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14572
60.2%
ASCII 9639
39.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3790
39.3%
( 871
 
9.0%
) 871
 
9.0%
1 743
 
7.7%
2 491
 
5.1%
3 444
 
4.6%
4 397
 
4.1%
5 392
 
4.1%
0 360
 
3.7%
7 298
 
3.1%
Other values (9) 982
 
10.2%
Hangul
ValueCountFrequency (%)
1113
 
7.6%
1055
 
7.2%
917
 
6.3%
889
 
6.1%
874
 
6.0%
868
 
6.0%
860
 
5.9%
859
 
5.9%
859
 
5.9%
818
 
5.6%
Other values (115) 5460
37.5%

종업원수 (number of employees)
Real number (ℝ)

ZEROS

Distinct: 80
Distinct (%): 9.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 15.429569
Minimum: 0
Maximum: 388
Zeros: 16
Zeros (%): 1.9%
Negative: 0
Negative (%): 0.0%
Memory size: 7.7 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 4
Median: 8
Q3: 16
95-th percentile: 49
Maximum: 388
Range: 388
Interquartile range (IQR): 12

Descriptive statistics

Standard deviation: 29.036961
Coefficient of variation (CV): 1.8819035
Kurtosis: 67.275901
Mean: 15.429569
Median Absolute Deviation (MAD): 5
Skewness: 7.1040463
Sum: 13254
Variance: 843.14509
Monotonicity: Not monotonic
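
Several of these figures are functions of one another, so they can be checked against each other without the raw data. The sketch below starts only from numbers reported in this section (row count, sum, standard deviation, quartiles, zero count) and rederives the rest. It is a consistency check, not a recomputation from the dataset.

```python
# Inputs copied from this report.
n = 859                 # number of observations
total = 13254           # Sum
std = 29.036961         # Standard deviation
q1, q3 = 4, 16          # Q1 and Q3
zeros = 16              # Zeros

mean = total / n                 # Mean = Sum / n
cv = std / mean                  # Coefficient of variation = std / mean
variance = std ** 2              # Variance = std squared
iqr = q3 - q1                    # Interquartile range
zeros_pct = 100 * zeros / n      # Zeros (%)

print("mean:", mean)
print("CV:", cv)
print("variance:", variance)
print("IQR:", iqr)
print("zeros (%):", round(zeros_pct, 1))
```

The mean sits well above the median of 8 and the CV is close to 1.9, which fits the strong right skew reported for this variable: a few large employers alongside many small ones.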
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
4 | 67 | 7.8%
3 | 63 | 7.3%
2 | 61 | 7.1%
5 | 60 | 7.0%
6 | 54 | 6.3%
10 | 50 | 5.8%
7 | 40 | 4.7%
1 | 39 | 4.5%
9 | 37 | 4.3%
8 | 36 | 4.2%
Other values (70) | 352 | 41.0%

Minimum 10 values

Value | Count | Frequency (%)
0 | 16 | 1.9%
1 | 39 | 4.5%
2 | 61 | 7.1%
3 | 63 | 7.3%
4 | 67 | 7.8%
5 | 60 | 7.0%
6 | 54 | 6.3%
7 | 40 | 4.7%
8 | 36 | 4.2%
9 | 37 | 4.3%

Maximum 10 values

Value | Count | Frequency (%)
388 | 1 | 0.1%
320 | 1 | 0.1%
299 | 1 | 0.1%
250 | 1 | 0.1%
239 | 1 | 0.1%
196 | 1 | 0.1%
156 | 1 | 0.1%
144 | 1 | 0.1%
134 | 1 | 0.1%
128 | 1 | 0.1%

전화번호 (phone number)
Text

MISSING

Distinct: 804
Distinct (%): 95.6%
Missing: 18
Missing (%): 2.1%
Memory size: 6.8 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.023781
Min length: 11

Characters and Unicode

Total characters: 10112
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 771
Unique (%): 91.7%

Sample

1st row: 051-311-1791
2nd row: 051-262-6911
3rd row: 051-207-9393
4th row: 051-263-7249
5th row: 051-264-4714
Value | Count | Frequency (%)
051-412-0647 | 5 | 0.6%
051-205-8911 | 3 | 0.4%
051-418-6141 | 2 | 0.2%
051-261-3551 | 2 | 0.2%
051-293-7500 | 2 | 0.2%
051-262-8827 | 2 | 0.2%
051-261-4977 | 2 | 0.2%
051-205-3105 | 2 | 0.2%
051-263-0125 | 2 | 0.2%
051-261-7322 | 2 | 0.2%
Other values (794) | 817 | 97.1%

Most occurring characters

ValueCountFrequency (%)
- 1682
16.6%
0 1471
14.5%
1 1448
14.3%
5 1269
12.5%
2 1137
11.2%
6 899
8.9%
4 526
 
5.2%
3 489
 
4.8%
7 444
 
4.4%
8 390
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 8430
83.4%
Dash Punctuation 1682
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1471
17.4%
1 1448
17.2%
5 1269
15.1%
2 1137
13.5%
6 899
10.7%
4 526
 
6.2%
3 489
 
5.8%
7 444
 
5.3%
8 390
 
4.6%
9 357
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 1682
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 10112
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1682
16.6%
0 1471
14.5%
1 1448
14.3%
5 1269
12.5%
2 1137
11.2%
6 899
8.9%
4 526
 
5.2%
3 489
 
4.8%
7 444
 
4.4%
8 390
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 10112
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1682
16.6%
0 1471
14.5%
1 1448
14.3%
5 1269
12.5%
2 1137
11.2%
6 899
8.9%
4 526
 
5.2%
3 489
 
4.8%
7 444
 
4.4%
8 390
 
3.9%

팩스번호 (fax number)
Text

MISSING

Distinct: 753
Distinct (%): 93.8%
Missing: 56
Missing (%): 6.5%
Memory size: 6.8 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.01868
Min length: 11

Characters and Unicode

Total characters: 9651
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 710
Unique (%): 88.4%

Sample

1st row: 051-311-1792
2nd row: 051-262-6912
3rd row: 051-207-9494
4th row: 051-264-7249
5th row: 051-264-4724
Value | Count | Frequency (%)
051-412-0646 | 4 | 0.5%
051-294-8500 | 4 | 0.5%
051-261-4979 | 3 | 0.4%
051-207-4674 | 3 | 0.4%
051-979-1601 | 3 | 0.4%
051-265-9129 | 2 | 0.2%
051-262-7158 | 2 | 0.2%
051-987-4893 | 2 | 0.2%
051-264-2526 | 2 | 0.2%
051-832-0239 | 2 | 0.2%
Other values (743) | 776 | 96.6%

Most occurring characters

ValueCountFrequency (%)
- 1606
16.6%
0 1336
13.8%
1 1291
13.4%
5 1226
12.7%
2 1101
11.4%
6 891
9.2%
4 520
 
5.4%
3 518
 
5.4%
7 388
 
4.0%
9 387
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 8045
83.4%
Dash Punctuation 1606
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1336
16.6%
1 1291
16.0%
5 1226
15.2%
2 1101
13.7%
6 891
11.1%
4 520
 
6.5%
3 518
 
6.4%
7 388
 
4.8%
9 387
 
4.8%
8 387
 
4.8%
Dash Punctuation
ValueCountFrequency (%)
- 1606
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 9651
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1606
16.6%
0 1336
13.8%
1 1291
13.4%
5 1226
12.7%
2 1101
11.4%
6 891
9.2%
4 520
 
5.4%
3 518
 
5.4%
7 388
 
4.0%
9 387
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9651
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1606
16.6%
0 1336
13.8%
1 1291
13.4%
5 1226
12.7%
2 1101
11.4%
6 891
9.2%
4 520
 
5.4%
3 518
 
5.4%
7 388
 
4.0%
9 387
 
4.0%

업종명 (industry name)
Text

Distinct: 398
Distinct (%): 46.4%
Missing: 1
Missing (%): 0.1%
Memory size: 6.8 KiB

Length

Max length: 34
Median length: 28
Mean length: 17.362471
Min length: 5

Characters and Unicode

Total characters: 14897
Distinct characters: 291
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 268
Unique (%): 31.2%

Sample

1st row: 기타 음향기기 제조업 외 2 종
2nd row: 남자용 겉옷 제조업 외 24 종
3rd row: 수산동물 훈제, 조리 및 유사 조제식품 제조업
4th row: 기타 곡물 가공품 제조업
5th row: 기타 가공 공작기계 제조업
ValueCountFrequency (%)
제조업 735
 
15.0%
472
 
9.7%
382
 
7.8%
368
 
7.5%
기타 191
 
3.9%
1 186
 
3.8%
수산동물 105
 
2.1%
90
 
1.8%
부분품 87
 
1.8%
선박 86
 
1.8%
Other values (430) 2187
44.7%

Most occurring characters

ValueCountFrequency (%)
4031
27.1%
950
 
6.4%
895
 
6.0%
864
 
5.8%
477
 
3.2%
415
 
2.8%
390
 
2.6%
369
 
2.5%
363
 
2.4%
250
 
1.7%
Other values (281) 5893
39.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10331
69.3%
Space Separator 4031
 
27.1%
Decimal Number 410
 
2.8%
Other Punctuation 121
 
0.8%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
950
 
9.2%
895
 
8.7%
864
 
8.4%
477
 
4.6%
415
 
4.0%
390
 
3.8%
369
 
3.6%
363
 
3.5%
250
 
2.4%
198
 
1.9%
Other values (266) 5160
49.9%
Decimal Number
ValueCountFrequency (%)
1 210
51.2%
2 71
 
17.3%
3 56
 
13.7%
4 24
 
5.9%
5 24
 
5.9%
6 8
 
2.0%
0 6
 
1.5%
8 5
 
1.2%
7 5
 
1.2%
9 1
 
0.2%
Other Punctuation
ValueCountFrequency (%)
, 118
97.5%
. 3
 
2.5%
Space Separator
ValueCountFrequency (%)
4031
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10331
69.3%
Common 4566
30.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
950
 
9.2%
895
 
8.7%
864
 
8.4%
477
 
4.6%
415
 
4.0%
390
 
3.8%
369
 
3.6%
363
 
3.5%
250
 
2.4%
198
 
1.9%
Other values (266) 5160
49.9%
Common
ValueCountFrequency (%)
4031
88.3%
1 210
 
4.6%
, 118
 
2.6%
2 71
 
1.6%
3 56
 
1.2%
4 24
 
0.5%
5 24
 
0.5%
6 8
 
0.2%
0 6
 
0.1%
8 5
 
0.1%
Other values (5) 13
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10327
69.3%
ASCII 4566
30.7%
Compat Jamo 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4031
88.3%
1 210
 
4.6%
, 118
 
2.6%
2 71
 
1.6%
3 56
 
1.2%
4 24
 
0.5%
5 24
 
0.5%
6 8
 
0.2%
0 6
 
0.1%
8 5
 
0.1%
Other values (5) 13
 
0.3%
Hangul
ValueCountFrequency (%)
950
 
9.2%
895
 
8.7%
864
 
8.4%
477
 
4.6%
415
 
4.0%
390
 
3.8%
369
 
3.6%
363
 
3.5%
250
 
2.4%
198
 
1.9%
Other values (265) 5156
49.9%
Compat Jamo
ValueCountFrequency (%)
4
100.0%

생산품 (products)
Text

Distinct: 768
Distinct (%): 89.8%
Missing: 4
Missing (%): 0.5%
Memory size: 6.8 KiB

Length

Max length: 106
Median length: 39
Mean length: 8.722807
Min length: 1

Characters and Unicode

Total characters: 7458
Distinct characters: 514
Distinct categories: 9
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 714
Unique (%): 83.5%

Sample

1st row: 구내방송장치, CCTV, CCTV브라켓
2nd row: 의류
3rd row: 냉동어개류(어류)
4th row: 참깨, 들깨
5th row: 기계제작
ValueCountFrequency (%)
49
 
3.0%
48
 
2.9%
선박 24
 
1.5%
선박용 18
 
1.1%
부품 16
 
1.0%
자동차부품 15
 
0.9%
밸브 15
 
0.9%
배전반 14
 
0.9%
제조 13
 
0.8%
수산물 13
 
0.8%
Other values (1012) 1410
86.2%

Most occurring characters

ValueCountFrequency (%)
783
 
10.5%
, 406
 
5.4%
246
 
3.3%
240
 
3.2%
160
 
2.1%
139
 
1.9%
123
 
1.6%
120
 
1.6%
119
 
1.6%
117
 
1.6%
Other values (504) 5005
67.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5842
78.3%
Space Separator 783
 
10.5%
Other Punctuation 419
 
5.6%
Lowercase Letter 171
 
2.3%
Uppercase Letter 139
 
1.9%
Open Punctuation 50
 
0.7%
Close Punctuation 50
 
0.7%
Decimal Number 2
 
< 0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
246
 
4.2%
240
 
4.1%
160
 
2.7%
139
 
2.4%
123
 
2.1%
120
 
2.1%
119
 
2.0%
117
 
2.0%
113
 
1.9%
112
 
1.9%
Other values (449) 4353
74.5%
Lowercase Letter
ValueCountFrequency (%)
e 20
11.7%
r 17
 
9.9%
t 17
 
9.9%
o 14
 
8.2%
i 13
 
7.6%
a 11
 
6.4%
n 11
 
6.4%
b 8
 
4.7%
c 8
 
4.7%
l 8
 
4.7%
Other values (14) 44
25.7%
Uppercase Letter
ValueCountFrequency (%)
C 22
15.8%
E 16
11.5%
T 13
9.4%
S 11
7.9%
A 9
 
6.5%
V 9
 
6.5%
P 9
 
6.5%
D 8
 
5.8%
L 8
 
5.8%
B 7
 
5.0%
Other values (12) 27
19.4%
Other Punctuation
ValueCountFrequency (%)
, 406
96.9%
. 8
 
1.9%
/ 4
 
1.0%
& 1
 
0.2%
Space Separator
ValueCountFrequency (%)
783
100.0%
Open Punctuation
ValueCountFrequency (%)
( 50
100.0%
Close Punctuation
ValueCountFrequency (%)
) 50
100.0%
Decimal Number
ValueCountFrequency (%)
2 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5841
78.3%
Common 1306
 
17.5%
Latin 310
 
4.2%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
246
 
4.2%
240
 
4.1%
160
 
2.7%
139
 
2.4%
123
 
2.1%
120
 
2.1%
119
 
2.0%
117
 
2.0%
113
 
1.9%
112
 
1.9%
Other values (448) 4352
74.5%
Latin
ValueCountFrequency (%)
C 22
 
7.1%
e 20
 
6.5%
r 17
 
5.5%
t 17
 
5.5%
E 16
 
5.2%
o 14
 
4.5%
i 13
 
4.2%
T 13
 
4.2%
a 11
 
3.5%
n 11
 
3.5%
Other values (36) 156
50.3%
Common
ValueCountFrequency (%)
783
60.0%
, 406
31.1%
( 50
 
3.8%
) 50
 
3.8%
. 8
 
0.6%
/ 4
 
0.3%
2 2
 
0.2%
- 2
 
0.2%
& 1
 
0.1%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5841
78.3%
ASCII 1616
 
21.7%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
783
48.5%
, 406
25.1%
( 50
 
3.1%
) 50
 
3.1%
C 22
 
1.4%
e 20
 
1.2%
r 17
 
1.1%
t 17
 
1.1%
E 16
 
1.0%
o 14
 
0.9%
Other values (45) 221
 
13.7%
Hangul
ValueCountFrequency (%)
246
 
4.2%
240
 
4.1%
160
 
2.7%
139
 
2.4%
123
 
2.1%
120
 
2.1%
119
 
2.0%
117
 
2.0%
113
 
1.9%
112
 
1.9%
Other values (448) 4352
74.5%
CJK
ValueCountFrequency (%)
1
100.0%

데이터 기준일자 (data reference date)
Date

CONSTANT

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.8 KiB
Minimum: 2023-05-31 00:00:00
Maximum: 2023-05-31 00:00:00
Histogram with fixed size bins (bins=1)

Interactions

[Figure: interaction scatter plots for the numeric variables 연번 and 종업원수]

Correlations

Variable | 연번 | 종업원수
연번 | 1.000 | 0.184
종업원수 | 0.184 | 1.000

Variable | 연번 | 종업원수
연번 | 1.000 | -0.178
종업원수 | -0.178 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
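
One common way to compute a nullity correlation is to replace each column by a present/missing indicator and take the Pearson correlation of the indicators. The sketch below follows that approach on toy data. The helper names and the toy columns are hypothetical, and the report does not state exactly how its heatmap was computed.

```python
import math


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return float("nan")  # undefined when a column is all present or all missing
    return cov / math.sqrt(var_x * var_y)


def nullity_correlation(col_a, col_b):
    """Correlate the present (1) / missing (0) indicators of two columns."""
    present_a = [0 if value is None else 1 for value in col_a]
    present_b = [0 if value is None else 1 for value in col_b]
    return pearson(present_a, present_b)


# Toy columns (hypothetical values, not taken from the dataset); None marks a missing cell.
phone = ["p1", None, "p3", None, "p5", "p6"]
fax = ["f1", None, "f3", "f4", None, "f6"]

r = nullity_correlation(phone, fax)
print(r)
```

A value near 1 means the two columns tend to be missing in the same rows, a value near -1 means one tends to be present where the other is missing, and a value near 0 means no linear relationship between their missingness.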

Sample

First rows

(index) | 연번 | 회사명 | 주소 | 종업원수 | 전화번호 | 팩스번호 | 업종명 | 생산품 | 데이터 기준일자
0 | 1 | (사)골든장애인협회 사업단 | 부산광역시 사하구 보덕포1길 100, B동, 지하1층 (장림동) | 0 | 051-311-1791 | 051-311-1792 | 기타 음향기기 제조업 외 2 종 | 구내방송장치, CCTV, CCTV브라켓 | 2023-05-31
1 | 2 | (사)하나복지회 피복사업장 | 부산광역시 사하구 다대로354번안길 20, 2층(장림동) | 10 | 051-262-6911 | 051-262-6912 | 남자용 겉옷 제조업 외 24 종 | 의류 | 2023-05-31
2 | 3 | (주) 예맥통상 | 부산광역시 사하구 비봉로 56 (신평동) | 17 | 051-207-9393 | 051-207-9494 | 수산동물 훈제, 조리 및 유사 조제식품 제조업 | 냉동어개류(어류) | 2023-05-31
3 | 4 | (주)HN푸드 | 부산광역시 사하구 하신중앙로27번길 15 (장림동) | 2 | 051-263-7249 | 051-264-7249 | 기타 곡물 가공품 제조업 | 참깨, 들깨 | 2023-05-31
4 | 5 | (주)Y2산업 | 부산광역시 사하구 장림로 100 (장림동) | 9 | 051-264-4714 | 051-264-4724 | 기타 가공 공작기계 제조업 | 기계제작 | 2023-05-31
5 | 6 | (주)강남 | 부산광역시 사하구 구평로16번길 71 (구평동) 외 18필지 | 320 | 051-260-6000 | 051-262-7400 | 강선 건조업 외 4 종 | FRP선박건조,강선수리,프랜트제작,비철금속선,보트 | 2023-05-31
6 | 7 | (주)경기색소 | 부산광역시 사하구 을숙도대로 526 (신평동) | 40 | 051-291-0265 | 051-203-0178 | 염료, 조제 무기안료, 유연제 및 기타 착색제 제조업 | 안료 | 2023-05-31
7 | 8 | (주)경맥 | 부산광역시 사하구 다대로354번길 71 (장림동) | 20 | 051-292-8234 | 051-208-7002 | 기타 수산동물 가공 및 저장 처리업 | 자숙문어 | 2023-05-31
8 | 9 | (주)경진하네스산업 | 부산광역시 사하구 장평로41번길 36 (장림동) | 32 | 051-264-1184 | 051-264-4381 | 자동차용 신품 전기장치 제조업 외 1 종 | 와이어링 | 2023-05-31
9 | 10 | (주)광명광고공사 | 부산광역시 사하구 낙동대로 246-1 (괴정동) | 3 | 051-202-8897 | 051-203-8897 | 간판 및 광고물 제조업 | 간판,광고물 | 2023-05-31

Last rows

(index) | 연번 | 회사명 | 주소 | 종업원수 | 전화번호 | 팩스번호 | 업종명 | 생산품 | 데이터 기준일자
849 | 850 | 화성 | 부산광역시 사하구 하신중앙로27번길 17, 1동 2층 1 (장림동, 에이스밀) | 4 | <NA> | <NA> | 배전반 및 전기 자동제어반 제조업 | 배전반 | 2023-05-31
850 | 851 | 화인.캡 | 부산광역시 사하구 다대로354번안길 80 (장림동) | 2 | 051-262-1141 | 051-262-1173 | 기타 식품 첨가물 제조업 외 1 종 | 주방용세척제 | 2023-05-31
851 | 852 | 화인골드마린스 | 부산광역시 사하구 원양로 407-11 (감천동) | 15 | 051-205-6928 | 051-205-0510 | 기타 수산동물 가공 및 저장 처리업 | 오징어 제품외 | 2023-05-31
852 | 853 | 화진물산 | 부산광역시 사하구 하신중앙로54번길 28 (장림동) | 4 | 051-261-3710 | 051-261-3530 | 생물학적 제제 제조업 외 4 종 | 탈취제, 비료 | 2023-05-31
853 | 854 | 효동전기 | 부산광역시 사하구 구평로 7 (구평동) | 2 | 051-416-6159 | 051-416-6591 | 선박 구성 부분품 제조업 | 판넬, 선박전기 | 2023-05-31
854 | 855 | 효진정밀 | 부산광역시 사하구 장평로190번길 25 (장림동) | 17 | 051-266-1697 | 051-266-1690 | 금속 위생용품 제조업 | 금속파스너 및 나사제품 제조품 | 2023-05-31
855 | 856 | 효창프라스틱 | 부산광역시 사하구 다대로354번길 55 (장림동) | 20 | 051-266-5290 | 051-266-5266 | 그 외 자동차용 신품 부품 제조업 외 4 종 | 자동차용 시계렌즈 | 2023-05-31
856 | 857 | 흥성공업사 | 부산광역시 사하구 장평로138번길 17-2 (장림동) | 8 | 051-264-5825 | 051-266-5822 | 그 외 자동차용 신품 부품 제조업 외 3 종 | 자동차부품 | 2023-05-31
857 | 858 | 흥진산업 | 부산광역시 사하구 비봉로21번길 25 (신평동) | 5 | 051-206-5673 | 051-206-5674 | 식품 위생용 종이 상자 및 용기 제조업 외 1 종 | 색지,포장지 | 2023-05-31
858 | 859 | 힐튼디자인 | 부산광역시 사하구 하신중앙로27번길 17, 410호(장림동, 에이스밀) | 4 | 070-7578-0370 | 050-8090-0370 | 소파 및 기타 내장가구 제조업 | 소파 | 2023-05-31