Overview

Dataset statistics

Number of variables: 8
Number of observations: 1087
Missing cells: 1199
Missing cells (%): 13.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 70.2 KiB
Average record size in memory: 66.1 B
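
The percentage and per-record figures in this table follow from the raw counts. A minimal sketch of the arithmetic, assuming the missing-cell percentage is taken over all cells (variables × observations) and that 1 KiB = 1024 B; the variable names are illustrative only:

```python
# Figures copied from the Dataset statistics table.
n_variables = 8
n_observations = 1087
missing_cells = 1199
total_size_kib = 70.2

total_cells = n_variables * n_observations            # every cell in the table
missing_pct = 100 * missing_cells / total_cells        # share of empty cells
avg_record_bytes = total_size_kib * 1024 / n_observations  # assumes 1 KiB = 1024 B

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```

Both results round to the values reported in the table, which suggests the report uses the same definitions.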

Variable types

Numeric: 2
Text: 6

Dataset

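Description: Status of building-sector (건축분야) manufacturers in Gangwon Special Self-Governing Province registered in the Factory Establishment Online Support System (공장설립온라인지원시스템), as of 2024-02-08.
Author: Gangwon Special Self-Governing Province (강원특별자치도)
URL: https://www.data.go.kr/data/15126661/fileData.do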

Alerts

연번 is highly overall correlated with 대표업종번호 [High correlation]
대표업종번호 is highly overall correlated with 연번 [High correlation]
공장대표주소(도로명) has 33 (3.0%) missing values [Missing]
전화번호 has 135 (12.4%) missing values [Missing]
공장홈페이지 has 1031 (94.8%) missing values [Missing]
연번 has unique values [Unique]
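
The missing-value alerts can be cross-checked against the Missing cells total in the dataset statistics. A small sketch, assuming the percentages are computed per column over the 1087 observations and that the columns without an alert have no missing values (their per-variable sections report Missing 0):

```python
n_observations = 1087

# Missing counts copied from the alerts; unlisted columns are assumed to have none.
missing_by_column = {
    "공장대표주소(도로명)": 33,
    "전화번호": 135,
    "공장홈페이지": 1031,
}

missing_pct = {
    column: round(100 * count / n_observations, 1)
    for column, count in missing_by_column.items()
}
total_missing = sum(missing_by_column.values())

print(missing_pct)
print(total_missing)
```

The three counts add up to the 1199 missing cells reported for the whole dataset.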

Reproduction

Analysis started: 2024-03-14 23:09:17.013858
Analysis finished: 2024-03-14 23:09:20.029496
Duration: 3.02 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
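
The reported duration is the difference between the two timestamps. A short sketch using only the standard library, with the timestamps copied from this section:

```python
from datetime import datetime

started = datetime.fromisoformat("2024-03-14 23:09:17.013858")
finished = datetime.fromisoformat("2024-03-14 23:09:20.029496")

# Elapsed wall-clock time between the start and end of the analysis.
duration_seconds = (finished - started).total_seconds()
print(f"Duration: {duration_seconds:.2f} seconds")
```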

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 1087
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 544
Minimum: 1
Maximum: 1087
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 9.7 KiB

Quantile statistics

Minimum: 1
5-th percentile: 55.3
Q1: 272.5
Median: 544
Q3: 815.5
95-th percentile: 1032.7
Maximum: 1087
Range: 1086
Interquartile range (IQR): 543

Descriptive statistics

Standard deviation: 313.93418
Coefficient of variation (CV): 0.57708488
Kurtosis: -1.2
Mean: 544
Median Absolute Deviation (MAD): 272
Skewness: 0
Sum: 591328
Variance: 98554.667
Monotonicity: Strictly increasing
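
These descriptive statistics can be reproduced with the standard library. The sketch below assumes that 연번 is exactly the sequence 1, 2, ..., 1087 (consistent with the minimum, maximum, distinct count, and strict monotonicity reported here) and that the variance uses the sample (n - 1) denominator:

```python
import statistics

values = list(range(1, 1088))  # assumption: 연번 runs 1..1087 with no gaps

mean = statistics.mean(values)
total = sum(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std_dev = statistics.stdev(values)       # square root of the sample variance
cv = std_dev / mean                      # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median distance of each value from the median.
mad = statistics.median(abs(v - median) for v in values)

print(mean, total, round(variance, 3), round(std_dev, 5), round(cv, 8), mad)
```

Under these assumptions the results agree with the reported mean, sum, variance, standard deviation, CV, and MAD.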
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 1 | 0.1%
724 | 1 | 0.1%
730 | 1 | 0.1%
729 | 1 | 0.1%
728 | 1 | 0.1%
727 | 1 | 0.1%
726 | 1 | 0.1%
725 | 1 | 0.1%
723 | 1 | 0.1%
2 | 1 | 0.1%
Other values (1077) | 1077 | 99.1%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.1%
2 | 1 | 0.1%
3 | 1 | 0.1%
4 | 1 | 0.1%
5 | 1 | 0.1%
6 | 1 | 0.1%
7 | 1 | 0.1%
8 | 1 | 0.1%
9 | 1 | 0.1%
10 | 1 | 0.1%

Maximum 10 values
Value | Count | Frequency (%)
1087 | 1 | 0.1%
1086 | 1 | 0.1%
1085 | 1 | 0.1%
1084 | 1 | 0.1%
1083 | 1 | 0.1%
1082 | 1 | 0.1%
1081 | 1 | 0.1%
1080 | 1 | 0.1%
1079 | 1 | 0.1%
1078 | 1 | 0.1%

회사명 (company name)
Text

Distinct: 1034
Distinct (%): 95.1%
Missing: 0
Missing (%): 0.0%
Memory size: 8.6 KiB

Length

Max length: 21
Median length: 18
Mean length: 7.5731371
Min length: 2

Characters and Unicode

Total characters: 8232
Distinct characters: 429
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
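
As an illustration of these character properties, the sketch below looks up the Unicode general category of each character in the first sample value of this variable using Python's `unicodedata` module. The two-letter codes correspond to the category names used in this report: `Lo` is Other Letter, `Ps` is Open Punctuation, and `Pe` is Close Punctuation. This is only an illustration and is not necessarily how the profiling tool computes its tables.

```python
import unicodedata
from collections import Counter

name = "(주)우리창씨앤비"  # first sample row of this variable

# Print the general category and official name of every character.
for ch in name:
    print(ch, unicodedata.category(ch), unicodedata.name(ch))

# Tally characters per general category, as in the category tables.
category_counts = Counter(unicodedata.category(ch) for ch in name)
print(category_counts)
```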

Unique

Unique: 984
Unique (%): 90.5%

Sample

1st row: (주)우리창씨앤비
2nd row: 강원블라인드
3rd row: 뉴스타 커텐
4th row: 선엔지니어링(주)
5th row: 스타종합커텐

Value | Count | Frequency (%)
주식회사 | 206 | 15.1%
합자회사 | 8 | 0.6%
제2공장 | 6 | 0.4%
주)영남유리산업 | 3 | 0.2%
나무나라 | 3 | 0.2%
한라시멘트주식회사 | 3 | 0.2%
주)미림테크 | 3 | 0.2%
영월공장 | 3 | 0.2%
주)뉴보텍 | 3 | 0.2%
시에라대창목재 | 2 | 0.1%
Other values (1057) | 1124 | 82.4%

Most occurring characters

ValueCountFrequency (%)
754
 
9.2%
( 527
 
6.4%
) 527
 
6.4%
301
 
3.7%
283
 
3.4%
261
 
3.2%
247
 
3.0%
205
 
2.5%
192
 
2.3%
181
 
2.2%
Other values (419) 4754
57.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6783
82.4%
Open Punctuation 527
 
6.4%
Close Punctuation 527
 
6.4%
Space Separator 283
 
3.4%
Uppercase Letter 74
 
0.9%
Decimal Number 16
 
0.2%
Lowercase Letter 10
 
0.1%
Other Punctuation 8
 
0.1%
Other Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
754
 
11.1%
301
 
4.4%
261
 
3.8%
247
 
3.6%
205
 
3.0%
192
 
2.8%
181
 
2.7%
163
 
2.4%
98
 
1.4%
91
 
1.3%
Other values (380) 4290
63.2%
Uppercase Letter
ValueCountFrequency (%)
S 9
12.2%
E 7
 
9.5%
N 7
 
9.5%
G 6
 
8.1%
K 5
 
6.8%
P 5
 
6.8%
M 5
 
6.8%
O 5
 
6.8%
C 4
 
5.4%
D 3
 
4.1%
Other values (11) 18
24.3%
Lowercase Letter
ValueCountFrequency (%)
t 2
20.0%
i 1
10.0%
n 1
10.0%
h 1
10.0%
g 1
10.0%
l 1
10.0%
a 1
10.0%
e 1
10.0%
u 1
10.0%
Other Punctuation
ValueCountFrequency (%)
. 4
50.0%
& 3
37.5%
· 1
 
12.5%
Decimal Number
ValueCountFrequency (%)
2 11
68.8%
1 5
31.2%
Open Punctuation
ValueCountFrequency (%)
( 527
100.0%
Close Punctuation
ValueCountFrequency (%)
) 527
100.0%
Space Separator
ValueCountFrequency (%)
283
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6787
82.4%
Common 1361
 
16.5%
Latin 84
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
754
 
11.1%
301
 
4.4%
261
 
3.8%
247
 
3.6%
205
 
3.0%
192
 
2.8%
181
 
2.7%
163
 
2.4%
98
 
1.4%
91
 
1.3%
Other values (381) 4294
63.3%
Latin
ValueCountFrequency (%)
S 9
 
10.7%
E 7
 
8.3%
N 7
 
8.3%
G 6
 
7.1%
K 5
 
6.0%
P 5
 
6.0%
M 5
 
6.0%
O 5
 
6.0%
C 4
 
4.8%
D 3
 
3.6%
Other values (20) 28
33.3%
Common
ValueCountFrequency (%)
( 527
38.7%
) 527
38.7%
283
20.8%
2 11
 
0.8%
1 5
 
0.4%
. 4
 
0.3%
& 3
 
0.2%
· 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6783
82.4%
ASCII 1444
 
17.5%
None 5
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
754
 
11.1%
301
 
4.4%
261
 
3.8%
247
 
3.6%
205
 
3.0%
192
 
2.8%
181
 
2.7%
163
 
2.4%
98
 
1.4%
91
 
1.3%
Other values (380) 4290
63.2%
ASCII
ValueCountFrequency (%)
( 527
36.5%
) 527
36.5%
283
19.6%
2 11
 
0.8%
S 9
 
0.6%
E 7
 
0.5%
N 7
 
0.5%
G 6
 
0.4%
1 5
 
0.3%
K 5
 
0.3%
Other values (27) 57
 
3.9%
None
ValueCountFrequency (%)
4
80.0%
· 1
 
20.0%

공장대표주소(도로명) (main factory address, road name)
Text

MISSING 

Distinct: 995
Distinct (%): 94.4%
Missing: 33
Missing (%): 3.0%
Memory size: 8.6 KiB

Length

Max length: 64
Median length: 52
Mean length: 29.368121
Min length: 15

Characters and Unicode

Total characters: 30954
Distinct characters: 362
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 948
Unique (%): 89.9%

Sample

1st row: 강원특별자치도 원주시 소초면 노루고개길 102
2nd row: 강원특별자치도 원주시 소초면 장수1로 289
3rd row: 강원특별자치도 동해시 대동로 109-2, 483-2번지 1층 (구미동)
4th row: 강원특별자치도 춘천시 퇴계공단1길 58 (퇴계동, 정안사)
5th row: 강원특별자치도 강릉시 강릉대로 180-1 (교동)

Value | Count | Frequency (%)
강원특별자치도 | 1053 | 17.1%
원주시 | 274 | 4.5%
[unreadable] | 159 | 2.6%
춘천시 | 158 | 2.6%
동해시 | 99 | 1.6%
강릉시 | 97 | 1.6%
1필지 | 82 | 1.3%
횡성군 | 73 | 1.2%
1동 | 71 | 1.2%
구호동 | 70 | 1.1%
Other values (1557) | 4005 | 65.2%

Most occurring characters

ValueCountFrequency (%)
5088
 
16.4%
1433
 
4.6%
1246
 
4.0%
1080
 
3.5%
1075
 
3.5%
1071
 
3.5%
1055
 
3.4%
1054
 
3.4%
1 977
 
3.2%
732
 
2.4%
Other values (352) 16143
52.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 19795
63.9%
Space Separator 5088
 
16.4%
Decimal Number 4430
 
14.3%
Close Punctuation 470
 
1.5%
Open Punctuation 470
 
1.5%
Dash Punctuation 439
 
1.4%
Other Punctuation 214
 
0.7%
Uppercase Letter 43
 
0.1%
Other Symbol 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1433
 
7.2%
1246
 
6.3%
1080
 
5.5%
1075
 
5.4%
1071
 
5.4%
1055
 
5.3%
1054
 
5.3%
732
 
3.7%
711
 
3.6%
546
 
2.8%
Other values (319) 9792
49.5%
Uppercase Letter
ValueCountFrequency (%)
C 8
18.6%
A 7
16.3%
D 6
14.0%
B 4
9.3%
E 4
9.3%
N 2
 
4.7%
O 2
 
4.7%
S 2
 
4.7%
I 2
 
4.7%
L 1
 
2.3%
Other values (5) 5
11.6%
Decimal Number
ValueCountFrequency (%)
1 977
22.1%
2 688
15.5%
3 480
10.8%
4 380
 
8.6%
5 369
 
8.3%
6 344
 
7.8%
7 333
 
7.5%
0 330
 
7.4%
8 288
 
6.5%
9 241
 
5.4%
Other Punctuation
ValueCountFrequency (%)
, 208
97.2%
. 4
 
1.9%
& 2
 
0.9%
Space Separator
ValueCountFrequency (%)
5088
100.0%
Close Punctuation
ValueCountFrequency (%)
) 470
100.0%
Open Punctuation
ValueCountFrequency (%)
( 470
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 439
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 19800
64.0%
Common 11111
35.9%
Latin 43
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1433
 
7.2%
1246
 
6.3%
1080
 
5.5%
1075
 
5.4%
1071
 
5.4%
1055
 
5.3%
1054
 
5.3%
732
 
3.7%
711
 
3.6%
546
 
2.8%
Other values (320) 9797
49.5%
Common
ValueCountFrequency (%)
5088
45.8%
1 977
 
8.8%
2 688
 
6.2%
3 480
 
4.3%
) 470
 
4.2%
( 470
 
4.2%
- 439
 
4.0%
4 380
 
3.4%
5 369
 
3.3%
6 344
 
3.1%
Other values (7) 1406
 
12.7%
Latin
ValueCountFrequency (%)
C 8
18.6%
A 7
16.3%
D 6
14.0%
B 4
9.3%
E 4
9.3%
N 2
 
4.7%
O 2
 
4.7%
S 2
 
4.7%
I 2
 
4.7%
L 1
 
2.3%
Other values (5) 5
11.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 19795
63.9%
ASCII 11154
36.0%
None 5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5088
45.6%
1 977
 
8.8%
2 688
 
6.2%
3 480
 
4.3%
) 470
 
4.2%
( 470
 
4.2%
- 439
 
3.9%
4 380
 
3.4%
5 369
 
3.3%
6 344
 
3.1%
Other values (22) 1449
 
13.0%
Hangul
ValueCountFrequency (%)
1433
 
7.2%
1246
 
6.3%
1080
 
5.5%
1075
 
5.4%
1071
 
5.4%
1055
 
5.3%
1054
 
5.3%
732
 
3.7%
711
 
3.6%
546
 
2.8%
Other values (319) 9792
49.5%
None
ValueCountFrequency (%)
5
100.0%

대표업종번호 (primary industry code)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 81
Distinct (%): 7.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 23348.453
Minimum: 13223
Maximum: 72922
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 9.7 KiB

Quantile statistics

Minimum: 13223
5-th percentile: 16101
Q1: 22211
Median: 23325
Q3: 25112
95-th percentile: 25119
Maximum: 72922
Range: 59699
Interquartile range (IQR): 2901

Descriptive statistics

Standard deviation: 8113.9339
Coefficient of variation (CV): 0.34751484
Kurtosis: 26.357104
Mean: 23348.453
Median Absolute Deviation (MAD): 1786
Skewness: 4.6896273
Sum: 25379768
Variance: 65835923
Monotonicity: Increasing
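
The monotonicity entry here reads "Increasing", whereas 연번 is reported as "Strictly increasing". The difference is whether repeated neighbouring values are allowed. A minimal sketch of that distinction, using a hypothetical helper `is_increasing` (not the profiling tool's own code) and the first ten 대표업종번호 values listed in the Sample section of this report:

```python
def is_increasing(values, strict=False):
    """Return True if each value is >= (or >, when strict) the one before it."""
    pairs = list(zip(values, values[1:]))
    if strict:
        return all(a < b for a, b in pairs)
    return all(a <= b for a, b in pairs)

# First ten 대표업종번호 values from the Sample section: five 13223, then five 13224.
codes = [13223] * 5 + [13224] * 5

print(is_increasing(codes))               # repeats allowed
print(is_increasing(codes, strict=True))  # repeats not allowed
```

Because many rows share the same industry code, the column can be increasing without being strictly increasing.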
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
25112 | 140 | 12.9%
25113 | 78 | 7.2%
25111 | 77 | 7.1%
16101 | 70 | 6.4%
23324 | 55 | 5.1%
22223 | 41 | 3.8%
23991 | 36 | 3.3%
16102 | 35 | 3.2%
25119 | 35 | 3.2%
23325 | 33 | 3.0%
Other values (71) | 487 | 44.8%

Minimum 10 values
Value | Count | Frequency (%)
13223 | 5 | 0.5%
13224 | 10 | 0.9%
13225 | 2 | 0.2%
13992 | 3 | 0.3%
14192 | 6 | 0.6%
16101 | 70 | 6.4%
16102 | 35 | 3.2%
16103 | 1 | 0.1%
16211 | 1 | 0.1%
16212 | 12 | 1.1%

Maximum 10 values
Value | Count | Frequency (%)
72922 | 1 | 0.1%
72921 | 1 | 0.1%
72129 | 5 | 0.5%
72121 | 5 | 0.5%
72112 | 3 | 0.3%
72111 | 9 | 0.8%
41225 | 2 | 0.2%
25119 | 35 | 3.2%
25113 | 78 | 7.2%
25112 | 140 | 12.9%

업종명 (industry name)
Text

Distinct: 274
Distinct (%): 25.2%
Missing: 0
Missing (%): 0.0%
Memory size: 8.6 KiB

Length

Max length: 35
Median length: 27
Mean length: 20.334867
Min length: 3

Characters and Unicode

Total characters: 22104
Distinct characters: 179
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 136
Unique (%): 12.5%

Sample

1st row: 커튼 및 유사제품 제조업
2nd row: 커튼 및 유사제품 제조업
3rd row: 커튼 및 유사제품 제조업
4th row: 커튼 및 유사제품 제조업 외 6 종
5th row: 커튼 및 유사제품 제조업

Value | Count | Frequency (%)
제조업 | 960 | 13.3%
[unreadable] | 637 | 8.8%
[unreadable] | 579 | 8.0%
[unreadable] | 548 | 7.6%
금속 | 295 | 4.1%
기타 | 249 | 3.4%
구조용 | 208 | 2.9%
1 | 207 | 2.9%
콘크리트 | 163 | 2.3%
플라스틱 | 143 | 2.0%
Other values (172) | 3255 | 44.9%

Most occurring characters

ValueCountFrequency (%)
6157
27.9%
1623
 
7.3%
1340
 
6.1%
1114
 
5.0%
637
 
2.9%
580
 
2.6%
555
 
2.5%
550
 
2.5%
402
 
1.8%
396
 
1.8%
Other values (169) 8750
39.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14989
67.8%
Space Separator 6157
27.9%
Decimal Number 606
 
2.7%
Other Punctuation 352
 
1.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1623
 
10.8%
1340
 
8.9%
1114
 
7.4%
637
 
4.2%
580
 
3.9%
555
 
3.7%
550
 
3.7%
402
 
2.7%
396
 
2.6%
350
 
2.3%
Other values (157) 7442
49.6%
Decimal Number
ValueCountFrequency (%)
1 275
45.4%
2 105
 
17.3%
3 75
 
12.4%
4 49
 
8.1%
5 38
 
6.3%
6 24
 
4.0%
8 14
 
2.3%
0 10
 
1.7%
7 8
 
1.3%
9 8
 
1.3%
Space Separator
ValueCountFrequency (%)
6157
100.0%
Other Punctuation
ValueCountFrequency (%)
, 352
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14989
67.8%
Common 7115
32.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1623
 
10.8%
1340
 
8.9%
1114
 
7.4%
637
 
4.2%
580
 
3.9%
555
 
3.7%
550
 
3.7%
402
 
2.7%
396
 
2.6%
350
 
2.3%
Other values (157) 7442
49.6%
Common
ValueCountFrequency (%)
6157
86.5%
, 352
 
4.9%
1 275
 
3.9%
2 105
 
1.5%
3 75
 
1.1%
4 49
 
0.7%
5 38
 
0.5%
6 24
 
0.3%
8 14
 
0.2%
0 10
 
0.1%
Other values (2) 16
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14989
67.8%
ASCII 7115
32.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6157
86.5%
, 352
 
4.9%
1 275
 
3.9%
2 105
 
1.5%
3 75
 
1.1%
4 49
 
0.7%
5 38
 
0.5%
6 24
 
0.3%
8 14
 
0.2%
0 10
 
0.1%
Other values (2) 16
 
0.2%
Hangul
ValueCountFrequency (%)
1623
 
10.8%
1340
 
8.9%
1114
 
7.4%
637
 
4.2%
580
 
3.9%
555
 
3.7%
550
 
3.7%
402
 
2.7%
396
 
2.6%
350
 
2.3%
Other values (157) 7442
49.6%

전화번호 (phone number)
Text

MISSING 

Distinct: 873
Distinct (%): 91.7%
Missing: 135
Missing (%): 12.4%
Memory size: 8.6 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.008403
Min length: 9

Characters and Unicode

Total characters: 11432
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 800
Unique (%): 84.0%

Sample

1st row: 033-734-1284
2nd row: 033-747-8671
3rd row: 033-522-3296
4th row: 033-264-0945
5th row: 033-648-7709

Value | Count | Frequency (%)
033-731-7661 | 3 | 0.3%
033-766-2933 | 3 | 0.3%
033-734-6001 | 3 | 0.3%
033-581-5654 | 3 | 0.3%
033-521-8400 | 3 | 0.3%
033-534-1000 | 3 | 0.3%
033-522-3980 | 2 | 0.2%
033-257-1935 | 2 | 0.2%
041-753-9060 | 2 | 0.2%
070-8844-7122 | 2 | 0.2%
Other values (863) | 926 | 97.3%

Most occurring characters

ValueCountFrequency (%)
3 2633
23.0%
- 1903
16.6%
0 1623
14.2%
2 852
 
7.5%
7 779
 
6.8%
4 750
 
6.6%
6 724
 
6.3%
5 720
 
6.3%
1 616
 
5.4%
8 482
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 9529
83.4%
Dash Punctuation 1903
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 2633
27.6%
0 1623
17.0%
2 852
 
8.9%
7 779
 
8.2%
4 750
 
7.9%
6 724
 
7.6%
5 720
 
7.6%
1 616
 
6.5%
8 482
 
5.1%
9 350
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 1903
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 11432
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 2633
23.0%
- 1903
16.6%
0 1623
14.2%
2 852
 
7.5%
7 779
 
6.8%
4 750
 
6.6%
6 724
 
6.3%
5 720
 
6.3%
1 616
 
5.4%
8 482
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 11432
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 2633
23.0%
- 1903
16.6%
0 1623
14.2%
2 852
 
7.5%
7 779
 
6.8%
4 750
 
6.6%
6 724
 
6.3%
5 720
 
6.3%
1 616
 
5.4%
8 482
 
4.2%

생산품 (products)
Text

Distinct: 927
Distinct (%): 85.3%
Missing: 0
Missing (%): 0.0%
Memory size: 8.6 KiB

Length

Max length: 146
Median length: 55
Mean length: 11.703772
Min length: 2

Characters and Unicode

Total characters: 12722
Distinct characters: 514
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 863
Unique (%): 79.4%

Sample

1st row: 커튼
2nd row: 롤스크린, 버티컬, 홀딩도어
3rd row: 커튼, 침구류
4th row: 무대장치
5th row: 커튼,버디칼

Value | Count | Frequency (%)
[unreadable] | 63 | 2.6%
[unreadable] | 59 | 2.5%
창호 | 40 | 1.7%
아스콘 | 31 | 1.3%
울타리 | 27 | 1.1%
목재 | 25 | 1.0%
플라스틱 | 23 | 1.0%
철구조물 | 23 | 1.0%
[unreadable] | 22 | 0.9%
금속구조물 | 22 | 0.9%
Other values (1290) | 2066 | 86.0%

Most occurring characters

ValueCountFrequency (%)
1334
 
10.5%
, 1015
 
8.0%
318
 
2.5%
281
 
2.2%
239
 
1.9%
213
 
1.7%
211
 
1.7%
190
 
1.5%
187
 
1.5%
157
 
1.2%
Other values (504) 8577
67.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9977
78.4%
Space Separator 1334
 
10.5%
Other Punctuation 1028
 
8.1%
Uppercase Letter 210
 
1.7%
Open Punctuation 58
 
0.5%
Close Punctuation 57
 
0.4%
Lowercase Letter 38
 
0.3%
Decimal Number 14
 
0.1%
Dash Punctuation 5
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
318
 
3.2%
281
 
2.8%
239
 
2.4%
213
 
2.1%
211
 
2.1%
190
 
1.9%
187
 
1.9%
157
 
1.6%
156
 
1.6%
136
 
1.4%
Other values (454) 7889
79.1%
Uppercase Letter
ValueCountFrequency (%)
P 46
21.9%
E 29
13.8%
C 27
12.9%
D 18
 
8.6%
V 18
 
8.6%
L 15
 
7.1%
A 8
 
3.8%
R 7
 
3.3%
B 6
 
2.9%
T 6
 
2.9%
Other values (10) 30
14.3%
Lowercase Letter
ValueCountFrequency (%)
c 5
13.2%
o 5
13.2%
p 4
10.5%
t 3
7.9%
e 3
7.9%
d 3
7.9%
w 2
 
5.3%
a 2
 
5.3%
r 2
 
5.3%
s 2
 
5.3%
Other values (7) 7
18.4%
Decimal Number
ValueCountFrequency (%)
1 4
28.6%
2 4
28.6%
3 3
21.4%
9 2
14.3%
7 1
 
7.1%
Other Punctuation
ValueCountFrequency (%)
, 1015
98.7%
. 10
 
1.0%
/ 3
 
0.3%
Space Separator
ValueCountFrequency (%)
1334
100.0%
Open Punctuation
ValueCountFrequency (%)
( 58
100.0%
Close Punctuation
ValueCountFrequency (%)
) 57
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9977
78.4%
Common 2497
 
19.6%
Latin 248
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
318
 
3.2%
281
 
2.8%
239
 
2.4%
213
 
2.1%
211
 
2.1%
190
 
1.9%
187
 
1.9%
157
 
1.6%
156
 
1.6%
136
 
1.4%
Other values (454) 7889
79.1%
Latin
ValueCountFrequency (%)
P 46
18.5%
E 29
11.7%
C 27
 
10.9%
D 18
 
7.3%
V 18
 
7.3%
L 15
 
6.0%
A 8
 
3.2%
R 7
 
2.8%
B 6
 
2.4%
T 6
 
2.4%
Other values (27) 68
27.4%
Common
ValueCountFrequency (%)
1334
53.4%
, 1015
40.6%
( 58
 
2.3%
) 57
 
2.3%
. 10
 
0.4%
- 5
 
0.2%
1 4
 
0.2%
2 4
 
0.2%
/ 3
 
0.1%
3 3
 
0.1%
Other values (3) 4
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9977
78.4%
ASCII 2745
 
21.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1334
48.6%
, 1015
37.0%
( 58
 
2.1%
) 57
 
2.1%
P 46
 
1.7%
E 29
 
1.1%
C 27
 
1.0%
D 18
 
0.7%
V 18
 
0.7%
L 15
 
0.5%
Other values (40) 128
 
4.7%
Hangul
ValueCountFrequency (%)
318
 
3.2%
281
 
2.8%
239
 
2.4%
213
 
2.1%
211
 
2.1%
190
 
1.9%
187
 
1.9%
157
 
1.6%
156
 
1.6%
136
 
1.4%
Other values (454) 7889
79.1%

공장홈페이지 (factory website)
Text

MISSING 

Distinct: 54
Distinct (%): 96.4%
Missing: 1031
Missing (%): 94.8%
Memory size: 8.6 KiB

Length

Max length: 38
Median length: 25
Mean length: 18.428571
Min length: 11

Characters and Unicode

Total characters: 1032
Distinct characters: 40
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 52
Unique (%): 92.9%

Sample

1st row: www.t-one21.com
2nd row: www.gilhan.co.kr
3rd row: http://www.romasindustries.com
4th row: www.nfcf.or.kr
5th row: www.hcwood.co.kr

Value | Count | Frequency (%)
pipeway.co.kr | 2 | 3.5%
hwangtosesang.co.kr | 2 | 3.5%
www.kmlight.co.kr | 1 | 1.8%
www.hyundaiholesolar.com | 1 | 1.8%
www.galed.co.kr | 1 | 1.8%
daeyoucore.co.kr | 1 | 1.8%
www.wooryong.co.kr | 1 | 1.8%
www.caco3.co.kr | 1 | 1.8%
www.wooilers.co.kr | 1 | 1.8%
www.biltzone.com | 1 | 1.8%
Other values (45) | 45 | 78.9%

Most occurring characters

ValueCountFrequency (%)
w 141
13.7%
. 136
13.2%
o 96
 
9.3%
c 70
 
6.8%
r 68
 
6.6%
k 54
 
5.2%
e 49
 
4.7%
t 44
 
4.3%
n 39
 
3.8%
a 37
 
3.6%
Other values (30) 298
28.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 831
80.5%
Other Punctuation 177
 
17.2%
Decimal Number 19
 
1.8%
Dash Punctuation 3
 
0.3%
Connector Punctuation 1
 
0.1%
Space Separator 1
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
w 141
17.0%
o 96
11.6%
c 70
 
8.4%
r 68
 
8.2%
k 54
 
6.5%
e 49
 
5.9%
t 44
 
5.3%
n 39
 
4.7%
a 37
 
4.5%
s 29
 
3.5%
Other values (14) 204
24.5%
Decimal Number
ValueCountFrequency (%)
2 5
26.3%
6 3
15.8%
9 3
15.8%
3 2
 
10.5%
4 2
 
10.5%
0 1
 
5.3%
7 1
 
5.3%
8 1
 
5.3%
1 1
 
5.3%
Other Punctuation
ValueCountFrequency (%)
. 136
76.8%
/ 28
 
15.8%
: 12
 
6.8%
@ 1
 
0.6%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 831
80.5%
Common 201
 
19.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
w 141
17.0%
o 96
11.6%
c 70
 
8.4%
r 68
 
8.2%
k 54
 
6.5%
e 49
 
5.9%
t 44
 
5.3%
n 39
 
4.7%
a 37
 
4.5%
s 29
 
3.5%
Other values (14) 204
24.5%
Common
ValueCountFrequency (%)
. 136
67.7%
/ 28
 
13.9%
: 12
 
6.0%
2 5
 
2.5%
6 3
 
1.5%
9 3
 
1.5%
- 3
 
1.5%
3 2
 
1.0%
4 2
 
1.0%
0 1
 
0.5%
Other values (6) 6
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1032
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
w 141
13.7%
. 136
13.2%
o 96
 
9.3%
c 70
 
6.8%
r 68
 
6.6%
k 54
 
5.2%
e 49
 
4.7%
t 44
 
4.3%
n 39
 
3.8%
a 37
 
3.6%
Other values (30) 298
28.9%

Interactions

[Figures: interaction plots for the numeric variables]

Correlations

Variable | 연번 | 대표업종번호 | 공장홈페이지
연번 | 1.000 | 0.792 | 0.971
대표업종번호 | 0.792 | 1.000 | 1.000
공장홈페이지 | 0.971 | 1.000 | 1.000

Variable | 연번 | 대표업종번호
연번 | 1.000 | 0.998
대표업종번호 | 0.998 | 1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
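
One way to read "nullity correlation" is as the Pearson correlation between the present/absent indicators of two columns. The sketch below implements that reading with the standard library only. The function name `nullity_correlation` and the toy columns are hypothetical and are not taken from the dataset or from the profiling tool.

```python
from math import sqrt

def nullity_correlation(col_a, col_b):
    """Pearson correlation between the 'is present' indicators of two columns."""
    a = [0 if v is None else 1 for v in col_a]
    b = [0 if v is None else 1 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # A fully present or fully missing column has no defined correlation.
        return None
    return cov / sqrt(var_a * var_b)

# Hypothetical toy columns; None marks a missing cell.
phone = ["033-000-0000", None, "033-000-0001", None]
homepage = ["www.example.com", None, None, None]

print(nullity_correlation(phone, homepage))
```

A value near 1 means the two columns tend to be missing together, a value near -1 means one tends to be present when the other is missing, and columns with no missing values drop out of the comparison.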

Sample

First rows
Index | 연번 | 회사명 | 공장대표주소(도로명) | 대표업종번호 | 업종명 | 전화번호 | 생산품 | 공장홈페이지
0 | 1 | (주)우리창씨앤비 | 강원특별자치도 원주시 소초면 노루고개길 102 | 13223 | 커튼 및 유사제품 제조업 | 033-734-1284 | 커튼 | <NA>
1 | 2 | 강원블라인드 | 강원특별자치도 원주시 소초면 장수1로 289 | 13223 | 커튼 및 유사제품 제조업 | 033-747-8671 | 롤스크린, 버티컬, 홀딩도어 | <NA>
2 | 3 | 뉴스타 커텐 | 강원특별자치도 동해시 대동로 109-2, 483-2번지 1층 (구미동) | 13223 | 커튼 및 유사제품 제조업 | 033-522-3296 | 커튼, 침구류 | <NA>
3 | 4 | 선엔지니어링(주) | 강원특별자치도 춘천시 퇴계공단1길 58 (퇴계동, 정안사) | 13223 | 커튼 및 유사제품 제조업 외 6 종 | 033-264-0945 | 무대장치 | <NA>
4 | 5 | 스타종합커텐 | 강원특별자치도 강릉시 강릉대로 180-1 (교동) | 13223 | 커튼 및 유사제품 제조업 | 033-648-7709 | 커튼,버디칼 | <NA>
5 | 6 | (주)가림 | 강원특별자치도 원주시 태장공단길 54-4, 1층 1호(태장동) | 13224 | 천막, 텐트 및 유사 제품 제조업 | 033-762-2949 | 막구조물 | <NA>
6 | 7 | (주)스페이스업 | 강원특별자치도 원주시 문막읍 문막공단길 71 | 13224 | 천막, 텐트 및 유사 제품 제조업 외 7 종 | 033-743-0640 | 천막 및 기타 캔버스 제품 | <NA>
7 | 8 | (주)태성스페이스 | 강원특별자치도 횡성군 우천면 우천제2농공단지로 88-34 (우천면) | 13224 | 천막, 텐트 및 유사 제품 제조업 외 9 종 | 033-433-7275 | 막구조물, 교량난간, 금속재울타리, 가드레일 외 | <NA>
8 | 9 | (주)현대M&S | 강원특별자치도 평창군 평창읍 농공단지길 40, (주)현대M&S (INNO WIZ) | 13224 | 천막, 텐트 및 유사 제품 제조업 | 033-333-0997 | 막구조물 | <NA>
9 | 10 | 수성캐노피 | 강원특별자치도 철원군 동송읍 이평로 43 | 13224 | 천막, 텐트 및 유사 제품 제조업 외 1 종 | 033-458-7111 | 캐노피,접이식 방갈로,접이식 평상 | <NA>

Last rows
Index | 연번 | 회사명 | 공장대표주소(도로명) | 대표업종번호 | 업종명 | 전화번호 | 생산품 | 공장홈페이지
1077 | 1078 | 라 ENC | 강원특별자치도 춘천시 소양강로 10, 1동 8층 816호 (후평동) 1동 8층 816호 | 72121 | 건물 및 토목 엔지니어링 서비스업 외 1 종 | <NA> | 설계,엔지니어링 | <NA>
1078 | 1079 | 에덴종합건축 | 강원특별자치도 춘천시 후석로420번길 7, 1동 7층 723호(후평동) 1동 7층 723호 외 1필지 | 72121 | 건물 및 토목 엔지니어링 서비스업 | <NA> | 건물엔지니어링 서비스 | <NA>
1079 | 1080 | 온(ON)구조안전기술사사무소 | 강원특별자치도 춘천시 소양강로 10, 1동 6층 609호 (후평동) 1동 6층 609호 | 72121 | 건물 및 토목 엔지니어링 서비스업 | <NA> | 건물 및 토목 엔지니어링 서비스 | <NA>
1080 | 1081 | 우리건기 | 강원특별자치도 춘천시 후석로420번길 7, 1동 3층 331호(후평동) 1동 3층 331호 | 72129 | 기타 엔지니어링 서비스업 | 033-262-6099 | 기술서비스, 기타전기, 배선, 엔지니어링, 토목 건설 | <NA>
1081 | 1082 | 주식회사 강일이엔씨 | 강원특별자치도 춘천시 소양강로 10, 1동 9층 901호 (후평동) 1동 9층 901호 | 72129 | 기타 엔지니어링 서비스업 | 033-263-3607 | 토목구조설계서 | <NA>
1082 | 1083 | 주식회사 다올이엔지 | 강원특별자치도 춘천시 후석로420번길 7, 1동 7층 706호(후평동) 1동 7층 706호 외 1필지 | 72129 | 기타 엔지니어링 서비스업 외 2 종 | 033-255-0030 | 설계 및 감리도서, 관리용 소프트웨어 | <NA>
1083 | 1084 | 춘천철거 | 강원특별자치도 춘천시 후석로420번길 7, 1동 5층 509호(후평동) 1동 5층 509호 | 72129 | 기타 엔지니어링 서비스업 | <NA> | 기타엔지니어링서비스 | <NA>
1084 | 1085 | 퍼스트 공간정보 | 강원특별자치도 춘천시 후석로420번길 7, 1동 7층 711호(후평동) 1동 7층 711호 | 72129 | 기타 엔지니어링 서비스업 | <NA> | 토목건설 측량, 도면설계 | <NA>
1085 | 1086 | (주)대성그랜드 | 강원특별자치도 춘천시 후석로420번길 7, 1동 7층 715호(후평동) 1동 7층 715호 | 72921 | 측량업 | 033-256-5938 | 측량 | <NA>
1086 | 1087 | 대경적산 | 강원특별자치도 춘천시 소양강로 10, 1동 6층 603호 (후평동) 1동 6층 603호 | 72922 | 제도업 | <NA> | 제도, 적산 | <NA>