Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 2761
Missing cells (%): 4.6%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 556.6 KiB
Average record size in memory: 57.0 B
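
The two derived figures in this table can be reproduced from the raw counts. The sketch below is only a consistency check, not part of the report: it assumes "Missing cells (%)" is taken over all rows × columns and that 1 KiB = 1024 bytes. The function names are hypothetical.

```python
# Figures copied from the "Dataset statistics" table above.
n_variables = 6
n_observations = 10_000
missing_cells = 2_761
total_size_kib = 556.6


def missing_cell_share(missing: int, rows: int, cols: int) -> float:
    """Fraction of all cells (rows x cols) that are missing."""
    return missing / (rows * cols)


def average_record_size_bytes(total_kib: float, rows: int) -> float:
    """Average in-memory bytes per row, assuming 1 KiB = 1024 bytes."""
    return total_kib * 1024 / rows


missing_pct = round(100 * missing_cell_share(missing_cells, n_observations, n_variables), 1)
record_bytes = round(average_record_size_bytes(total_size_kib, n_observations), 1)
print(missing_pct, record_bytes)  # prints: 4.6 57.0
```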

Variable types

Numeric: 1
Categorical: 1
Text: 4

Dataset

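Description: Data on the status of manufacturing companies in Chungcheongbuk-do (company name, classification code, main products, number of employees, location, phone number). Classification codes refer to the manufacturing categories of the Korean Standard Industrial Classification.
Author: 충청북도 (Chungcheongbuk-do)
URL: https://www.data.go.kr/data/15034235/fileData.do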

Alerts

연번 is highly overall correlated with 시군 (High correlation)
시군 is highly overall correlated with 연번 (High correlation)
전화번호 has 2755 (27.6%) missing values (Missing)
연번 has unique values (Unique)

Reproduction

Analysis started: 2024-03-15 02:05:35.107225
Analysis finished: 2024-03-15 02:05:38.462711
Duration: 3.36 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
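
As a quick check, the reported duration follows from the two timestamps. This is a minimal sketch using only the standard library; the format string is an assumption about how the timestamps are written here.

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block above.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2024-03-15 02:05:35.107225", FMT)
finished = datetime.strptime("2024-03-15 02:05:38.462711", FMT)

# Elapsed wall-clock time between start and finish, in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")  # prints: 3.36 seconds
```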

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5337.4745
Minimum: 1
Maximum: 10643
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 537.95
Q1: 2677.75
Median: 5346.5
Q3: 8008.5
95-th percentile: 10115.05
Maximum: 10643
Range: 10642
Interquartile range (IQR): 5330.75

Descriptive statistics

Standard deviation: 3074.4815
Coefficient of variation (CV): 0.57601802
Kurtosis: -1.2014717
Mean: 5337.4745
Median Absolute Deviation (MAD): 2666
Skewness: -0.0066592746
Sum: 53374745
Variance: 9452436.5
Monotonicity: Not monotonic
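
Several of these statistics are linked by simple identities, so they can be cross-checked against each other. The sketch below assumes the standard definitions (CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 − Q1, range = maximum − minimum). It is a sanity check on the rounded values shown, not the profiler's own code.

```python
# Values copied from the quantile and descriptive statistics of 연번 above.
mean = 5337.4745
std = 3074.4815
q1, q3 = 2677.75, 8008.5
minimum, maximum = 1, 10643

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # spread between the extremes

print(f"CV={cv:.6f} variance={variance:.1f} IQR={iqr} range={value_range}")
```

Because the inputs are already rounded, the recomputed CV and variance agree with the reported figures only to within rounding error.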
Histogram with fixed size bins (bins=50)

Common Values

Value | Count | Frequency (%)
8994 | 1 | < 0.1%
467 | 1 | < 0.1%
1014 | 1 | < 0.1%
351 | 1 | < 0.1%
9997 | 1 | < 0.1%
5982 | 1 | < 0.1%
2092 | 1 | < 0.1%
3778 | 1 | < 0.1%
8985 | 1 | < 0.1%
10192 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
4 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
10 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
10643 | 1 | < 0.1%
10642 | 1 | < 0.1%
10641 | 1 | < 0.1%
10640 | 1 | < 0.1%
10639 | 1 | < 0.1%
10638 | 1 | < 0.1%
10637 | 1 | < 0.1%
10636 | 1 | < 0.1%
10635 | 1 | < 0.1%
10634 | 1 | < 0.1%

시군
Categorical

HIGH CORRELATION 

Distinct: 11
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

청주시: 3351
음성군: 2800
진천군: 1180
충주시: 892
옥천군: 490
Other values (6): 1287

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 음성군
2nd row: 청주시
3rd row: 진천군
4th row: 청주시
5th row: 음성군

Common Values

Value | Count | Frequency (%)
청주시 | 3351 | 33.5%
음성군 | 2800 | 28.0%
진천군 | 1180 | 11.8%
충주시 | 892 | 8.9%
옥천군 | 490 | 4.9%
제천시 | 366 | 3.7%
괴산군 | 347 | 3.5%
보은군 | 184 | 1.8%
영동군 | 162 | 1.6%
증평군 | 138 | 1.4%

Length

Histogram of lengths of the category

회사명
Text

Distinct: 9461
Distinct (%): 94.6%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 35
Median length: 24
Mean length: 7.7809
Min length: 2

Characters and Unicode

Total characters: 77809
Distinct characters: 805
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 8995
Unique (%): 90.0%

Sample

1st row: (주)한국씨엔엠
2nd row: (주)한랩
3rd row: 진솔화학 주식회사
4th row: (주)엠지바이오
5th row: 성원산업
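
The Unicode character properties mentioned above can be inspected with Python's standard `unicodedata` module. The following is a minimal sketch of counting characters by general category for one of the sample rows. The helper name `category_counts` is hypothetical, and the profiler may compute its category tables differently.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category (e.g. 'Lo', 'Ps', 'Pe')."""
    return Counter(unicodedata.category(ch) for ch in text)


sample = "(주)한랩"  # 2nd sample row shown above
counts = category_counts(sample)

# 'Lo' = Other Letter, 'Ps' = Open Punctuation, 'Pe' = Close Punctuation
print(counts["Lo"], counts["Ps"], counts["Pe"])  # prints: 3 1 1
```

The two-letter codes map onto the category names used in the tables that follow: Hangul syllables fall under Other Letter, while the parentheses around 주 account for the Open and Close Punctuation counts.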
Value | Count | Frequency (%)
주식회사 | 951 | 8.0%
농업회사법인 | 165 | 1.4%
제2공장 | 116 | 1.0%
2공장 | 56 | 0.5%
(blank) | 36 | 0.3%
유한회사 | 24 | 0.2%
영농조합법인 | 22 | 0.2%
제3공장 | 21 | 0.2%
음성공장 | 16 | 0.1%
제1공장 | 14 | 0.1%
Other values (9450) | 10405 | 88.0%

Most occurring characters

ValueCountFrequency (%)
7385
 
9.5%
) 6334
 
8.1%
( 6333
 
8.1%
2379
 
3.1%
1939
 
2.5%
1863
 
2.4%
1631
 
2.1%
1510
 
1.9%
1406
 
1.8%
1403
 
1.8%
Other values (795) 45626
58.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 61764
79.4%
Close Punctuation 6334
 
8.1%
Open Punctuation 6333
 
8.1%
Space Separator 1939
 
2.5%
Uppercase Letter 711
 
0.9%
Decimal Number 436
 
0.6%
Lowercase Letter 125
 
0.2%
Other Punctuation 103
 
0.1%
Other Symbol 52
 
0.1%
Dash Punctuation 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7385
 
12.0%
2379
 
3.9%
1863
 
3.0%
1631
 
2.6%
1510
 
2.4%
1406
 
2.3%
1403
 
2.3%
1266
 
2.0%
1191
 
1.9%
835
 
1.4%
Other values (725) 40895
66.2%
Uppercase Letter
ValueCountFrequency (%)
E 83
 
11.7%
S 77
 
10.8%
N 63
 
8.9%
G 55
 
7.7%
T 55
 
7.7%
C 48
 
6.8%
M 42
 
5.9%
H 28
 
3.9%
K 25
 
3.5%
L 24
 
3.4%
Other values (16) 211
29.7%
Lowercase Letter
ValueCountFrequency (%)
n 17
13.6%
e 17
13.6%
i 12
9.6%
o 12
9.6%
h 10
8.0%
a 9
7.2%
c 8
 
6.4%
t 6
 
4.8%
r 6
 
4.8%
g 6
 
4.8%
Other values (12) 22
17.6%
Decimal Number
ValueCountFrequency (%)
2 290
66.5%
1 64
 
14.7%
3 50
 
11.5%
4 9
 
2.1%
5 7
 
1.6%
0 5
 
1.1%
9 4
 
0.9%
8 3
 
0.7%
7 2
 
0.5%
6 2
 
0.5%
Other Punctuation
ValueCountFrequency (%)
. 62
60.2%
& 31
30.1%
· 3
 
2.9%
, 3
 
2.9%
" 2
 
1.9%
: 1
 
1.0%
/ 1
 
1.0%
Close Punctuation
ValueCountFrequency (%)
) 6334
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6333
100.0%
Space Separator
ValueCountFrequency (%)
1939
100.0%
Other Symbol
ValueCountFrequency (%)
52
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 61814
79.4%
Common 15157
 
19.5%
Latin 836
 
1.1%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7385
 
11.9%
2379
 
3.8%
1863
 
3.0%
1631
 
2.6%
1510
 
2.4%
1406
 
2.3%
1403
 
2.3%
1266
 
2.0%
1191
 
1.9%
835
 
1.4%
Other values (724) 40945
66.2%
Latin
ValueCountFrequency (%)
E 83
 
9.9%
S 77
 
9.2%
N 63
 
7.5%
G 55
 
6.6%
T 55
 
6.6%
C 48
 
5.7%
M 42
 
5.0%
H 28
 
3.3%
K 25
 
3.0%
L 24
 
2.9%
Other values (38) 336
40.2%
Common
ValueCountFrequency (%)
) 6334
41.8%
( 6333
41.8%
1939
 
12.8%
2 290
 
1.9%
1 64
 
0.4%
. 62
 
0.4%
3 50
 
0.3%
& 31
 
0.2%
- 12
 
0.1%
4 9
 
0.1%
Other values (11) 33
 
0.2%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 61761
79.4%
ASCII 15990
 
20.6%
None 55
 
0.1%
CJK 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7385
 
12.0%
2379
 
3.9%
1863
 
3.0%
1631
 
2.6%
1510
 
2.4%
1406
 
2.3%
1403
 
2.3%
1266
 
2.0%
1191
 
1.9%
835
 
1.4%
Other values (722) 40892
66.2%
ASCII
ValueCountFrequency (%)
) 6334
39.6%
( 6333
39.6%
1939
 
12.1%
2 290
 
1.8%
E 83
 
0.5%
S 77
 
0.5%
1 64
 
0.4%
N 63
 
0.4%
. 62
 
0.4%
G 55
 
0.3%
Other values (58) 690
 
4.3%
None
ValueCountFrequency (%)
52
94.5%
· 3
 
5.5%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

공장대표주소(도로명)
Text

Distinct: 9412
Distinct (%): 94.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 97
Median length: 65
Mean length: 25.9645
Min length: 12

Characters and Unicode

Total characters: 259645
Distinct characters: 462
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 8986
Unique (%): 89.9%

Sample

1st row: 충청북도 음성군 음성읍 한벌리 90 외 1필지
2nd row: 충청북도 청주시 흥덕구 오송읍 연제리 640
3rd row: 충청북도 진천군 문백면 평산리 503-2번지
4th row: 충청북도 청주시 흥덕구 강내면 월곡리 301-3번지 충청대학 창업보육센터 R동 116호
5th row: 충청북도 음성군 대소면 소석리 292-1 외 6필지
Value | Count | Frequency (%)
충청북도 | 10002 | 17.2%
청주시 | 3349 | 5.8%
음성군 | 2802 | 4.8%
(blank) | 1439 | 2.5%
흥덕구 | 1371 | 2.4%
청원구 | 1287 | 2.2%
진천군 | 1180 | 2.0%
충주시 | 892 | 1.5%
금왕읍 | 673 | 1.2%
대소면 | 653 | 1.1%
Other values (8500) | 34463 | 59.3%

Most occurring characters

ValueCountFrequency (%)
48626
 
18.7%
15106
 
5.8%
11170
 
4.3%
10517
 
4.1%
10376
 
4.0%
1 8901
 
3.4%
5915
 
2.3%
5825
 
2.2%
- 5763
 
2.2%
2 5741
 
2.2%
Other values (452) 131705
50.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 160677
61.9%
Space Separator 48626
 
18.7%
Decimal Number 41462
 
16.0%
Dash Punctuation 5763
 
2.2%
Close Punctuation 1196
 
0.5%
Open Punctuation 1194
 
0.5%
Other Punctuation 430
 
0.2%
Uppercase Letter 285
 
0.1%
Math Symbol 6
 
< 0.1%
Other Symbol 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15106
 
9.4%
11170
 
7.0%
10517
 
6.5%
10376
 
6.5%
5915
 
3.7%
5825
 
3.6%
5638
 
3.5%
5494
 
3.4%
4897
 
3.0%
4655
 
2.9%
Other values (407) 81084
50.5%
Uppercase Letter
ValueCountFrequency (%)
B 81
28.4%
L 40
14.0%
E 27
 
9.5%
G 25
 
8.8%
A 21
 
7.4%
F 16
 
5.6%
S 14
 
4.9%
D 10
 
3.5%
C 10
 
3.5%
T 9
 
3.2%
Other values (10) 32
 
11.2%
Decimal Number
ValueCountFrequency (%)
1 8901
21.5%
2 5741
13.8%
3 4529
10.9%
4 3949
9.5%
5 3732
9.0%
6 3251
 
7.8%
7 3198
 
7.7%
0 3163
 
7.6%
8 2604
 
6.3%
9 2394
 
5.8%
Other Punctuation
ValueCountFrequency (%)
, 424
98.6%
' 2
 
0.5%
. 1
 
0.2%
* 1
 
0.2%
1
 
0.2%
& 1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 1188
99.3%
] 8
 
0.7%
Open Punctuation
ValueCountFrequency (%)
( 1186
99.3%
[ 8
 
0.7%
Other Symbol
ValueCountFrequency (%)
4
66.7%
2
33.3%
Space Separator
ValueCountFrequency (%)
48626
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5763
100.0%
Math Symbol
ValueCountFrequency (%)
~ 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 160679
61.9%
Common 98679
38.0%
Latin 285
 
0.1%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15106
 
9.4%
11170
 
7.0%
10517
 
6.5%
10376
 
6.5%
5915
 
3.7%
5825
 
3.6%
5638
 
3.5%
5494
 
3.4%
4897
 
3.0%
4655
 
2.9%
Other values (406) 81086
50.5%
Common
ValueCountFrequency (%)
48626
49.3%
1 8901
 
9.0%
- 5763
 
5.8%
2 5741
 
5.8%
3 4529
 
4.6%
4 3949
 
4.0%
5 3732
 
3.8%
6 3251
 
3.3%
7 3198
 
3.2%
0 3163
 
3.2%
Other values (14) 7826
 
7.9%
Latin
ValueCountFrequency (%)
B 81
28.4%
L 40
14.0%
E 27
 
9.5%
G 25
 
8.8%
A 21
 
7.4%
F 16
 
5.6%
S 14
 
4.9%
D 10
 
3.5%
C 10
 
3.5%
T 9
 
3.2%
Other values (10) 32
 
11.2%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 160675
61.9%
ASCII 98961
38.1%
None 5
 
< 0.1%
CJK Compat 2
 
< 0.1%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
48626
49.1%
1 8901
 
9.0%
- 5763
 
5.8%
2 5741
 
5.8%
3 4529
 
4.6%
4 3949
 
4.0%
5 3732
 
3.8%
6 3251
 
3.3%
7 3198
 
3.2%
0 3163
 
3.2%
Other values (32) 8108
 
8.2%
Hangul
ValueCountFrequency (%)
15106
 
9.4%
11170
 
7.0%
10517
 
6.5%
10376
 
6.5%
5915
 
3.7%
5825
 
3.6%
5638
 
3.5%
5494
 
3.4%
4897
 
3.0%
4655
 
2.9%
Other values (405) 81082
50.5%
None
ValueCountFrequency (%)
4
80.0%
1
 
20.0%
CJK Compat
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

전화번호
Text

MISSING 

Distinct: 6546
Distinct (%): 90.4%
Missing: 2755
Missing (%): 27.6%
Memory size: 156.2 KiB

Length

Max length: 15
Median length: 12
Mean length: 12.015459
Min length: 2

Characters and Unicode

Total characters: 87052
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 5955
Unique (%): 82.2%

Sample

1st row: 043-229-6200
2nd row: 043-232-6768
3rd row: 043-260-6048
4th row: 043-836-9019
5th row: 042-382-7909
Value | Count | Frequency (%)
043-535-1922 | 9 | 0.1%
043-733-7208 | 7 | 0.1%
043-877-2034 | 6 | 0.1%
043-883-0495 | 5 | 0.1%
043-0000-0000 | 5 | 0.1%
043-820-4111 | 5 | 0.1%
043-844-1100 | 4 | 0.1%
043-872-4353 | 4 | 0.1%
043-857-9755 | 4 | 0.1%
043-276-8599 | 4 | 0.1%
Other values (6536) | 7192 | 99.3%
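
The frequency table above includes an all-zero entry (043-0000-0000), and the length statistics report a minimum length of 2, so some entries are unlikely to be usable phone numbers. A minimal sketch of how such values might be flagged follows. The regular expression and the placeholder rule are assumptions made for illustration, not rules taken from the report or the data publisher, and the value "04" is a hypothetical malformed entry.

```python
import re

# Assumed pattern (not from the report): an area code of 2-3 digits starting
# with 0, a 3-4 digit exchange, and a 4-digit line number, joined by hyphens.
PHONE_RE = re.compile(r"0\d{1,2}-\d{3,4}-\d{4}")


def classify_phone(value):
    """Return 'missing', 'malformed', 'placeholder', or 'ok' for one entry."""
    if value is None:
        return "missing"
    text = value.strip()
    if not PHONE_RE.fullmatch(text):
        return "malformed"
    # Assumption: a subscriber part made only of zeros is a placeholder.
    if set(text.split("-", 1)[1]) <= {"0", "-"}:
        return "placeholder"
    return "ok"


examples = ["043-229-6200", "043-0000-0000", None, "04"]
labels = [classify_phone(v) for v in examples]
print(labels)  # prints: ['ok', 'placeholder', 'missing', 'malformed']
```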

Most occurring characters

ValueCountFrequency (%)
- 14462
16.6%
0 12833
14.7%
3 11929
13.7%
4 10265
11.8%
2 7219
8.3%
8 7088
8.1%
7 6020
6.9%
1 5781
 
6.6%
5 4552
 
5.2%
6 4026
 
4.6%
Other values (2) 2877
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 72588
83.4%
Dash Punctuation 14462
 
16.6%
Space Separator 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 12833
17.7%
3 11929
16.4%
4 10265
14.1%
2 7219
9.9%
8 7088
9.8%
7 6020
8.3%
1 5781
8.0%
5 4552
 
6.3%
6 4026
 
5.5%
9 2875
 
4.0%
Dash Punctuation
ValueCountFrequency (%)
- 14462
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 87052
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 14462
16.6%
0 12833
14.7%
3 11929
13.7%
4 10265
11.8%
2 7219
8.3%
8 7088
8.1%
7 6020
6.9%
1 5781
 
6.6%
5 4552
 
5.2%
6 4026
 
4.6%
Other values (2) 2877
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 87052
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 14462
16.6%
0 12833
14.7%
3 11929
13.7%
4 10265
11.8%
2 7219
8.3%
8 7088
8.1%
7 6020
6.9%
1 5781
 
6.6%
5 4552
 
5.2%
6 4026
 
4.6%
Other values (2) 2877
 
3.3%

주생산품
Text

Distinct: 8296
Distinct (%): 83.0%
Missing: 6
Missing (%): 0.1%
Memory size: 156.2 KiB

Length

Max length: 89
Median length: 73
Mean length: 10.941265
Min length: 1

Characters and Unicode

Total characters: 109347
Distinct characters: 949
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 7662
Unique (%): 76.7%

Sample

1st row: 케이블트로프,플륨관,수로관,콘크리트제품 형틀
2nd row: 의료용 원심분리기
3rd row: 수산화세륨,인산,질산나트륨
4th row: 과채주스, 과채음료
5th row: H-BEAM
Value | Count | Frequency (%)
(blank) | 507 | 2.5%
(blank) | 380 | 1.9%
플라스틱 | 154 | 0.8%
부품 | 140 | 0.7%
(blank) | 122 | 0.6%
철구조물 | 96 | 0.5%
알루미늄 | 84 | 0.4%
반도체 | 74 | 0.4%
창호 | 74 | 0.4%
화장품 | 71 | 0.4%
Other values (10528) | 18280 | 91.5%

Most occurring characters

ValueCountFrequency (%)
10143
 
9.3%
, 6798
 
6.2%
2720
 
2.5%
1806
 
1.7%
1791
 
1.6%
1779
 
1.6%
1618
 
1.5%
1483
 
1.4%
1269
 
1.2%
1223
 
1.1%
Other values (939) 78717
72.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 83267
76.1%
Space Separator 10143
 
9.3%
Other Punctuation 7096
 
6.5%
Uppercase Letter 4598
 
4.2%
Lowercase Letter 1901
 
1.7%
Open Punctuation 980
 
0.9%
Close Punctuation 978
 
0.9%
Decimal Number 292
 
0.3%
Dash Punctuation 88
 
0.1%
Math Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2720
 
3.3%
1806
 
2.2%
1791
 
2.2%
1779
 
2.1%
1618
 
1.9%
1483
 
1.8%
1269
 
1.5%
1223
 
1.5%
1118
 
1.3%
1104
 
1.3%
Other values (854) 67356
80.9%
Uppercase Letter
ValueCountFrequency (%)
P 676
14.7%
C 562
12.2%
E 415
 
9.0%
L 282
 
6.1%
D 257
 
5.6%
T 251
 
5.5%
S 251
 
5.5%
A 249
 
5.4%
V 199
 
4.3%
R 198
 
4.3%
Other values (17) 1258
27.4%
Lowercase Letter
ValueCountFrequency (%)
e 237
12.5%
a 168
 
8.8%
l 155
 
8.2%
r 155
 
8.2%
i 146
 
7.7%
o 145
 
7.6%
t 105
 
5.5%
s 104
 
5.5%
n 93
 
4.9%
p 90
 
4.7%
Other values (16) 503
26.5%
Other Punctuation
ValueCountFrequency (%)
, 6798
95.8%
. 159
 
2.2%
/ 110
 
1.6%
' 8
 
0.1%
· 8
 
0.1%
& 3
 
< 0.1%
% 3
 
< 0.1%
: 2
 
< 0.1%
? 2
 
< 0.1%
2
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
2 78
26.7%
0 58
19.9%
3 37
12.7%
1 32
11.0%
4 25
 
8.6%
6 18
 
6.2%
5 17
 
5.8%
9 14
 
4.8%
7 7
 
2.4%
8 6
 
2.1%
Open Punctuation
ValueCountFrequency (%)
( 974
99.4%
[ 5
 
0.5%
{ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 972
99.4%
] 5
 
0.5%
} 1
 
0.1%
Math Symbol
ValueCountFrequency (%)
+ 2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
10143
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 88
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 83267
76.1%
Common 19581
 
17.9%
Latin 6498
 
5.9%
Greek 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2720
 
3.3%
1806
 
2.2%
1791
 
2.2%
1779
 
2.1%
1618
 
1.9%
1483
 
1.8%
1269
 
1.5%
1223
 
1.5%
1118
 
1.3%
1104
 
1.3%
Other values (854) 67356
80.9%
Latin
ValueCountFrequency (%)
P 676
 
10.4%
C 562
 
8.6%
E 415
 
6.4%
L 282
 
4.3%
D 257
 
4.0%
T 251
 
3.9%
S 251
 
3.9%
A 249
 
3.8%
e 237
 
3.6%
V 199
 
3.1%
Other values (42) 3119
48.0%
Common
ValueCountFrequency (%)
10143
51.8%
, 6798
34.7%
( 974
 
5.0%
) 972
 
5.0%
. 159
 
0.8%
/ 110
 
0.6%
- 88
 
0.4%
2 78
 
0.4%
0 58
 
0.3%
3 37
 
0.2%
Other values (22) 164
 
0.8%
Greek
ValueCountFrequency (%)
Φ 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 83264
76.1%
ASCII 26067
 
23.8%
None 13
 
< 0.1%
Compat Jamo 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10143
38.9%
, 6798
26.1%
( 974
 
3.7%
) 972
 
3.7%
P 676
 
2.6%
C 562
 
2.2%
E 415
 
1.6%
L 282
 
1.1%
D 257
 
1.0%
T 251
 
1.0%
Other values (70) 4737
18.2%
Hangul
ValueCountFrequency (%)
2720
 
3.3%
1806
 
2.2%
1791
 
2.2%
1779
 
2.1%
1618
 
1.9%
1483
 
1.8%
1269
 
1.5%
1223
 
1.5%
1118
 
1.3%
1104
 
1.3%
Other values (851) 67353
80.9%
None
ValueCountFrequency (%)
· 8
61.5%
2
 
15.4%
Φ 1
 
7.7%
1
 
7.7%
1
 
7.7%
Compat Jamo
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Interactions

[Figure: interactions plot]

Correlations

Variable | 연번 | 시군
연번 | 1.000 | 0.871
시군 | 0.871 | 1.000

Variable | 연번 | 시군
연번 | 1.000 | 0.617
시군 | 0.617 | 1.000

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
[Figure] The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
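
To make the idea of nullity correlation concrete, the sketch below computes a Pearson correlation between the missingness indicators of two columns. It assumes this usual definition. The report does not show its own computation, and the toy columns are hypothetical.

```python
import math


def nullity_correlation(col_a, col_b):
    """Pearson correlation between the missingness indicators of two columns.

    Returns None when either column is entirely present or entirely missing,
    because its indicator then has zero variance.
    """
    a = [1.0 if v is None else 0.0 for v in col_a]
    b = [1.0 if v is None else 0.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / math.sqrt(var_a * var_b)


# Hypothetical toy columns; None marks a missing value.
phone = ["043-229-6200", None, None, "043-232-6768"]
product = ["의료용 원심분리기", None, None, "과채주스, 과채음료"]
address = ["a", "b", "c", "d"]

same_pattern = nullity_correlation(phone, product)  # missing in the same rows
no_variance = nullity_correlation(phone, address)   # address is never missing
print(same_pattern, no_variance)  # prints: 1.0 None
```

A value near 1 means two columns tend to be missing in the same rows, a value near -1 means one tends to be present when the other is missing, and columns with no missing values carry no nullity signal at all.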

Sample

(index) | 연번 | 시군 | 회사명 | 공장대표주소(도로명) | 전화번호 | 주생산품
8993 | 8994 | 음성군 | (주)한국씨엔엠 | 충청북도 음성군 음성읍 한벌리 90 외 1필지 | <NA> | 케이블트로프,플륨관,수로관,콘크리트제품 형틀
1416 | 1417 | 청주시 | (주)한랩 | 충청북도 청주시 흥덕구 오송읍 연제리 640 | 043-229-6200 | 의료용 원심분리기
7143 | 7144 | 진천군 | 진솔화학 주식회사 | 충청북도 진천군 문백면 평산리 503-2번지 | <NA> | 수산화세륨,인산,질산나트륨
851 | 852 | 청주시 | (주)엠지바이오 | 충청북도 청주시 흥덕구 강내면 월곡리 301-3번지 충청대학 창업보육센터 R동 116호 | 043-232-6768 | 과채주스, 과채음료
9651 | 9652 | 음성군 | 성원산업 | 충청북도 음성군 대소면 소석리 292-1 외 6필지 | <NA> | H-BEAM
1803 | 1804 | 청주시 | 대양정공(주) | 충청북도 청주시 서원구 현도면 중삼리 322 | 043-260-6048 | 광산용기계 제작
7569 | 7570 | 괴산군 | 크린팩 주식회사 | 충청북도 괴산군 사리면 사리로 510 외 1필지 | 043-836-9019 | 흡수 패드
3014 | 3015 | 청주시 | 주식회사 워터제네시스 | 충청북도 청주시 흥덕구 강내면 탑연리 322-8 | 042-382-7909 | 살균세척기
10416 | 10417 | 음성군 | 플라맥스(주) 제3공장 | 충청북도 음성군 맹동면 덕금로23번길 52 | 043-877-4611 | 볼펜심, 고급펜
4574 | 4575 | 제천시 | (주)박원 | 충청북도 제천시 바이오밸리로 105 (왕암동) 외 1필지 | 031-227-0981 | 볼베어링용 강구

(index) | 연번 | 시군 | 회사명 | 공장대표주소(도로명) | 전화번호 | 주생산품
2542 | 2543 | 청주시 | 엠.비. 산업 | 충청북도 청주시 흥덕구 옥산면 호죽리 888-0 | 043-237-3226 | 포장용 봉투
6234 | 6235 | 진천군 | (주)강한스틸 | 충청북도 진천군 덕산읍 화상리 299-6번지 | <NA> | 철근 형상 가공품
5421 | 5422 | 옥천군 | 락희푸드 | 충청북도 옥천군 옥천읍 가풍리 901-1 | 043-733-9383 | 과채가공품, 서류가공품, 초콜릿가공품
3419 | 3420 | 청주시 | 푸르미영농조합법인 | 충청북도 청주시 흥덕구 오송읍 봉산리 379-10번지 | <NA> | 백미
5722 | 5723 | 영동군 | 네오리텍(주) | 충청북도 영동군 용산면 법화리 10번지 | 043-770-7701 | 페라이트파우더
4391 | 4392 | 충주시 | 한성공압콤푸레샤(주) | 충청북도 충주시 대소원면 완오리 1139 번지 (충주첨단산업단지) | 043-855-8000 | 콤퓨레샤
2438 | 2439 | 청주시 | 양지말영농조합법인 | 충청북도 청주시 청원구 북이면 화상리 353-7번지 | <NA> | 백미
5861 | 5862 | 증평군 | (주)윈텍 | 충청북도 증평군 도안면 증평2산단로 86, (주)윈텍 | 043-838-8371 | 전사지 잉크젯
7714 | 7715 | 음성군 | (주)네오그린텍 | 충청북도 음성군 금왕읍 오선산단로 12 외 1필지 | 043-877-5804 | 액상형구체방수제, 분말형구체방수제,특수몰탈,세라믹코팅제
5518 | 5519 | 옥천군 | 육각수식품 | 충청북도 옥천군 이원면 건진리 725-1번지 외 1필지 | 043-733-6088 | 두부