Overview

Dataset statistics

Number of variables4
Number of observations54
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory34.4 B

Variable types

Text3
Boolean1

Dataset

Description계룡시 관내 등록된 제조업체 현황으로 기업체명, 소재지, 주생산품, 공장등록여부에 관한 공공데이터를 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=308&beforeMenuCd=DOM_000000201001001000&publicdatapk=15093903

Alerts

공장등록 is highly imbalanced (61.9%)Imbalance
기업체명 has unique valuesUnique
주생산품 has unique valuesUnique

Reproduction

Analysis started2024-01-09 21:52:57.197512
Analysis finished2024-01-09 21:52:57.562463
Duration0.36 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기업체명
Text

UNIQUE 

Distinct54
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size564.0 B
2024-01-10T06:52:57.705399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length14
Mean length7.5740741
Min length4

Characters and Unicode

Total characters409
Distinct characters133
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique54 ?
Unique (%)100.0%

Sample

1st row(주)굿스굿
2nd row(주)그레이스식품
3rd row(주)금아공조
4th row(주)내담에프앤비
5th row우리디자인
ValueCountFrequency (%)
농업회사법인 2
 
3.3%
주)굿스굿 1
 
1.7%
서로 1
 
1.7%
주)휴마스 1
 
1.7%
고향식품 1
 
1.7%
길산스틸(주 1
 
1.7%
신도안종합식품(주 1
 
1.7%
주)훼미리푸드 1
 
1.7%
리브가 1
 
1.7%
푸드 1
 
1.7%
Other values (49) 49
81.7%
2024-01-10T06:52:58.000336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42
 
10.3%
( 39
 
9.5%
) 39
 
9.5%
17
 
4.2%
16
 
3.9%
8
 
2.0%
7
 
1.7%
7
 
1.7%
6
 
1.5%
6
 
1.5%
Other values (123) 222
54.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 325
79.5%
Open Punctuation 39
 
9.5%
Close Punctuation 39
 
9.5%
Space Separator 6
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
12.9%
17
 
5.2%
16
 
4.9%
8
 
2.5%
7
 
2.2%
7
 
2.2%
6
 
1.8%
5
 
1.5%
5
 
1.5%
5
 
1.5%
Other values (120) 207
63.7%
Open Punctuation
ValueCountFrequency (%)
( 39
100.0%
Close Punctuation
ValueCountFrequency (%)
) 39
100.0%
Space Separator
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 325
79.5%
Common 84
 
20.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
12.9%
17
 
5.2%
16
 
4.9%
8
 
2.5%
7
 
2.2%
7
 
2.2%
6
 
1.8%
5
 
1.5%
5
 
1.5%
5
 
1.5%
Other values (120) 207
63.7%
Common
ValueCountFrequency (%)
( 39
46.4%
) 39
46.4%
6
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 325
79.5%
ASCII 84
 
20.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
42
 
12.9%
17
 
5.2%
16
 
4.9%
8
 
2.5%
7
 
2.2%
7
 
2.2%
6
 
1.8%
5
 
1.5%
5
 
1.5%
5
 
1.5%
Other values (120) 207
63.7%
ASCII
ValueCountFrequency (%)
( 39
46.4%
) 39
46.4%
6
 
7.1%
Distinct50
Distinct (%)92.6%
Missing0
Missing (%)0.0%
Memory size564.0 B
2024-01-10T06:52:58.179775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length16
Mean length13.240741
Min length7

Characters and Unicode

Total characters715
Distinct characters54
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique47 ?
Unique (%)87.0%

Sample

1st row두마면 제1산단로 38
2nd row금암로 137-4, 1층
3rd row두마면 제1산단로 25-16
4th row두마면 제1산단로 40-21
5th row금암로 28 (금암동)
ValueCountFrequency (%)
두마면 36
22.4%
제1산단로 19
 
11.8%
엄사면 11
 
6.8%
입암길 8
 
5.0%
도곡로 5
 
3.1%
금암동 4
 
2.5%
입암리 4
 
2.5%
125 3
 
1.9%
28 3
 
1.9%
금암로 3
 
1.9%
Other values (58) 65
40.4%
2024-01-10T06:52:58.452626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
110
 
15.4%
47
 
6.6%
1 47
 
6.6%
2 38
 
5.3%
36
 
5.0%
36
 
5.0%
36
 
5.0%
- 30
 
4.2%
3 21
 
2.9%
20
 
2.8%
Other values (44) 294
41.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 359
50.2%
Decimal Number 204
28.5%
Space Separator 110
 
15.4%
Dash Punctuation 30
 
4.2%
Other Punctuation 4
 
0.6%
Close Punctuation 4
 
0.6%
Open Punctuation 4
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
47
13.1%
36
 
10.0%
36
 
10.0%
36
 
10.0%
20
 
5.6%
19
 
5.3%
19
 
5.3%
19
 
5.3%
13
 
3.6%
12
 
3.3%
Other values (29) 102
28.4%
Decimal Number
ValueCountFrequency (%)
1 47
23.0%
2 38
18.6%
3 21
10.3%
5 19
9.3%
4 18
 
8.8%
6 17
 
8.3%
7 16
 
7.8%
8 11
 
5.4%
0 10
 
4.9%
9 7
 
3.4%
Space Separator
ValueCountFrequency (%)
110
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 30
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 359
50.2%
Common 356
49.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
47
13.1%
36
 
10.0%
36
 
10.0%
36
 
10.0%
20
 
5.6%
19
 
5.3%
19
 
5.3%
19
 
5.3%
13
 
3.6%
12
 
3.3%
Other values (29) 102
28.4%
Common
ValueCountFrequency (%)
110
30.9%
1 47
13.2%
2 38
 
10.7%
- 30
 
8.4%
3 21
 
5.9%
5 19
 
5.3%
4 18
 
5.1%
6 17
 
4.8%
7 16
 
4.5%
8 11
 
3.1%
Other values (5) 29
 
8.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 359
50.2%
ASCII 356
49.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
110
30.9%
1 47
13.2%
2 38
 
10.7%
- 30
 
8.4%
3 21
 
5.9%
5 19
 
5.3%
4 18
 
5.1%
6 17
 
4.8%
7 16
 
4.5%
8 11
 
3.1%
Other values (5) 29
 
8.1%
Hangul
ValueCountFrequency (%)
47
13.1%
36
 
10.0%
36
 
10.0%
36
 
10.0%
20
 
5.6%
19
 
5.3%
19
 
5.3%
19
 
5.3%
13
 
3.6%
12
 
3.3%
Other values (29) 102
28.4%

주생산품
Text

UNIQUE 

Distinct54
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size564.0 B
2024-01-10T06:52:58.639378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length10
Mean length7.0555556
Min length2

Characters and Unicode

Total characters381
Distinct characters156
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique54 ?
Unique (%)100.0%

Sample

1st row제습보관함
2nd row그레이스 혼합장
3rd row공기조화장치
4th row아기이유식,과자,두유
5th row인쇄편집물, 광고물
ValueCountFrequency (%)
방송장비 2
 
2.5%
두부 2
 
2.5%
공기조화장치 2
 
2.5%
물엿 2
 
2.5%
전분제품 1
 
1.2%
보쌈 1
 
1.2%
족발 1
 
1.2%
김치 1
 
1.2%
쌀조청 1
 
1.2%
강판 1
 
1.2%
Other values (66) 66
82.5%
2024-01-10T06:52:58.936048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28
 
7.3%
, 21
 
5.5%
13
 
3.4%
11
 
2.9%
9
 
2.4%
9
 
2.4%
9
 
2.4%
8
 
2.1%
7
 
1.8%
7
 
1.8%
Other values (146) 259
68.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 324
85.0%
Space Separator 28
 
7.3%
Other Punctuation 21
 
5.5%
Uppercase Letter 4
 
1.0%
Close Punctuation 2
 
0.5%
Open Punctuation 2
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
4.0%
11
 
3.4%
9
 
2.8%
9
 
2.8%
9
 
2.8%
8
 
2.5%
7
 
2.2%
7
 
2.2%
6
 
1.9%
6
 
1.9%
Other values (139) 239
73.8%
Uppercase Letter
ValueCountFrequency (%)
C 2
50.0%
V 1
25.0%
T 1
25.0%
Space Separator
ValueCountFrequency (%)
28
100.0%
Other Punctuation
ValueCountFrequency (%)
, 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 324
85.0%
Common 53
 
13.9%
Latin 4
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13
 
4.0%
11
 
3.4%
9
 
2.8%
9
 
2.8%
9
 
2.8%
8
 
2.5%
7
 
2.2%
7
 
2.2%
6
 
1.9%
6
 
1.9%
Other values (139) 239
73.8%
Common
ValueCountFrequency (%)
28
52.8%
, 21
39.6%
) 2
 
3.8%
( 2
 
3.8%
Latin
ValueCountFrequency (%)
C 2
50.0%
V 1
25.0%
T 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 324
85.0%
ASCII 57
 
15.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
28
49.1%
, 21
36.8%
) 2
 
3.5%
( 2
 
3.5%
C 2
 
3.5%
V 1
 
1.8%
T 1
 
1.8%
Hangul
ValueCountFrequency (%)
13
 
4.0%
11
 
3.4%
9
 
2.8%
9
 
2.8%
9
 
2.8%
8
 
2.5%
7
 
2.2%
7
 
2.2%
6
 
1.9%
6
 
1.9%
Other values (139) 239
73.8%

공장등록
Boolean

IMBALANCE 

Distinct2
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Memory size186.0 B
True
50 
False
 
4
ValueCountFrequency (%)
True 50
92.6%
False 4
 
7.4%
2024-01-10T06:52:59.023185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T06:52:59.069883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기업체명소재지주생산품공장등록
기업체명1.0001.0001.0001.000
소재지1.0001.0001.0000.000
주생산품1.0001.0001.0001.000
공장등록1.0000.0001.0001.000

Missing values

2024-01-10T06:52:57.466891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T06:52:57.534979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기업체명소재지주생산품공장등록
0(주)굿스굿두마면 제1산단로 38제습보관함Y
1(주)그레이스식품금암로 137-4, 1층그레이스 혼합장Y
2(주)금아공조두마면 제1산단로 25-16공기조화장치Y
3(주)내담에프앤비두마면 제1산단로 40-21아기이유식,과자,두유Y
4우리디자인금암로 28 (금암동)인쇄편집물, 광고물N
5(주)마메든도어두마면 제1산단로 40-7방화문Y
6(주)마이크로닉계룡대로 245 (금암동)방송장비Y
7(주)메덱스더블유두마면 입암리 79-1번지플라스틱 용기Y
8(주)비타바이오두마면 제1산단로 40-33보조사료Y
9(주)세진엄사면 도곡로 128-25무대장치, 자동제어반Y
기업체명소재지주생산품공장등록
44신화중기정비공업사두마면 왕대로 105-5토목공사기계장비제조Y
45영순아그로두마면 왕대리 165번지복합비료제조Y
46오케이퓨처(주)두마면 제1산단로 25-57건설장비부품Y
47우리겨레협동조합엄사면 사랑재길20옻칠생활용품Y
48유로농자재엄사면 광석향한길 80농자재(물꼬조절기)Y
49주안레미콘주식회사엄사면 계백로 2906-27레미콘Y
50케이웰텍두마면 입암길 68수전해 평가장치Y
51팔천식품두마면 입암길 78육가공, 순대Y
52레드락주식회사서금암로 53홍콩밀크티N
53명성테크두마면 제1산단로 26-31금속가공N