Overview

Dataset statistics

Number of variables5
Number of observations35
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 KiB
Average record size in memory43.8 B

Variable types

Categorical1
Text4

Dataset

Description화성시의 공장등록현황입니다. 회사명, 단지명, 등록일, 전화번호, 종업원수, 생산품, 공장대표주소, 업종번호, 업종명으로 구성되어있습니다.
Author경기도 화성시
URLhttps://www.data.go.kr/data/15093464/fileData.do

Alerts

기관명 has unique valuesUnique
소재지도로명주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 01:50:41.107259
Analysis finished2023-12-12 01:50:41.829477
Duration0.72 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기관종류
Categorical

Distinct2
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Memory size412.0 B
사회적기업
25 
마을기업
10 

Length

Max length5
Median length5
Mean length4.7142857
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사회적기업
2nd row사회적기업
3rd row사회적기업
4th row사회적기업
5th row사회적기업

Common Values

ValueCountFrequency (%)
사회적기업 25
71.4%
마을기업 10
 
28.6%

Length

2023-12-12T10:50:41.899108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:50:42.033738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사회적기업 25
71.4%
마을기업 10
 
28.6%

기관명
Text

UNIQUE 

Distinct35
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-12-12T10:50:42.262222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length12
Mean length9.3428571
Min length3

Characters and Unicode

Total characters327
Distinct characters132
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)100.0%

Sample

1st row㈜컴윈
2nd row사회복지법인천주교수원교구사회복지회행복한일터
3rd row㈜아이티그린
4th row주식회사 에이치앤에스두리반
5th row주식회사 동부케어
ValueCountFrequency (%)
주식회사 6
 
12.0%
협동조합 3
 
6.0%
㈜컴윈 1
 
2.0%
화성국민체육센터점 1
 
2.0%
풀향기영농조합 1
 
2.0%
㈜크린씨티화성 1
 
2.0%
㈜희망세상 1
 
2.0%
키움 1
 
2.0%
해피멘토 1
 
2.0%
㈜경영 1
 
2.0%
Other values (33) 33
66.0%
2023-12-12T10:50:42.730103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15
 
4.6%
12
 
3.7%
12
 
3.7%
12
 
3.7%
11
 
3.4%
10
 
3.1%
9
 
2.8%
9
 
2.8%
8
 
2.4%
8
 
2.4%
Other values (122) 221
67.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 293
89.6%
Space Separator 15
 
4.6%
Other Symbol 8
 
2.4%
Open Punctuation 4
 
1.2%
Close Punctuation 4
 
1.2%
Decimal Number 3
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
4.1%
12
 
4.1%
12
 
4.1%
11
 
3.8%
10
 
3.4%
9
 
3.1%
9
 
3.1%
8
 
2.7%
7
 
2.4%
6
 
2.0%
Other values (116) 197
67.2%
Decimal Number
ValueCountFrequency (%)
1 2
66.7%
9 1
33.3%
Space Separator
ValueCountFrequency (%)
15
100.0%
Other Symbol
ValueCountFrequency (%)
8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 301
92.0%
Common 26
 
8.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
4.0%
12
 
4.0%
12
 
4.0%
11
 
3.7%
10
 
3.3%
9
 
3.0%
9
 
3.0%
8
 
2.7%
8
 
2.7%
7
 
2.3%
Other values (117) 203
67.4%
Common
ValueCountFrequency (%)
15
57.7%
( 4
 
15.4%
) 4
 
15.4%
1 2
 
7.7%
9 1
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 293
89.6%
ASCII 26
 
8.0%
None 8
 
2.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
15
57.7%
( 4
 
15.4%
) 4
 
15.4%
1 2
 
7.7%
9 1
 
3.8%
Hangul
ValueCountFrequency (%)
12
 
4.1%
12
 
4.1%
12
 
4.1%
11
 
3.8%
10
 
3.4%
9
 
3.1%
9
 
3.1%
8
 
2.7%
7
 
2.4%
6
 
2.0%
Other values (116) 197
67.2%
None
ValueCountFrequency (%)
8
100.0%
Distinct35
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-12-12T10:50:43.039811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length25
Mean length22.914286
Min length17

Characters and Unicode

Total characters802
Distinct characters112
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)100.0%

Sample

1st row경기도 화성시 장안면 석포로74번길 23
2nd row경기도 화성시 현대기아로 506(무송동)
3rd row경기도 화성시 서신면 제부로722번길 18
4th row경기도 화성시 팔탄면 푸른들판로 622
5th row경기도 화성시 병점중앙로 155(진안동,5층)
ValueCountFrequency (%)
경기도 35
20.0%
화성시 35
20.0%
봉담읍 6
 
3.4%
팔탄면 5
 
2.9%
송산면 3
 
1.7%
18 2
 
1.1%
향남읍 2
 
1.1%
하가등안길 2
 
1.1%
양감면 2
 
1.1%
병점중앙로 2
 
1.1%
Other values (76) 81
46.3%
2023-12-12T10:50:43.523281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
140
 
17.5%
38
 
4.7%
38
 
4.7%
36
 
4.5%
36
 
4.5%
36
 
4.5%
36
 
4.5%
1 31
 
3.9%
22
 
2.7%
0 21
 
2.6%
Other values (102) 368
45.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 483
60.2%
Decimal Number 141
 
17.6%
Space Separator 140
 
17.5%
Dash Punctuation 10
 
1.2%
Other Punctuation 10
 
1.2%
Open Punctuation 8
 
1.0%
Close Punctuation 8
 
1.0%
Uppercase Letter 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
38
 
7.9%
38
 
7.9%
36
 
7.5%
36
 
7.5%
36
 
7.5%
36
 
7.5%
22
 
4.6%
21
 
4.3%
15
 
3.1%
13
 
2.7%
Other values (84) 192
39.8%
Decimal Number
ValueCountFrequency (%)
1 31
22.0%
0 21
14.9%
5 19
13.5%
3 14
9.9%
2 14
9.9%
4 13
9.2%
8 10
 
7.1%
6 8
 
5.7%
9 6
 
4.3%
7 5
 
3.5%
Other Punctuation
ValueCountFrequency (%)
, 9
90.0%
. 1
 
10.0%
Uppercase Letter
ValueCountFrequency (%)
C 1
50.0%
A 1
50.0%
Space Separator
ValueCountFrequency (%)
140
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 483
60.2%
Common 317
39.5%
Latin 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
38
 
7.9%
38
 
7.9%
36
 
7.5%
36
 
7.5%
36
 
7.5%
36
 
7.5%
22
 
4.6%
21
 
4.3%
15
 
3.1%
13
 
2.7%
Other values (84) 192
39.8%
Common
ValueCountFrequency (%)
140
44.2%
1 31
 
9.8%
0 21
 
6.6%
5 19
 
6.0%
3 14
 
4.4%
2 14
 
4.4%
4 13
 
4.1%
8 10
 
3.2%
- 10
 
3.2%
, 9
 
2.8%
Other values (6) 36
 
11.4%
Latin
ValueCountFrequency (%)
C 1
50.0%
A 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 483
60.2%
ASCII 319
39.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
140
43.9%
1 31
 
9.7%
0 21
 
6.6%
5 19
 
6.0%
3 14
 
4.4%
2 14
 
4.4%
4 13
 
4.1%
8 10
 
3.1%
- 10
 
3.1%
, 9
 
2.8%
Other values (8) 38
 
11.9%
Hangul
ValueCountFrequency (%)
38
 
7.9%
38
 
7.9%
36
 
7.5%
36
 
7.5%
36
 
7.5%
36
 
7.5%
22
 
4.6%
21
 
4.3%
15
 
3.1%
13
 
2.7%
Other values (84) 192
39.8%
Distinct34
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-12-12T10:50:43.814391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length22
Mean length20.228571
Min length15

Characters and Unicode

Total characters708
Distinct characters67
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)94.3%

Sample

1st row경기도 화성시 장안면 석포리 677-10
2nd row경기도 화성시 남양읍 무송리 181-1
3rd row경기도 화성시 서신면광평리 246-16
4th row경기도 화성시 팔탄면 구장리 146-4
5th row경기도 화성시 진안동 514-2
ValueCountFrequency (%)
경기도 35
21.3%
화성시 35
21.3%
봉담읍 6
 
3.7%
팔탄면 5
 
3.0%
진안동 3
 
1.8%
병점동 3
 
1.8%
하가등리 3
 
1.8%
송산면 3
 
1.8%
가재리 3
 
1.8%
사창리 2
 
1.2%
Other values (61) 66
40.2%
2023-12-12T10:50:44.228481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
129
18.2%
38
 
5.4%
35
 
4.9%
35
 
4.9%
35
 
4.9%
35
 
4.9%
35
 
4.9%
1 31
 
4.4%
- 28
 
4.0%
25
 
3.5%
Other values (57) 282
39.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 410
57.9%
Decimal Number 141
 
19.9%
Space Separator 129
 
18.2%
Dash Punctuation 28
 
4.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
38
 
9.3%
35
 
8.5%
35
 
8.5%
35
 
8.5%
35
 
8.5%
35
 
8.5%
25
 
6.1%
15
 
3.7%
12
 
2.9%
11
 
2.7%
Other values (45) 134
32.7%
Decimal Number
ValueCountFrequency (%)
1 31
22.0%
4 19
13.5%
8 16
11.3%
5 14
9.9%
2 13
9.2%
6 12
 
8.5%
9 10
 
7.1%
3 10
 
7.1%
7 9
 
6.4%
0 7
 
5.0%
Space Separator
ValueCountFrequency (%)
129
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 410
57.9%
Common 298
42.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
38
 
9.3%
35
 
8.5%
35
 
8.5%
35
 
8.5%
35
 
8.5%
35
 
8.5%
25
 
6.1%
15
 
3.7%
12
 
2.9%
11
 
2.7%
Other values (45) 134
32.7%
Common
ValueCountFrequency (%)
129
43.3%
1 31
 
10.4%
- 28
 
9.4%
4 19
 
6.4%
8 16
 
5.4%
5 14
 
4.7%
2 13
 
4.4%
6 12
 
4.0%
9 10
 
3.4%
3 10
 
3.4%
Other values (2) 16
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 410
57.9%
ASCII 298
42.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
129
43.3%
1 31
 
10.4%
- 28
 
9.4%
4 19
 
6.4%
8 16
 
5.4%
5 14
 
4.7%
2 13
 
4.4%
6 12
 
4.0%
9 10
 
3.4%
3 10
 
3.4%
Other values (2) 16
 
5.4%
Hangul
ValueCountFrequency (%)
38
 
9.3%
35
 
8.5%
35
 
8.5%
35
 
8.5%
35
 
8.5%
35
 
8.5%
25
 
6.1%
15
 
3.7%
12
 
2.9%
11
 
2.7%
Other values (45) 134
32.7%
Distinct34
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-12-12T10:50:44.465343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length3
Mean length3.2571429
Min length3

Characters and Unicode

Total characters114
Distinct characters60
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)94.3%

Sample

1st row정연철
2nd row이용기
3rd row박찬일
4th row김성현
5th row진락천
ValueCountFrequency (%)
박명분 2
 
5.6%
최수자 1
 
2.8%
박승원 1
 
2.8%
이경숙 1
 
2.8%
권순국 1
 
2.8%
박옥자 1
 
2.8%
장희석 1
 
2.8%
이용기 1
 
2.8%
이주현 1
 
2.8%
임창미 1
 
2.8%
Other values (25) 25
69.4%
2023-12-12T10:50:44.835470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8
 
7.0%
6
 
5.3%
6
 
5.3%
5
 
4.4%
4
 
3.5%
4
 
3.5%
3
 
2.6%
3
 
2.6%
3
 
2.6%
2
 
1.8%
Other values (50) 70
61.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 111
97.4%
Other Punctuation 2
 
1.8%
Space Separator 1
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
7.2%
6
 
5.4%
6
 
5.4%
5
 
4.5%
4
 
3.6%
4
 
3.6%
3
 
2.7%
3
 
2.7%
3
 
2.7%
2
 
1.8%
Other values (48) 67
60.4%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 111
97.4%
Common 3
 
2.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
 
7.2%
6
 
5.4%
6
 
5.4%
5
 
4.5%
4
 
3.6%
4
 
3.6%
3
 
2.7%
3
 
2.7%
3
 
2.7%
2
 
1.8%
Other values (48) 67
60.4%
Common
ValueCountFrequency (%)
, 2
66.7%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 111
97.4%
ASCII 3
 
2.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8
 
7.2%
6
 
5.4%
6
 
5.4%
5
 
4.5%
4
 
3.6%
4
 
3.6%
3
 
2.7%
3
 
2.7%
3
 
2.7%
2
 
1.8%
Other values (48) 67
60.4%
ASCII
ValueCountFrequency (%)
, 2
66.7%
1
33.3%

Correlations

2023-12-12T10:50:44.942875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기관종류기관명소재지도로명주소소재지지번주소대표자명
기관종류1.0001.0001.0001.0001.000
기관명1.0001.0001.0001.0001.000
소재지도로명주소1.0001.0001.0001.0001.000
소재지지번주소1.0001.0001.0001.0000.993
대표자명1.0001.0001.0000.9931.000

Missing values

2023-12-12T10:50:41.649393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:50:41.789162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기관종류기관명소재지도로명주소소재지지번주소대표자명
0사회적기업㈜컴윈경기도 화성시 장안면 석포로74번길 23경기도 화성시 장안면 석포리 677-10정연철
1사회적기업사회복지법인천주교수원교구사회복지회행복한일터경기도 화성시 현대기아로 506(무송동)경기도 화성시 남양읍 무송리 181-1이용기
2사회적기업㈜아이티그린경기도 화성시 서신면 제부로722번길 18경기도 화성시 서신면광평리 246-16박찬일
3사회적기업주식회사 에이치앤에스두리반경기도 화성시 팔탄면 푸른들판로 622경기도 화성시 팔탄면 구장리 146-4김성현
4사회적기업주식회사 동부케어경기도 화성시 병점중앙로 155(진안동,5층)경기도 화성시 진안동 514-2진락천
5사회적기업주식회사 나눔피엔씨경기도 화성시 양감면 초록로 446경기도 화성시 양감면 사창리 491이영민
6사회적기업㈜케어119돌봄센터경기도 화성시 팔탄면 삼천병마로 518-10경기도 화성시 팔탄면 가재리 304이수영, 양경옥
7사회적기업세종환경주식회사경기도 화성시 반월남길 105-25(반월동)경기도 화성시 반월동 616이현식
8사회적기업사단법인 행복플러스경기도 화성시 효행로 1056, 1003호(병점동, 탑플라자)경기도 화성시 병점동 844-2권연정
9사회적기업문화발전소 열터경기도 화성시 병점로 8 .A (병점동,용우빌딩4층)경기도 화성시 병점동 347-10김정오
기관종류기관명소재지도로명주소소재지지번주소대표자명
25마을기업화성시니어클럽(노노카페 화성국민체육센터점)경기도 화성시 봉담읍 동화길 18경기도 화성시 봉담읍 동화리 406-1번지남장숙
26마을기업살림과나눔영농조합경기도 화성시 우정읍 두레길 13경기도 화성시 우정읍 화산리 701-18번지최수자
27마을기업풀향기영농조합경기도 화성시 봉담읍 하가등길 82경기도 화성시 봉담읍 하가등리 258번지박명분
28마을기업영농조합법인 공룡마을경기도 화성시 송산면 공룡로 484경기도 화성시 송산면 고정리 156-4번지장희석
29마을기업제부도오리골협동조합경기도 화성시 서신면 해양공단로100번길 35-8경기도 화성시 서신면 장외리 138-3번지박옥자
30마을기업햇살담은연협동조합경기도 화성시 봉담읍 샘마을1길 15경기도 화성시 봉담읍 상리 22-54번지권순국
31마을기업화성시발효식품 협동조합경기도 화성시 향남읍 상신초교길 52경기도 화성시 향남읍 상신리 874번지이경숙
32마을기업경기유통형마을기업경기도 화성시 봉담읍 하가등안길 60-50경기도 화성시 봉담읍 하가등리 191-5번지박명분
33마을기업생태예술한옥마을영농조합법인경기도 화성시 송산면 개매기길 90경기도 화성시 송산면 고포리 385-2번지박승원
34마을기업화성열매협동조합경기도 화성시 팔탄면 모시울길 19경기도 화성시 팔탄면 가재리 516-4번지김성한