Overview

Dataset statistics

Number of variables: 5
Number of observations: 93
Missing cells: 3
Missing cells (%): 0.6%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.8 KiB
Average record size in memory: 41.4 B
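
The percentages and sizes above follow from the raw counts. A minimal sketch, with variable names chosen here purely for illustration, reproduces the missing-cell share and checks that the average record size is consistent with the rounded total:

```python
# Figures copied from the dataset statistics above.
n_variables = 5
n_observations = 93
missing_cells = 3
avg_record_bytes = 41.4

# Every observation contributes one cell per variable.
total_cells = n_variables * n_observations

# Share of cells that are missing, as a percentage.
missing_pct = 100 * missing_cells / total_cells

# Average record size times the row count approximates the total size.
approx_total_kib = avg_record_bytes * n_observations / 1024

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"approximate total size: {approx_total_kib:.1f} KiB")
```

The small gap between 41.4 B per record and the reported 3.8 KiB total comes from rounding in the displayed figures.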

Variable types

Text: 5

Dataset

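Description: Data on the businesses located in each eup and myeon (town and township) of Hapcheon-gun, Gyeongsangnam-do, including company name, contact number, products, and factory address.
Author: Hapcheon-gun, Gyeongsangnam-do
URL: https://www.data.go.kr/data/3069203/fileData.do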

Alerts

전화번호 (phone number) has 3 (3.2%) missing values [Missing]
회사명 (company name) has unique values [Unique]
대표자명 (representative name) has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 22:13:46.502218
Analysis finished: 2023-12-12 22:13:47.550731
Duration: 1.05 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

회사명 (company name)
Text

UNIQUE 

Distinct: 93
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 876.0 B

Length

Max length: 18
Median length: 15
Mean length: 7.172043
Min length: 2

Characters and Unicode

Total characters: 667
Distinct characters: 171
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
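
As a rough illustration of how such per-character properties can be tallied, the sketch below uses Python's standard `unicodedata` module to count Unicode general categories over the first five company names from this report. It is not the profiling tool's own code; the helper name `category_counts` is hypothetical, and the standard library exposes only the general category (for example `Lo` for Other Letter, `Ps` for Open Punctuation), not the script or block.

```python
import unicodedata
from collections import Counter

# First five company names, copied from the Sample section of this report.
names = [
    "(주)가야아스콘",
    "(주)가회청목주가",
    "(주)구포국수",
    "(주)글로벌오토클래드 합천공장",
    "(주)담양대나무",
]


def category_counts(values):
    """Count Unicode general categories over every character in values."""
    return Counter(
        unicodedata.category(ch) for value in values for ch in value
    )


counts = category_counts(names)
for category, count in counts.most_common():
    # Two-letter codes: Lo = Other Letter, Ps/Pe = Open/Close Punctuation,
    # Zs = Space Separator.
    print(category, count)
```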

Unique

Unique: 93
Unique (%): 100.0%

Sample

1st row: (주)가야아스콘
2nd row: (주)가회청목주가
3rd row: (주)구포국수
4th row: (주)글로벌오토클래드 합천공장
5th row: (주)담양대나무
Value | Count | Frequency (%)
주식회사 | 5 | 4.7%
농업회사법인 | 2 | 1.9%
주)가야아스콘 | 1 | 0.9%
제일석재 | 1 | 0.9%
전진바이오팜(주 | 1 | 0.9%
장원유니크 | 1 | 0.9%
일흥도자기 | 1 | 0.9%
유한회사알파 | 1 | 0.9%
유신도자기 | 1 | 0.9%
유림산업 | 1 | 0.9%
Other values (91) | 91 | 85.8%

Most occurring characters

ValueCountFrequency (%)
42
 
6.3%
( 33
 
4.9%
) 33
 
4.9%
26
 
3.9%
20
 
3.0%
17
 
2.5%
16
 
2.4%
14
 
2.1%
14
 
2.1%
13
 
1.9%
Other values (161) 439
65.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 586 | 87.9%
Open Punctuation | 33 | 4.9%
Close Punctuation | 33 | 4.9%
Space Separator | 13 | 1.9%
Decimal Number | 2 | 0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
7.2%
26
 
4.4%
20
 
3.4%
17
 
2.9%
16
 
2.7%
14
 
2.4%
14
 
2.4%
12
 
2.0%
12
 
2.0%
11
 
1.9%
Other values (156) 402
68.6%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
1 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 33
100.0%
Close Punctuation
ValueCountFrequency (%)
) 33
100.0%
Space Separator
ValueCountFrequency (%)
13
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 586 | 87.9%
Common | 81 | 12.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
7.2%
26
 
4.4%
20
 
3.4%
17
 
2.9%
16
 
2.7%
14
 
2.4%
14
 
2.4%
12
 
2.0%
12
 
2.0%
11
 
1.9%
Other values (156) 402
68.6%
Common
ValueCountFrequency (%)
( 33
40.7%
) 33
40.7%
13
 
16.0%
2 1
 
1.2%
1 1
 
1.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 586 | 87.9%
ASCII | 81 | 12.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
42
 
7.2%
26
 
4.4%
20
 
3.4%
17
 
2.9%
16
 
2.7%
14
 
2.4%
14
 
2.4%
12
 
2.0%
12
 
2.0%
11
 
1.9%
Other values (156) 402
68.6%
ASCII
ValueCountFrequency (%)
( 33
40.7%
) 33
40.7%
13
 
16.0%
2 1
 
1.2%
1 1
 
1.2%

대표자명 (representative name)
Text

UNIQUE 

Distinct: 93
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 876.0 B

Length

Max length: 7
Median length: 3
Mean length: 3.1612903
Min length: 2

Characters and Unicode

Total characters: 294
Distinct characters: 109
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 93
Unique (%): 100.0%

Sample

1st row: 강동훈
2nd row: 이병웅
3rd row: 허영민
4th row: 이쌍근
5th row: 박염미
Value | Count | Frequency (%)
강동훈 | 1 | 1.1%
정광환 | 1 | 1.1%
조무강 | 1 | 1.1%
전영효 | 1 | 1.1%
이태훈 | 1 | 1.1%
이치우 | 1 | 1.1%
변종덕 | 1 | 1.1%
김임선 | 1 | 1.1%
손은석 | 1 | 1.1%
최귀순 | 1 | 1.1%
Other values (83) | 83 | 89.2%

Most occurring characters

ValueCountFrequency (%)
14
 
4.8%
12
 
4.1%
11
 
3.7%
9
 
3.1%
8
 
2.7%
7
 
2.4%
7
 
2.4%
7
 
2.4%
7
 
2.4%
6
 
2.0%
Other values (99) 206
70.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 290 | 98.6%
Math Symbol | 4 | 1.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
4.8%
12
 
4.1%
11
 
3.8%
9
 
3.1%
8
 
2.8%
7
 
2.4%
7
 
2.4%
7
 
2.4%
7
 
2.4%
6
 
2.1%
Other values (98) 202
69.7%
Math Symbol
ValueCountFrequency (%)
+ 4
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 290 | 98.6%
Common | 4 | 1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
4.8%
12
 
4.1%
11
 
3.8%
9
 
3.1%
8
 
2.8%
7
 
2.4%
7
 
2.4%
7
 
2.4%
7
 
2.4%
6
 
2.1%
Other values (98) 202
69.7%
Common
ValueCountFrequency (%)
+ 4
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 290 | 98.6%
ASCII | 4 | 1.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
14
 
4.8%
12
 
4.1%
11
 
3.8%
9
 
3.1%
8
 
2.8%
7
 
2.4%
7
 
2.4%
7
 
2.4%
7
 
2.4%
6
 
2.1%
Other values (98) 202
69.7%
ASCII
ValueCountFrequency (%)
+ 4
100.0%

공장대표 주소 (main factory address)
Text

Distinct: 91
Distinct (%): 97.8%
Missing: 0
Missing (%): 0.0%
Memory size: 876.0 B

Length

Max length: 26
Median length: 24
Mean length: 21.301075
Min length: 19

Characters and Unicode

Total characters: 1981
Distinct characters: 103
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 89
Unique (%): 95.7%

Sample

1st row: 경상남도 합천군 묘산면 영서로 1651-58
2nd row: 경상남도 합천군 가회면 덕촌길 167-1
3rd row: 경상남도 합천군 가회면 황매산로 183
4th row: 경상남도 합천군 합천읍 계림1길 45
5th row: 경상남도 합천군 삼가면 신평3길 11
Value | Count | Frequency (%)
경상남도 | 93 | 20.0%
합천군 | 93 | 20.0%
가야면 | 22 | 4.7%
야로면 | 9 | 1.9%
동부로 | 8 | 1.7%
합천읍 | 8 | 1.7%
율곡면 | 8 | 1.7%
묘산면 | 8 | 1.7%
매화산로 | 7 | 1.5%
청덕면 | 6 | 1.3%
Other values (143) | 203 | 43.7%

Most occurring characters

ValueCountFrequency (%)
373
18.8%
104
 
5.2%
102
 
5.1%
95
 
4.8%
93
 
4.7%
93
 
4.7%
93
 
4.7%
93
 
4.7%
85
 
4.3%
71
 
3.6%
Other values (93) 779
39.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1255 | 63.4%
Space Separator | 373 | 18.8%
Decimal Number | 324 | 16.4%
Dash Punctuation | 29 | 1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
104
 
8.3%
102
 
8.1%
95
 
7.6%
93
 
7.4%
93
 
7.4%
93
 
7.4%
93
 
7.4%
85
 
6.8%
71
 
5.7%
47
 
3.7%
Other values (81) 379
30.2%
Decimal Number
Value | Count | Frequency (%)
1 | 66 | 20.4%
2 | 51 | 15.7%
3 | 31 | 9.6%
5 | 30 | 9.3%
8 | 28 | 8.6%
4 | 26 | 8.0%
6 | 26 | 8.0%
9 | 23 | 7.1%
7 | 22 | 6.8%
0 | 21 | 6.5%
Space Separator
ValueCountFrequency (%)
373
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 29
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1255 | 63.4%
Common | 726 | 36.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
104
 
8.3%
102
 
8.1%
95
 
7.6%
93
 
7.4%
93
 
7.4%
93
 
7.4%
93
 
7.4%
85
 
6.8%
71
 
5.7%
47
 
3.7%
Other values (81) 379
30.2%
Common
ValueCountFrequency (%)
373
51.4%
1 66
 
9.1%
2 51
 
7.0%
3 31
 
4.3%
5 30
 
4.1%
- 29
 
4.0%
8 28
 
3.9%
4 26
 
3.6%
6 26
 
3.6%
9 23
 
3.2%
Other values (2) 43
 
5.9%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1255 | 63.4%
ASCII | 726 | 36.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
373
51.4%
1 66
 
9.1%
2 51
 
7.0%
3 31
 
4.3%
5 30
 
4.1%
- 29
 
4.0%
8 28
 
3.9%
4 26
 
3.6%
6 26
 
3.6%
9 23
 
3.2%
Other values (2) 43
 
5.9%
Hangul
ValueCountFrequency (%)
104
 
8.3%
102
 
8.1%
95
 
7.6%
93
 
7.4%
93
 
7.4%
93
 
7.4%
93
 
7.4%
85
 
6.8%
71
 
5.7%
47
 
3.7%
Other values (81) 379
30.2%

전화번호 (phone number)
Text

MISSING 

Distinct: 89
Distinct (%): 98.9%
Missing: 3
Missing (%): 3.2%
Memory size: 876.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 1080
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 88
Unique (%): 97.8%

Sample

1st row: 055-932-5861
2nd row: 055-932-9130
3rd row: 055-933-1300
4th row: 055-931-4360
5th row: 055-932-9888
Value | Count | Frequency (%)
055-933-1300 | 2 | 2.2%
055-931-0455 | 1 | 1.1%
055-932-5861 | 1 | 1.1%
055-932-2498 | 1 | 1.1%
055-931-1494 | 1 | 1.1%
055-933-1454 | 1 | 1.1%
055-933-7196 | 1 | 1.1%
055-934-2882 | 1 | 1.1%
055-933-2888 | 1 | 1.1%
055-933-9330 | 1 | 1.1%
Other values (79) | 79 | 87.8%
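
Every non-missing phone number in this column is exactly 12 characters long, and one value (055-933-1300) occurs twice. A minimal sketch of how such checks might be scripted follows. The list is a small hand-assembled sample built from values shown in this report, with `None` standing in for a missing entry, and the regular expression is an assumed pattern (3 digits, 3 digits, 4 digits) inferred from the values above rather than an official format.

```python
import re
from collections import Counter

# Hand-assembled sample: values copied from this report, plus None as a
# stand-in for a missing entry. This is not the full column.
phones = [
    "055-932-5861",
    "055-932-9130",
    "055-933-1300",
    "055-931-4360",
    "055-932-9888",
    "055-933-1300",
    None,
]

# Assumed pattern: 3-digit area code, 3-digit exchange, 4-digit line number.
PHONE_RE = re.compile(r"\d{3}-\d{3}-\d{4}")

present = [p for p in phones if p is not None]
missing = len(phones) - len(present)
malformed = [p for p in present if not PHONE_RE.fullmatch(p)]
duplicates = [p for p, n in Counter(present).items() if n > 1]

print("missing:", missing)
print("malformed:", malformed)
print("duplicates:", duplicates)
```

The same reasoning explains the character totals reported for this column: 90 non-missing values of 12 characters each give 1080 characters, of which 180 are dashes.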

Most occurring characters

Value | Count | Frequency (%)
5 | 209 | 19.4%
- | 180 | 16.7%
0 | 154 | 14.3%
3 | 144 | 13.3%
9 | 124 | 11.5%
1 | 70 | 6.5%
2 | 61 | 5.6%
8 | 40 | 3.7%
4 | 37 | 3.4%
7 | 35 | 3.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 900 | 83.3%
Dash Punctuation | 180 | 16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 209
23.2%
0 154
17.1%
3 144
16.0%
9 124
13.8%
1 70
 
7.8%
2 61
 
6.8%
8 40
 
4.4%
4 37
 
4.1%
7 35
 
3.9%
6 26
 
2.9%
Dash Punctuation
ValueCountFrequency (%)
- 180
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1080 | 100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 209
19.4%
- 180
16.7%
0 154
14.3%
3 144
13.3%
9 124
11.5%
1 70
 
6.5%
2 61
 
5.6%
8 40
 
3.7%
4 37
 
3.4%
7 35
 
3.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1080 | 100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 209
19.4%
- 180
16.7%
0 154
14.3%
3 144
13.3%
9 124
11.5%
1 70
 
6.5%
2 61
 
5.6%
8 40
 
3.7%
4 37
 
3.4%
7 35
 
3.2%

생산품 (products)
Text

Distinct: 82
Distinct (%): 88.2%
Missing: 0
Missing (%): 0.0%
Memory size: 876.0 B

Length

Max length: 19
Median length: 15
Mean length: 6.4408602
Min length: 1

Characters and Unicode

Total characters: 599
Distinct characters: 170
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 79
Unique (%): 84.9%

Sample

1st row: 아스팔트+콘크리트
2nd row: 주류
3rd row: 건면(국수)
4th row: 이종금속+판재+관재
5th row: 대나무제품
Value | Count | Frequency (%)
생활도자기 | 7 | 7.0%
도자기 | 6 | 6.0%
(label missing in source) | 2 | 2.0%
광케이블용 | 1 | 1.0%
아스팔트+콘크리트 | 1 | 1.0%
pe망사 | 1 | 1.0%
끈+로프 | 1 | 1.0%
몰타르 | 1 | 1.0%
장식용도자기 | 1 | 1.0%
포장용지+상자 | 1 | 1.0%
Other values (78) | 78 | 78.0%

Most occurring characters

ValueCountFrequency (%)
+ 50
 
8.3%
31
 
5.2%
23
 
3.8%
19
 
3.2%
18
 
3.0%
18
 
3.0%
12
 
2.0%
12
 
2.0%
11
 
1.8%
11
 
1.8%
Other values (160) 394
65.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 532 | 88.8%
Math Symbol | 50 | 8.3%
Space Separator | 7 | 1.2%
Open Punctuation | 4 | 0.7%
Close Punctuation | 4 | 0.7%
Uppercase Letter | 2 | 0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
 
5.8%
23
 
4.3%
19
 
3.6%
18
 
3.4%
18
 
3.4%
12
 
2.3%
12
 
2.3%
11
 
2.1%
11
 
2.1%
8
 
1.5%
Other values (154) 369
69.4%
Uppercase Letter
ValueCountFrequency (%)
P 1
50.0%
E 1
50.0%
Math Symbol
ValueCountFrequency (%)
+ 50
100.0%
Space Separator
ValueCountFrequency (%)
7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 532 | 88.8%
Common | 65 | 10.9%
Latin | 2 | 0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
31
 
5.8%
23
 
4.3%
19
 
3.6%
18
 
3.4%
18
 
3.4%
12
 
2.3%
12
 
2.3%
11
 
2.1%
11
 
2.1%
8
 
1.5%
Other values (154) 369
69.4%
Common
ValueCountFrequency (%)
+ 50
76.9%
7
 
10.8%
( 4
 
6.2%
) 4
 
6.2%
Latin
ValueCountFrequency (%)
P 1
50.0%
E 1
50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 532 | 88.8%
ASCII | 67 | 11.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
+ 50
74.6%
7
 
10.4%
( 4
 
6.0%
) 4
 
6.0%
P 1
 
1.5%
E 1
 
1.5%
Hangul
ValueCountFrequency (%)
31
 
5.8%
23
 
4.3%
19
 
3.6%
18
 
3.4%
18
 
3.4%
12
 
2.3%
12
 
2.3%
11
 
2.1%
11
 
2.1%
8
 
1.5%
Other values (154) 369
69.4%

Correlations

Variable | 회사명 | 대표자명 | 공장대표 주소 | 전화번호 | 생산품
회사명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
대표자명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
공장대표 주소 | 1.000 | 1.000 | 1.000 | 0.999 | 0.994
전화번호 | 1.000 | 1.000 | 0.999 | 1.000 | 0.995
생산품 | 1.000 | 1.000 | 0.994 | 0.995 | 1.000
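
The report does not state which association measure produced this matrix. As a sketch under the assumption that it is a Cramér's V style statistic for categorical pairs, the example below shows why two columns whose values are all unique score 1.0 against each other, which matches the near-1.000 entries above. The function `cramers_v` and the toy columns are hypothetical, and this is the plain, uncorrected form of the statistic, which may differ from what the profiling tool computes.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long label sequences."""
    n = len(xs)
    pair_counts = Counter(zip(xs, ys))
    x_counts = Counter(xs)
    y_counts = Counter(ys)
    k = min(len(x_counts), len(y_counts)) - 1
    if n == 0 or k == 0:
        return 0.0  # treated as 0 here; undefined for a constant column
    chi2 = 0.0
    # Sum over every cell of the contingency table, including empty cells.
    for x, nx in x_counts.items():
        for y, ny in y_counts.items():
            expected = nx * ny / n
            observed = pair_counts.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * k))


# Toy columns: every value is unique, as with 회사명 and 대표자명.
unique_a = ["a", "b", "c", "d"]
unique_b = ["w", "x", "y", "z"]
v_unique = cramers_v(unique_a, unique_b)

# Toy columns with no association at all.
indep_a = ["a", "a", "b", "b"]
indep_b = ["p", "q", "p", "q"]
v_indep = cramers_v(indep_a, indep_b)

print(v_unique, v_indep)
```

With high-cardinality text columns like these, a value close to 1 mostly reflects that almost every row carries its own label, so it should not be read as a meaningful relationship between the variables.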

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 회사명 | 대표자명 | 공장대표 주소 | 전화번호 | 생산품
0 | (주)가야아스콘 | 강동훈 | 경상남도 합천군 묘산면 영서로 1651-58 | 055-932-5861 | 아스팔트+콘크리트
1 | (주)가회청목주가 | 이병웅 | 경상남도 합천군 가회면 덕촌길 167-1 | 055-932-9130 | 주류
2 | (주)구포국수 | 허영민 | 경상남도 합천군 가회면 황매산로 183 | 055-933-1300 | 건면(국수)
3 | (주)글로벌오토클래드 합천공장 | 이쌍근 | 경상남도 합천군 합천읍 계림1길 45 | 055-931-4360 | 이종금속+판재+관재
4 | (주)담양대나무 | 박염미 | 경상남도 합천군 삼가면 신평3길 11 | 055-932-9888 | 대나무제품
5 | (주)대경21세기 | 이소순 | 경상남도 합천군 용주면 황계폭포로 715 | 055-932-1632 | 화장품업
6 | (주)대유인터내셔날 야로공장 | 황선철 | 경상남도 합천군 야로면 나대길 57 | 055-931-8104 | 화섬직물
7 | (주)동천수 가야산샘물 | 박철호 | 경상남도 합천군 묘산면 영서로 1724-12 | 054-531-2003 | 생수
8 | (주)둥지산업 | 이동일 | 경상남도 합천군 가야면 매화산로 55 | 055-931-7020 | 참숯벽돌+참숯보드+토벽돌
9 | (주)라파클린텍 | 허덕규 | 경상남도 합천군 초계면 동부로 1049 | 055-931-3001 | 마스크
Index | 회사명 | 대표자명 | 공장대표 주소 | 전화번호 | 생산품
83 | 합천농협연합미곡종합처리장 | 노태윤 | 경상남도 합천군 합천읍 마령로 240 | 055-931-5543 |
84 | 합천명품토종돼지 | 유달형 | 경상남도 합천군 묘산면 묘산로 229 | 055-931-1131 | 돼지고기 포장육
85 | 합천생약가공영농조합법인 | 백문기 | 경상남도 합천군 합천읍 내곡길 16 | 055-933-0770 | 생약+식품원료
86 | 합천우리밀영농조합법인 | 김상복 | 경상남도 합천군 초계면 양동로 81 | 055-932-2563 | 국수+밀가루
87 | 합천우리식품 | 박종옥 | 경상남도 합천군 삼가면 모의로 70-3 | 055-931-8889 | 된장+재래식간장
88 | 합천축산업협동조합 | 주영길 | 경상남도 합천군 율곡면 임북1길 25 | 055-932-8000 | 섬유질사료
89 | 합천합동양조장 | 박봉하 | 경상남도 합천군 합천읍 충효로 55 | 055-931-2032 | 탁주
90 | 합천호농업협동조합 | 손덕봉 | 경상남도 합천군 대병면 대지3길 10 | 055-933-7006 |
91 | 형제도자기 | 권혁례 | 경상남도 합천군 가야면 구미가천로 107 | 055-931-9789 | 생활도자기
92 | 형제영농조합법인 | 박용수 | 경상남도 합천군 청덕면 초곡3길 62 | 055-932-9996 | 비료+질소화합물