Overview

Dataset statistics

Number of variables6
Number of observations125
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.1 KiB
Average record size in memory50.1 B

Variable types

Text4
Numeric1
Categorical1

Dataset

Description경상남도 함양군 관내 등록된 공장등록현황 정보로 회사명, 대표자명, 공장대표주소(지번), 종업원수, 생산품, 데이터기준일자 등으로 구성되어 있습니다.
Author경상남도 함양군
URLhttps://www.data.go.kr/data/3064780/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
종업원수 has 4 (3.2%) zerosZeros

Reproduction

Analysis started2023-12-12 05:21:53.386307
Analysis finished2023-12-12 05:21:53.965582
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct124
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-12T14:21:54.120300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length15
Mean length7.704
Min length3

Characters and Unicode

Total characters963
Distinct characters197
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique123 ?
Unique (%)98.4%

Sample

1st row(유)동방
2nd row(유)석주산업
3rd row(주)경호
4th row(주)근하기공
5th row(주)근하하이테크산업
ValueCountFrequency (%)
주식회사 11
 
7.3%
농업회사법인 3
 
2.0%
주)동성중공업 2
 
1.3%
함양공장 2
 
1.3%
주)인산가 2
 
1.3%
죽림지점 2
 
1.3%
퓨어플러스(주 2
 
1.3%
티피 1
 
0.7%
유원 1
 
0.7%
중방콘크리트 1
 
0.7%
Other values (123) 123
82.0%
2023-12-12T14:21:54.783973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
63
 
6.5%
( 47
 
4.9%
) 47
 
4.9%
39
 
4.0%
38
 
3.9%
31
 
3.2%
30
 
3.1%
27
 
2.8%
25
 
2.6%
24
 
2.5%
Other values (187) 592
61.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 841
87.3%
Open Punctuation 47
 
4.9%
Close Punctuation 47
 
4.9%
Space Separator 25
 
2.6%
Uppercase Letter 2
 
0.2%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
63
 
7.5%
39
 
4.6%
38
 
4.5%
31
 
3.7%
30
 
3.6%
27
 
3.2%
24
 
2.9%
23
 
2.7%
23
 
2.7%
19
 
2.3%
Other values (181) 524
62.3%
Uppercase Letter
ValueCountFrequency (%)
S 1
50.0%
B 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 47
100.0%
Close Punctuation
ValueCountFrequency (%)
) 47
100.0%
Space Separator
ValueCountFrequency (%)
25
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 841
87.3%
Common 120
 
12.5%
Latin 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
63
 
7.5%
39
 
4.6%
38
 
4.5%
31
 
3.7%
30
 
3.6%
27
 
3.2%
24
 
2.9%
23
 
2.7%
23
 
2.7%
19
 
2.3%
Other values (181) 524
62.3%
Common
ValueCountFrequency (%)
( 47
39.2%
) 47
39.2%
25
20.8%
2 1
 
0.8%
Latin
ValueCountFrequency (%)
S 1
50.0%
B 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 841
87.3%
ASCII 122
 
12.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
63
 
7.5%
39
 
4.6%
38
 
4.5%
31
 
3.7%
30
 
3.6%
27
 
3.2%
24
 
2.9%
23
 
2.7%
23
 
2.7%
19
 
2.3%
Other values (181) 524
62.3%
ASCII
ValueCountFrequency (%)
( 47
38.5%
) 47
38.5%
25
20.5%
2 1
 
0.8%
S 1
 
0.8%
B 1
 
0.8%
Distinct116
Distinct (%)92.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-12T14:21:55.085746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length3
Mean length3.136
Min length3

Characters and Unicode

Total characters392
Distinct characters125
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)86.4%

Sample

1st row한상권, 박선주
2nd row안종범
3rd row노원상
4th row문천연
5th row권종규
ValueCountFrequency (%)
박재용 3
 
2.3%
한상권 3
 
2.3%
신정환 2
 
1.6%
홍순명 2
 
1.6%
박시우 2
 
1.6%
김윤세 2
 
1.6%
강선욱 2
 
1.6%
박상대 2
 
1.6%
임혜경 1
 
0.8%
임승호 1
 
0.8%
Other values (108) 108
84.4%
2023-12-12T14:21:55.516179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22
 
5.6%
20
 
5.1%
15
 
3.8%
15
 
3.8%
11
 
2.8%
10
 
2.6%
9
 
2.3%
9
 
2.3%
9
 
2.3%
9
 
2.3%
Other values (115) 263
67.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 385
98.2%
Space Separator 3
 
0.8%
Other Punctuation 3
 
0.8%
Decimal Number 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
 
5.7%
20
 
5.2%
15
 
3.9%
15
 
3.9%
11
 
2.9%
10
 
2.6%
9
 
2.3%
9
 
2.3%
9
 
2.3%
9
 
2.3%
Other values (112) 256
66.5%
Space Separator
ValueCountFrequency (%)
3
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%
Decimal Number
ValueCountFrequency (%)
3 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 385
98.2%
Common 7
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
 
5.7%
20
 
5.2%
15
 
3.9%
15
 
3.9%
11
 
2.9%
10
 
2.6%
9
 
2.3%
9
 
2.3%
9
 
2.3%
9
 
2.3%
Other values (112) 256
66.5%
Common
ValueCountFrequency (%)
3
42.9%
, 3
42.9%
3 1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 385
98.2%
ASCII 7
 
1.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
22
 
5.7%
20
 
5.2%
15
 
3.9%
15
 
3.9%
11
 
2.9%
10
 
2.6%
9
 
2.3%
9
 
2.3%
9
 
2.3%
9
 
2.3%
Other values (112) 256
66.5%
ASCII
ValueCountFrequency (%)
3
42.9%
, 3
42.9%
3 1
 
14.3%
Distinct118
Distinct (%)94.4%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-12T14:21:55.831326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length41
Mean length25.592
Min length21

Characters and Unicode

Total characters3199
Distinct characters98
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique111 ?
Unique (%)88.8%

Sample

1st row경상남도 함양군 지곡면 도촌리 258-82번지 외 1필지
2nd row경상남도 함양군 안의면 황곡리 1834-6번지 황곡리 1834-6 외 2필지
3rd row경상남도 함양군 유림면 화촌리 20-1번지
4th row경상남도 함양군 안의면 황곡리 1818번지 (주)근하하이테크산업 외 1필지
5th row경상남도 함양군 안의면 황곡리 1818번지 (황곡리 1818)
ValueCountFrequency (%)
경상남도 126
18.2%
함양군 126
18.2%
함양읍 29
 
4.2%
수동면 29
 
4.2%
안의면 24
 
3.5%
황곡리 22
 
3.2%
22
 
3.2%
이은리 13
 
1.9%
1필지 11
 
1.6%
화산리 11
 
1.6%
Other values (177) 280
40.4%
2023-12-12T14:21:56.302007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
568
17.8%
156
 
4.9%
155
 
4.8%
148
 
4.6%
136
 
4.3%
133
 
4.2%
132
 
4.1%
1 128
 
4.0%
128
 
4.0%
127
 
4.0%
Other values (88) 1388
43.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1982
62.0%
Space Separator 568
 
17.8%
Decimal Number 557
 
17.4%
Dash Punctuation 79
 
2.5%
Open Punctuation 5
 
0.2%
Close Punctuation 5
 
0.2%
Other Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
156
 
7.9%
155
 
7.8%
148
 
7.5%
136
 
6.9%
133
 
6.7%
132
 
6.7%
128
 
6.5%
127
 
6.4%
126
 
6.4%
119
 
6.0%
Other values (72) 622
31.4%
Decimal Number
ValueCountFrequency (%)
1 128
23.0%
2 84
15.1%
0 57
10.2%
3 54
9.7%
8 51
 
9.2%
5 46
 
8.3%
7 45
 
8.1%
4 36
 
6.5%
9 28
 
5.0%
6 28
 
5.0%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
, 1
33.3%
Space Separator
ValueCountFrequency (%)
568
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 79
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1982
62.0%
Common 1217
38.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
156
 
7.9%
155
 
7.8%
148
 
7.5%
136
 
6.9%
133
 
6.7%
132
 
6.7%
128
 
6.5%
127
 
6.4%
126
 
6.4%
119
 
6.0%
Other values (72) 622
31.4%
Common
ValueCountFrequency (%)
568
46.7%
1 128
 
10.5%
2 84
 
6.9%
- 79
 
6.5%
0 57
 
4.7%
3 54
 
4.4%
8 51
 
4.2%
5 46
 
3.8%
7 45
 
3.7%
4 36
 
3.0%
Other values (6) 69
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1982
62.0%
ASCII 1217
38.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
568
46.7%
1 128
 
10.5%
2 84
 
6.9%
- 79
 
6.5%
0 57
 
4.7%
3 54
 
4.4%
8 51
 
4.2%
5 46
 
3.8%
7 45
 
3.7%
4 36
 
3.0%
Other values (6) 69
 
5.7%
Hangul
ValueCountFrequency (%)
156
 
7.9%
155
 
7.8%
148
 
7.5%
136
 
6.9%
133
 
6.7%
132
 
6.7%
128
 
6.5%
127
 
6.4%
126
 
6.4%
119
 
6.0%
Other values (72) 622
31.4%

종업원수
Real number (ℝ)

ZEROS 

Distinct31
Distinct (%)24.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.456
Minimum0
Maximum250
Zeros4
Zeros (%)3.2%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2023-12-12T14:21:56.475706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1.2
Q13
median7
Q312
95-th percentile39.6
Maximum250
Range250
Interquartile range (IQR)9

Descriptive statistics

Standard deviation29.271671
Coefficient of variation (CV)2.175362
Kurtosis43.452726
Mean13.456
Median Absolute Deviation (MAD)4
Skewness6.1730265
Sum1682
Variance856.83071
MonotonicityNot monotonic
2023-12-12T14:21:56.614576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
5 15
12.0%
2 14
 
11.2%
7 13
 
10.4%
3 13
 
10.4%
10 9
 
7.2%
6 7
 
5.6%
4 6
 
4.8%
8 4
 
3.2%
0 4
 
3.2%
1 3
 
2.4%
Other values (21) 37
29.6%
ValueCountFrequency (%)
0 4
 
3.2%
1 3
 
2.4%
2 14
11.2%
3 13
10.4%
4 6
 
4.8%
5 15
12.0%
6 7
5.6%
7 13
10.4%
8 4
 
3.2%
9 3
 
2.4%
ValueCountFrequency (%)
250 1
0.8%
185 1
0.8%
81 1
0.8%
67 1
0.8%
61 1
0.8%
55 1
0.8%
42 1
0.8%
30 2
1.6%
27 1
0.8%
25 2
1.6%
Distinct117
Distinct (%)93.6%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-12T14:21:56.967994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length25
Mean length8.168
Min length1

Characters and Unicode

Total characters1021
Distinct characters271
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique112 ?
Unique (%)89.6%

Sample

1st row레미콘
2nd row콘크리트 블록
3rd row건양파,건당근
4th rowSTEEL Form, 철물창호, 철근콘크리트공사, PC콘크리트
5th row철구조물
ValueCountFrequency (%)
레미콘 5
 
2.5%
4
 
2.0%
3
 
1.5%
석제품 3
 
1.5%
철구조물 3
 
1.5%
톱밥 2
 
1.0%
백미 2
 
1.0%
형강 2
 
1.0%
2
 
1.0%
구조물 2
 
1.0%
Other values (171) 175
86.2%
2023-12-12T14:21:57.455956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
79
 
7.7%
, 63
 
6.2%
30
 
2.9%
19
 
1.9%
18
 
1.8%
17
 
1.7%
15
 
1.5%
12
 
1.2%
12
 
1.2%
12
 
1.2%
Other values (261) 744
72.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 792
77.6%
Space Separator 79
 
7.7%
Other Punctuation 71
 
7.0%
Uppercase Letter 37
 
3.6%
Lowercase Letter 18
 
1.8%
Open Punctuation 11
 
1.1%
Close Punctuation 11
 
1.1%
Decimal Number 1
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
3.8%
19
 
2.4%
18
 
2.3%
17
 
2.1%
15
 
1.9%
12
 
1.5%
12
 
1.5%
12
 
1.5%
11
 
1.4%
11
 
1.4%
Other values (223) 635
80.2%
Uppercase Letter
ValueCountFrequency (%)
P 5
13.5%
N 4
10.8%
E 4
10.8%
T 4
10.8%
I 3
8.1%
O 2
 
5.4%
G 2
 
5.4%
R 2
 
5.4%
C 2
 
5.4%
S 2
 
5.4%
Other values (7) 7
18.9%
Lowercase Letter
ValueCountFrequency (%)
e 3
16.7%
p 2
11.1%
u 2
11.1%
r 2
11.1%
o 1
 
5.6%
d 1
 
5.6%
b 1
 
5.6%
t 1
 
5.6%
a 1
 
5.6%
l 1
 
5.6%
Other values (3) 3
16.7%
Other Punctuation
ValueCountFrequency (%)
, 63
88.7%
. 5
 
7.0%
/ 3
 
4.2%
Space Separator
ValueCountFrequency (%)
79
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Decimal Number
ValueCountFrequency (%)
4 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 792
77.6%
Common 174
 
17.0%
Latin 55
 
5.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
 
3.8%
19
 
2.4%
18
 
2.3%
17
 
2.1%
15
 
1.9%
12
 
1.5%
12
 
1.5%
12
 
1.5%
11
 
1.4%
11
 
1.4%
Other values (223) 635
80.2%
Latin
ValueCountFrequency (%)
P 5
 
9.1%
N 4
 
7.3%
E 4
 
7.3%
T 4
 
7.3%
I 3
 
5.5%
e 3
 
5.5%
p 2
 
3.6%
u 2
 
3.6%
O 2
 
3.6%
G 2
 
3.6%
Other values (20) 24
43.6%
Common
ValueCountFrequency (%)
79
45.4%
, 63
36.2%
( 11
 
6.3%
) 11
 
6.3%
. 5
 
2.9%
/ 3
 
1.7%
4 1
 
0.6%
- 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 792
77.6%
ASCII 229
 
22.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
79
34.5%
, 63
27.5%
( 11
 
4.8%
) 11
 
4.8%
P 5
 
2.2%
. 5
 
2.2%
N 4
 
1.7%
E 4
 
1.7%
T 4
 
1.7%
/ 3
 
1.3%
Other values (28) 40
17.5%
Hangul
ValueCountFrequency (%)
30
 
3.8%
19
 
2.4%
18
 
2.3%
17
 
2.1%
15
 
1.9%
12
 
1.5%
12
 
1.5%
12
 
1.5%
11
 
1.4%
11
 
1.4%
Other values (223) 635
80.2%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2021-07-13
125 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-07-13
2nd row2021-07-13
3rd row2021-07-13
4th row2021-07-13
5th row2021-07-13

Common Values

ValueCountFrequency (%)
2021-07-13 125
100.0%

Length

2023-12-12T14:21:57.608475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:21:57.721027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-07-13 125
100.0%

Interactions

2023-12-12T14:21:53.722995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T14:21:53.826049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:21:53.924864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

회사명대표자명공장대표주소(지번)종업원수생산품데이터기준일자
0(유)동방한상권, 박선주경상남도 함양군 지곡면 도촌리 258-82번지 외 1필지11레미콘2021-07-13
1(유)석주산업안종범경상남도 함양군 안의면 황곡리 1834-6번지 황곡리 1834-6 외 2필지7콘크리트 블록2021-07-13
2(주)경호노원상경상남도 함양군 유림면 화촌리 20-1번지4건양파,건당근2021-07-13
3(주)근하기공문천연경상남도 함양군 안의면 황곡리 1818번지 (주)근하하이테크산업 외 1필지7STEEL Form, 철물창호, 철근콘크리트공사, PC콘크리트2021-07-13
4(주)근하하이테크산업권종규경상남도 함양군 안의면 황곡리 1818번지 (황곡리 1818)25철구조물2021-07-13
5(주)금산철강 함양지점김화중경상남도 함양군 안의면 황곡리 2203-00농업용 파이프, 강관, 형강2021-07-13
6(주)동성중공업신정환경상남도 함양군 안의면 황곡리 2199번지0철골구조물2021-07-13
7(주)동성중공업신정환경상남도 함양군 안의면 황곡리 1820번지 (주)동성중공업42철구조물 제작 및 설치2021-07-13
8(주)동주산업한상권경상남도 함양군 수동면 원평리 734-27번지 외 2필지18레미콘2021-07-13
9(주)동주아스콘한상권경상남도 함양군 수동면 원평리 734-27번지 외 1필지9아스콘2021-07-13
회사명대표자명공장대표주소(지번)종업원수생산품데이터기준일자
115함양영농조합법인이종현경상남도 함양군 수동면 우명리 1056-5번지4곡물도정업(쌀)2021-07-13
116함양제강주식회사임상문경상남도 함양군 휴천면 목현리 840-16번지0인코트2021-07-13
117함양조경석김정희경상남도 함양군 안의면 도림리 447-3.4번지4경계.디딤.기공석2021-07-13
118함양종합포장장병구경상남도 함양군 유림면 옥매리 902-1번지7포장용지2021-07-13
119함양지리산다원오금자경상남도 함양군 서하면 송계리 295-1번지3국화차2021-07-13
120함양축협사료공장노익환경상남도 함양군 수동면 도북리 406-7번지10동물용 사료2021-07-13
121함양축협친환경퇴비사업소김형석경상남도 함양군 수동면 도북리 406번지4친환경퇴비2021-07-13
122함양합동양조장하기식외3경상남도 함양군 함양읍 이은리 289번지2탁주2021-07-13
123햇빛고운마을박윤광경상남도 함양군 안의면 상원리 715-2번지3절임식품및다류2021-07-13
124허브앤티허정탁경상남도 함양군 함양읍 이은리 357-3번지8침출차,녹차,둥굴레차2021-07-13