Overview

Dataset statistics

Number of variables13
Number of observations169
Missing cells20
Missing cells (%)0.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory17.6 KiB
Average record size in memory106.8 B

Variable types

Numeric2
Categorical7
Text4

Dataset

Description산업분류상 건축관련 업종별 업체현황으로 업체명, 대표자명, 도로명주소 등 정보를 제공합니다. 개인정보를 위하여 업체명, 대표자명은 별표처리하였습니다.
Author경상북도 고령군
URLhttps://www.data.go.kr/data/15126606/fileData.do

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
산업대분류명 is highly overall correlated with 순번 and 3 other fieldsHigh correlation
산업소분류명 is highly overall correlated with 순번 and 3 other fieldsHigh correlation
산업중분류명 is highly overall correlated with 순번 and 3 other fieldsHigh correlation
코드업종명 is highly overall correlated with 순번 and 3 other fieldsHigh correlation
순번 is highly overall correlated with 산업대분류명 and 3 other fieldsHigh correlation
종사자수 is highly overall correlated with 업체유형High correlation
업체유형 is highly overall correlated with 종사자수High correlation
도로명주소 has 2 (1.2%) missing valuesMissing
종사자수 has 18 (10.7%) missing valuesMissing
순번 has unique valuesUnique

Reproduction

Analysis started2024-03-14 11:46:00.325239
Analysis finished2024-03-14 11:46:03.193713
Duration2.87 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct169
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean85
Minimum1
Maximum169
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2024-03-14T20:46:03.412890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile9.4
Q143
median85
Q3127
95-th percentile160.6
Maximum169
Range168
Interquartile range (IQR)84

Descriptive statistics

Standard deviation48.930222
Coefficient of variation (CV)0.57564968
Kurtosis-1.2
Mean85
Median Absolute Deviation (MAD)42
Skewness0
Sum14365
Variance2394.1667
MonotonicityStrictly increasing
2024-03-14T20:46:03.976038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.6%
117 1
 
0.6%
109 1
 
0.6%
110 1
 
0.6%
111 1
 
0.6%
112 1
 
0.6%
113 1
 
0.6%
114 1
 
0.6%
115 1
 
0.6%
116 1
 
0.6%
Other values (159) 159
94.1%
ValueCountFrequency (%)
1 1
0.6%
2 1
0.6%
3 1
0.6%
4 1
0.6%
5 1
0.6%
6 1
0.6%
7 1
0.6%
8 1
0.6%
9 1
0.6%
10 1
0.6%
ValueCountFrequency (%)
169 1
0.6%
168 1
0.6%
167 1
0.6%
166 1
0.6%
165 1
0.6%
164 1
0.6%
163 1
0.6%
162 1
0.6%
161 1
0.6%
160 1
0.6%

산업대분류명
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
섬유제품 제조업; 의복 제외
55 
고무 및 플라스틱제품 제조업
44 
화학물질 및 화학제품 제조업; 의약품 제외
32 
목재 및 나무제품 제조업; 가구 제외
31 
비금속광물 광업; 연료용 제외
 
5

Length

Max length23
Median length15
Mean length17.508876
Min length15

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row비금속광물 광업; 연료용 제외
2nd row비금속광물 광업; 연료용 제외
3rd row비금속광물 광업; 연료용 제외
4th row비금속광물 광업; 연료용 제외
5th row비금속광물 광업; 연료용 제외

Common Values

ValueCountFrequency (%)
섬유제품 제조업; 의복 제외 55
32.5%
고무 및 플라스틱제품 제조업 44
26.0%
화학물질 및 화학제품 제조업; 의약품 제외 32
18.9%
목재 및 나무제품 제조업; 가구 제외 31
18.3%
비금속광물 광업; 연료용 제외 5
 
3.0%
코크스, 연탄 및 석유정제품 제조업 2
 
1.2%

Length

2024-03-14T20:46:04.376777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:46:04.748283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제조업 164
20.4%
제외 123
15.3%
109
13.6%
섬유제품 55
 
6.8%
의복 55
 
6.8%
고무 44
 
5.5%
플라스틱제품 44
 
5.5%
화학제품 32
 
4.0%
의약품 32
 
4.0%
화학물질 32
 
4.0%
Other values (9) 114
14.2%

산업중분류명
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
직물 직조 및 직물제품 제조업
42 
플라스틱 제품 제조업
33 
합성고무 및 플라스틱 물질 제조업
30 
나무제품 제조업
21 
기타 섬유제품 제조업
13 
Other values (5)
30 

Length

Max length18
Median length16
Mean length12.757396
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row토사석 광업
2nd row토사석 광업
3rd row토사석 광업
4th row토사석 광업
5th row토사석 광업

Common Values

ValueCountFrequency (%)
직물 직조 및 직물제품 제조업 42
24.9%
플라스틱 제품 제조업 33
19.5%
합성고무 및 플라스틱 물질 제조업 30
17.8%
나무제품 제조업 21
12.4%
기타 섬유제품 제조업 13
 
7.7%
고무제품 제조업 11
 
6.5%
제재 및 목재 가공업 10
 
5.9%
토사석 광업 5
 
3.0%
석유 정제품 제조업 2
 
1.2%
기타 화학제품 제조업 2
 
1.2%

Length

2024-03-14T20:46:05.194074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:46:05.589410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제조업 154
24.7%
82
13.1%
플라스틱 63
10.1%
직물 42
 
6.7%
직조 42
 
6.7%
직물제품 42
 
6.7%
제품 33
 
5.3%
합성고무 30
 
4.8%
물질 30
 
4.8%
나무제품 21
 
3.4%
Other values (11) 85
13.6%

산업소분류명
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)9.5%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
직물제품 제조업
42 
1차 플라스틱제품 제조업
33 
합성고무 및 플라스틱 물질 제조업
30 
기타 고무제품 제조업
11 
제재 및 목재 가공업
10 
Other values (11)
43 

Length

Max length23
Median length19
Mean length12.83432
Min length8

Unique

Unique2 ?
Unique (%)1.2%

Sample

1st row석회석 및 점토광업
2nd row석재, 쇄석 및 모래, 자갈 채취업
3rd row석재, 쇄석 및 모래, 자갈 채취업
4th row석재, 쇄석 및 모래, 자갈 채취업
5th row석재, 쇄석 및 모래, 자갈 채취업

Common Values

ValueCountFrequency (%)
직물제품 제조업 42
24.9%
1차 플라스틱제품 제조업 33
19.5%
합성고무 및 플라스틱 물질 제조업 30
17.8%
기타 고무제품 제조업 11
 
6.5%
제재 및 목재 가공업 10
 
5.9%
그 외 기타 섬유제품 제조업 9
 
5.3%
건축용 나무제품 제조업 9
 
5.3%
기타 나무제품 제조업 9
 
5.3%
석재, 쇄석 및 모래, 자갈 채취업 4
 
2.4%
카펫, 마루덮개 및 유사 제품 제조업 2
 
1.2%
Other values (6) 10
 
5.9%

Length

2024-03-14T20:46:06.055871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
제조업 152
25.6%
54
 
9.1%
직물제품 42
 
7.1%
1차 33
 
5.6%
플라스틱제품 33
 
5.6%
합성고무 30
 
5.1%
플라스틱 30
 
5.1%
물질 30
 
5.1%
기타 29
 
4.9%
나무제품 18
 
3.0%
Other values (37) 143
24.1%

코드업종명
Categorical

HIGH CORRELATION 

Distinct29
Distinct (%)17.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
천막, 텐트 및 유사 제품 제조업
29 
혼성 및 재생 플라스틱 소재 물질 제조업
19 
플라스틱 선, 봉, 관 및 호스 제조업
17 
플라스틱 필름 제조업
15 
합성수지 및 기타 플라스틱 물질 제조업
11 
Other values (24)
78 

Length

Max length24
Median length20
Mean length16.822485
Min length6

Unique

Unique6 ?
Unique (%)3.6%

Sample

1st row석회석 및 점토 광업
2nd row모래 및 자갈 채취업
3rd row건설용 석재 채굴 및 쇄석 생산업
4th row건설용 석재 채굴 및 쇄석 생산업
5th row건설용 석재 채굴 및 쇄석 생산업

Common Values

ValueCountFrequency (%)
천막, 텐트 및 유사 제품 제조업 29
17.2%
혼성 및 재생 플라스틱 소재 물질 제조업 19
11.2%
플라스틱 선, 봉, 관 및 호스 제조업 17
10.1%
플라스틱 필름 제조업 15
 
8.9%
합성수지 및 기타 플라스틱 물질 제조업 11
 
6.5%
부직포 및 펠트 제조업 9
 
5.3%
산업용 그 외 비경화 고무제품 제조업 8
 
4.7%
커튼 및 유사제품 제조업 7
 
4.1%
직물포대 제조업 6
 
3.6%
목재 문 및 관련제품 제조업 6
 
3.6%
Other values (19) 42
24.9%

Length

2024-03-14T20:46:06.488466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
제조업 157
17.7%
125
 
14.1%
플라스틱 63
 
7.1%
유사 32
 
3.6%
물질 30
 
3.4%
텐트 29
 
3.3%
천막 29
 
3.3%
제품 29
 
3.3%
재생 19
 
2.1%
혼성 19
 
2.1%
Other values (69) 357
40.2%
Distinct120
Distinct (%)71.0%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-03-14T20:46:07.355776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length5
Mean length6.1301775
Min length5

Characters and Unicode

Total characters1036
Distinct characters135
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique95 ?
Unique (%)56.2%

Sample

1st row신****토
2nd row희****
3rd row(****개발
4th row주*******산업
5th row대****
ValueCountFrequency (%)
11
 
6.3%
7
 
4.0%
7
 
4.0%
5
 
2.9%
4
 
2.3%
4
 
2.3%
3
 
1.7%
3
 
1.7%
3
 
1.7%
3
 
1.7%
Other values (101) 124
71.3%
2024-03-14T20:46:08.620040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 723
69.8%
( 24
 
2.3%
18
 
1.7%
14
 
1.4%
10
 
1.0%
9
 
0.9%
8
 
0.8%
6
 
0.6%
5
 
0.5%
5
 
0.5%
Other values (125) 214
 
20.7%

Most occurring categories

ValueCountFrequency (%)
Other Punctuation 723
69.8%
Other Letter 268
 
25.9%
Open Punctuation 25
 
2.4%
Lowercase Letter 10
 
1.0%
Space Separator 5
 
0.5%
Close Punctuation 4
 
0.4%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18
 
6.7%
14
 
5.2%
10
 
3.7%
9
 
3.4%
8
 
3.0%
6
 
2.2%
5
 
1.9%
5
 
1.9%
5
 
1.9%
5
 
1.9%
Other values (111) 183
68.3%
Lowercase Letter
ValueCountFrequency (%)
t 2
20.0%
i 2
20.0%
a 1
10.0%
o 1
10.0%
n 1
10.0%
c 1
10.0%
r 1
10.0%
g 1
10.0%
Open Punctuation
ValueCountFrequency (%)
( 24
96.0%
1
 
4.0%
Other Punctuation
ValueCountFrequency (%)
* 723
100.0%
Space Separator
ValueCountFrequency (%)
5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Uppercase Letter
ValueCountFrequency (%)
R 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 757
73.1%
Hangul 268
 
25.9%
Latin 11
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
18
 
6.7%
14
 
5.2%
10
 
3.7%
9
 
3.4%
8
 
3.0%
6
 
2.2%
5
 
1.9%
5
 
1.9%
5
 
1.9%
5
 
1.9%
Other values (111) 183
68.3%
Latin
ValueCountFrequency (%)
t 2
18.2%
i 2
18.2%
R 1
9.1%
a 1
9.1%
o 1
9.1%
n 1
9.1%
c 1
9.1%
r 1
9.1%
g 1
9.1%
Common
ValueCountFrequency (%)
* 723
95.5%
( 24
 
3.2%
5
 
0.7%
) 4
 
0.5%
1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 767
74.0%
Hangul 268
 
25.9%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 723
94.3%
( 24
 
3.1%
5
 
0.7%
) 4
 
0.5%
t 2
 
0.3%
i 2
 
0.3%
R 1
 
0.1%
a 1
 
0.1%
o 1
 
0.1%
n 1
 
0.1%
Other values (3) 3
 
0.4%
Hangul
ValueCountFrequency (%)
18
 
6.7%
14
 
5.2%
10
 
3.7%
9
 
3.4%
8
 
3.0%
6
 
2.2%
5
 
1.9%
5
 
1.9%
5
 
1.9%
5
 
1.9%
Other values (111) 183
68.3%
None
ValueCountFrequency (%)
1
100.0%
Distinct147
Distinct (%)87.0%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-03-14T20:46:10.020107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length3
Mean length3.0177515
Min length3

Characters and Unicode

Total characters510
Distinct characters103
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique126 ?
Unique (%)74.6%

Sample

1st row장*보
2nd row유*희
3rd row배*창
4th row이*식
5th row이*호
ValueCountFrequency (%)
김*호 3
 
1.8%
금*수 2
 
1.2%
이*수 2
 
1.2%
최*국 2
 
1.2%
김*현 2
 
1.2%
김*수 2
 
1.2%
윤*화 2
 
1.2%
이*석 2
 
1.2%
조*순 2
 
1.2%
김*주 2
 
1.2%
Other values (137) 148
87.6%
2024-03-14T20:46:11.833795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 169
33.1%
36
 
7.1%
29
 
5.7%
15
 
2.9%
12
 
2.4%
11
 
2.2%
9
 
1.8%
8
 
1.6%
8
 
1.6%
7
 
1.4%
Other values (93) 206
40.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 340
66.7%
Other Punctuation 169
33.1%
Decimal Number 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
36
 
10.6%
29
 
8.5%
15
 
4.4%
12
 
3.5%
11
 
3.2%
9
 
2.6%
8
 
2.4%
8
 
2.4%
7
 
2.1%
7
 
2.1%
Other values (91) 198
58.2%
Other Punctuation
ValueCountFrequency (%)
* 169
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 340
66.7%
Common 170
33.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
36
 
10.6%
29
 
8.5%
15
 
4.4%
12
 
3.5%
11
 
3.2%
9
 
2.6%
8
 
2.4%
8
 
2.4%
7
 
2.1%
7
 
2.1%
Other values (91) 198
58.2%
Common
ValueCountFrequency (%)
* 169
99.4%
1 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 340
66.7%
ASCII 170
33.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 169
99.4%
1 1
 
0.6%
Hangul
ValueCountFrequency (%)
36
 
10.6%
29
 
8.5%
15
 
4.4%
12
 
3.5%
11
 
3.2%
9
 
2.6%
8
 
2.4%
8
 
2.4%
7
 
2.1%
7
 
2.1%
Other values (91) 198
58.2%

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
경상북도
169 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경상북도
2nd row경상북도
3rd row경상북도
4th row경상북도
5th row경상북도

Common Values

ValueCountFrequency (%)
경상북도 169
100.0%

Length

2024-03-14T20:46:12.262773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:46:12.583494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경상북도 169
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
고령군
169 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고령군
2nd row고령군
3rd row고령군
4th row고령군
5th row고령군

Common Values

ValueCountFrequency (%)
고령군 169
100.0%

Length

2024-03-14T20:46:12.938118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:46:13.259513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고령군 169
100.0%

도로명주소
Text

MISSING 

Distinct156
Distinct (%)93.4%
Missing2
Missing (%)1.2%
Memory size1.4 KiB
2024-03-14T20:46:14.754981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length12.341317
Min length9

Characters and Unicode

Total characters2061
Distinct characters93
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique145 ?
Unique (%)86.8%

Sample

1st row대가야읍 월기길 21
2nd row우곡면 우곡로 670
3rd row쌍림면 방아실길 73
4th row쌍림면 쌍쌍로 618-47
5th row성산면 운성로 750-9
ValueCountFrequency (%)
개진면 46
 
9.2%
다산면 43
 
8.6%
성산면 40
 
8.0%
개경포로 23
 
4.6%
대가야읍 19
 
3.8%
회천로 16
 
3.2%
쌍림면 15
 
3.0%
다산산단로 10
 
2.0%
나선로 6
 
1.2%
상용로 6
 
1.2%
Other values (201) 277
55.3%
2024-03-14T20:46:16.479171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
334
 
16.2%
148
 
7.2%
123
 
6.0%
1 102
 
4.9%
86
 
4.2%
81
 
3.9%
2 80
 
3.9%
3 76
 
3.7%
72
 
3.5%
57
 
2.8%
Other values (83) 902
43.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1140
55.3%
Decimal Number 536
26.0%
Space Separator 334
 
16.2%
Dash Punctuation 51
 
2.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
148
 
13.0%
123
 
10.8%
86
 
7.5%
81
 
7.1%
72
 
6.3%
57
 
5.0%
51
 
4.5%
50
 
4.4%
40
 
3.5%
24
 
2.1%
Other values (71) 408
35.8%
Decimal Number
ValueCountFrequency (%)
1 102
19.0%
2 80
14.9%
3 76
14.2%
5 48
9.0%
4 48
9.0%
7 47
8.8%
0 45
8.4%
6 41
7.6%
8 29
 
5.4%
9 20
 
3.7%
Space Separator
ValueCountFrequency (%)
334
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 51
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1140
55.3%
Common 921
44.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
148
 
13.0%
123
 
10.8%
86
 
7.5%
81
 
7.1%
72
 
6.3%
57
 
5.0%
51
 
4.5%
50
 
4.4%
40
 
3.5%
24
 
2.1%
Other values (71) 408
35.8%
Common
ValueCountFrequency (%)
334
36.3%
1 102
 
11.1%
2 80
 
8.7%
3 76
 
8.3%
- 51
 
5.5%
5 48
 
5.2%
4 48
 
5.2%
7 47
 
5.1%
0 45
 
4.9%
6 41
 
4.5%
Other values (2) 49
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1140
55.3%
ASCII 921
44.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
334
36.3%
1 102
 
11.1%
2 80
 
8.7%
3 76
 
8.3%
- 51
 
5.5%
5 48
 
5.2%
4 48
 
5.2%
7 47
 
5.1%
0 45
 
4.9%
6 41
 
4.5%
Other values (2) 49
 
5.3%
Hangul
ValueCountFrequency (%)
148
 
13.0%
123
 
10.8%
86
 
7.5%
81
 
7.1%
72
 
6.3%
57
 
5.0%
51
 
4.5%
50
 
4.4%
40
 
3.5%
24
 
2.1%
Other values (71) 408
35.8%
Distinct159
Distinct (%)94.1%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-03-14T20:46:17.807198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length12.023669
Min length10

Characters and Unicode

Total characters2032
Distinct characters78
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique149 ?
Unique (%)88.2%

Sample

1st row덕곡면 노리 148
2nd row대가야읍 지산리 232-14
3rd row우곡면 월오리 산27-1
4th row쌍림면 안림리 740-1
5th row쌍림면 신곡리 산46
ValueCountFrequency (%)
개진면 47
 
9.3%
다산면 44
 
8.7%
성산면 40
 
7.9%
대가야읍 18
 
3.6%
반운리 17
 
3.4%
송곡리 15
 
3.0%
쌍림면 15
 
3.0%
상곡리 9
 
1.8%
나정리 9
 
1.8%
개포리 7
 
1.4%
Other values (204) 286
56.4%
2024-03-14T20:46:19.654176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
338
16.6%
171
 
8.4%
151
 
7.4%
1 126
 
6.2%
90
 
4.4%
- 73
 
3.6%
3 70
 
3.4%
2 69
 
3.4%
4 64
 
3.1%
7 59
 
2.9%
Other values (68) 821
40.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1019
50.1%
Decimal Number 602
29.6%
Space Separator 338
 
16.6%
Dash Punctuation 73
 
3.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
171
16.8%
151
14.8%
90
 
8.8%
54
 
5.3%
51
 
5.0%
47
 
4.6%
44
 
4.3%
44
 
4.3%
27
 
2.6%
20
 
2.0%
Other values (56) 320
31.4%
Decimal Number
ValueCountFrequency (%)
1 126
20.9%
3 70
11.6%
2 69
11.5%
4 64
10.6%
7 59
9.8%
5 49
 
8.1%
6 48
 
8.0%
8 44
 
7.3%
0 38
 
6.3%
9 35
 
5.8%
Space Separator
ValueCountFrequency (%)
338
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 73
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1019
50.1%
Common 1013
49.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
171
16.8%
151
14.8%
90
 
8.8%
54
 
5.3%
51
 
5.0%
47
 
4.6%
44
 
4.3%
44
 
4.3%
27
 
2.6%
20
 
2.0%
Other values (56) 320
31.4%
Common
ValueCountFrequency (%)
338
33.4%
1 126
 
12.4%
- 73
 
7.2%
3 70
 
6.9%
2 69
 
6.8%
4 64
 
6.3%
7 59
 
5.8%
5 49
 
4.8%
6 48
 
4.7%
8 44
 
4.3%
Other values (2) 73
 
7.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1019
50.1%
ASCII 1013
49.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
338
33.4%
1 126
 
12.4%
- 73
 
7.2%
3 70
 
6.9%
2 69
 
6.8%
4 64
 
6.3%
7 59
 
5.8%
5 49
 
4.8%
6 48
 
4.7%
8 44
 
4.3%
Other values (2) 73
 
7.2%
Hangul
ValueCountFrequency (%)
171
16.8%
151
14.8%
90
 
8.8%
54
 
5.3%
51
 
5.0%
47
 
4.6%
44
 
4.3%
44
 
4.3%
27
 
2.6%
20
 
2.0%
Other values (56) 320
31.4%

업체유형
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
개인사업체
124 
회사법인
45 

Length

Max length5
Median length5
Mean length4.7337278
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개인사업체
2nd row회사법인
3rd row회사법인
4th row회사법인
5th row회사법인

Common Values

ValueCountFrequency (%)
개인사업체 124
73.4%
회사법인 45
 
26.6%

Length

2024-03-14T20:46:20.076070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:46:20.407008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개인사업체 124
73.4%
회사법인 45
 
26.6%

종사자수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct31
Distinct (%)20.5%
Missing18
Missing (%)10.7%
Infinite0
Infinite (%)0.0%
Mean7.7086093
Minimum1
Maximum77
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2024-03-14T20:46:20.737015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median4
Q38
95-th percentile27.5
Maximum77
Range76
Interquartile range (IQR)6

Descriptive statistics

Standard deviation9.9005652
Coefficient of variation (CV)1.2843517
Kurtosis17.36768
Mean7.7086093
Median Absolute Deviation (MAD)2
Skewness3.5163106
Sum1164
Variance98.021192
MonotonicityNot monotonic
2024-03-14T20:46:21.158828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
2 26
15.4%
3 22
13.0%
4 18
10.7%
1 14
8.3%
5 12
7.1%
6 10
 
5.9%
8 8
 
4.7%
9 6
 
3.6%
7 5
 
3.0%
11 3
 
1.8%
Other values (21) 27
16.0%
(Missing) 18
10.7%
ValueCountFrequency (%)
1 14
8.3%
2 26
15.4%
3 22
13.0%
4 18
10.7%
5 12
7.1%
6 10
 
5.9%
7 5
 
3.0%
8 8
 
4.7%
9 6
 
3.6%
10 2
 
1.2%
ValueCountFrequency (%)
77 1
0.6%
45 1
0.6%
40 1
0.6%
35 1
0.6%
34 1
0.6%
31 1
0.6%
29 1
0.6%
28 1
0.6%
27 1
0.6%
25 1
0.6%

Interactions

2024-03-14T20:46:01.701264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:46:01.168869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:46:01.968066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:46:01.432292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T20:46:21.434839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번산업대분류명산업중분류명산업소분류명코드업종명업체유형종사자수
순번1.0000.8750.9600.9240.9770.4340.000
산업대분류명0.8751.0001.0001.0001.0000.2760.081
산업중분류명0.9601.0001.0001.0001.0000.4240.000
산업소분류명0.9241.0001.0001.0001.0000.4640.000
코드업종명0.9771.0001.0001.0001.0000.5220.000
업체유형0.4340.2760.4240.4640.5221.0000.476
종사자수0.0000.0810.0000.0000.0000.4761.000
2024-03-14T20:46:21.721974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
산업대분류명산업소분류명산업중분류명업체유형코드업종명
산업대분류명1.0000.9690.9880.1960.927
산업소분류명0.9691.0000.9810.3490.957
산업중분류명0.9880.9811.0000.3170.938
업체유형0.1960.3490.3171.0000.410
코드업종명0.9270.9570.9380.4101.000
2024-03-14T20:46:21.997294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번종사자수산업대분류명산업중분류명산업소분류명코드업종명업체유형
순번1.0000.0140.7040.6620.6870.8000.320
종사자수0.0141.0000.0450.0000.0000.0000.502
산업대분류명0.7040.0451.0000.9880.9690.9270.196
산업중분류명0.6620.0000.9881.0000.9810.9380.317
산업소분류명0.6870.0000.9690.9811.0000.9570.349
코드업종명0.8000.0000.9270.9380.9571.0000.410
업체유형0.3200.5020.1960.3170.3490.4101.000

Missing values

2024-03-14T20:46:02.356060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T20:46:02.813993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T20:46:03.091625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순번산업대분류명산업중분류명산업소분류명코드업종명업체명대표자명시도명시군구명도로명주소지번주소업체유형종사자수
01비금속광물 광업; 연료용 제외토사석 광업석회석 및 점토광업석회석 및 점토 광업신****토장*보경상북도고령군<NA>덕곡면 노리 148개인사업체<NA>
12비금속광물 광업; 연료용 제외토사석 광업석재, 쇄석 및 모래, 자갈 채취업모래 및 자갈 채취업희****유*희경상북도고령군대가야읍 월기길 21대가야읍 지산리 232-14회사법인1
23비금속광물 광업; 연료용 제외토사석 광업석재, 쇄석 및 모래, 자갈 채취업건설용 석재 채굴 및 쇄석 생산업(****개발배*창경상북도고령군우곡면 우곡로 670우곡면 월오리 산27-1회사법인1
34비금속광물 광업; 연료용 제외토사석 광업석재, 쇄석 및 모래, 자갈 채취업건설용 석재 채굴 및 쇄석 생산업주*******산업이*식경상북도고령군쌍림면 방아실길 73쌍림면 안림리 740-1회사법인16
45비금속광물 광업; 연료용 제외토사석 광업석재, 쇄석 및 모래, 자갈 채취업건설용 석재 채굴 및 쇄석 생산업대****이*호경상북도고령군쌍림면 쌍쌍로 618-47쌍림면 신곡리 산46회사법인21
56섬유제품 제조업; 의복 제외직물 직조 및 직물제품 제조업직물제품 제조업커튼 및 유사제품 제조업비****정*진경상북도고령군성산면 운성로 750-9성산면 고탄리 607개인사업체1
67섬유제품 제조업; 의복 제외직물 직조 및 직물제품 제조업직물제품 제조업커튼 및 유사제품 제조업마*******tion김*희경상북도고령군성산면 상용로 328성산면 용소리 75-2개인사업체2
78섬유제품 제조업; 의복 제외직물 직조 및 직물제품 제조업직물제품 제조업커튼 및 유사제품 제조업(****진김*홍경상북도고령군다산면 아시터길 11-3다산면 나정리 86회사법인34
89섬유제품 제조업; 의복 제외직물 직조 및 직물제품 제조업직물제품 제조업커튼 및 유사제품 제조업메****프최*주경상북도고령군다산면 평리6길 18다산면 곽촌리 129-25개인사업체7
910섬유제품 제조업; 의복 제외직물 직조 및 직물제품 제조업직물제품 제조업커튼 및 유사제품 제조업태****정*웅경상북도고령군다산면 평리2길 1다산면 평리리 232-2개인사업체2
순번산업대분류명산업중분류명산업소분류명코드업종명업체명대표자명시도명시군구명도로명주소지번주소업체유형종사자수
159160고무 및 플라스틱제품 제조업플라스틱 제품 제조업1차 플라스틱제품 제조업플라스틱 필름 제조업주******미칼*강*진경상북도고령군다산면 다산산단로 236다산면 송곡리 1747회사법인77
160161고무 및 플라스틱제품 제조업플라스틱 제품 제조업1차 플라스틱제품 제조업플라스틱 필름 제조업동******곽*환경상북도고령군다산면 벌지로 16-6다산면 송곡리 455개인사업체5
161162고무 및 플라스틱제품 제조업플라스틱 제품 제조업1차 플라스틱제품 제조업플라스틱 필름 제조업태******서*규경상북도고령군다산면 좌학길 61다산면 상곡리 392개인사업체1
162163고무 및 플라스틱제품 제조업플라스틱 제품 제조업1차 플라스틱제품 제조업플라스틱 필름 제조업(******씨금*수경상북도고령군다산면 평리3길 29다산면 상곡리 147-1회사법인2
163164고무 및 플라스틱제품 제조업플라스틱 제품 제조업1차 플라스틱제품 제조업플라스틱 필름 제조업신******손*락경상북도고령군다산면 평리6길 16다산면 곽촌리 129-3개인사업체3
164165고무 및 플라스틱제품 제조업플라스틱 제품 제조업1차 플라스틱제품 제조업플라스틱 필름 제조업한******금*수경상북도고령군다산면 평리6길 13다산면 곽촌리 131-4개인사업체6
165166고무 및 플라스틱제품 제조업플라스틱 제품 제조업1차 플라스틱제품 제조업플라스틱 필름 제조업케******미칼여*수경상북도고령군개진면 반운공단길 14개진면 반운리 140회사법인9
166167고무 및 플라스틱제품 제조업플라스틱 제품 제조업1차 플라스틱제품 제조업플라스틱 필름 제조업일******주)고령지점*정*수경상북도고령군개진면 회천로 305개진면 반운리 740회사법인28
167168고무 및 플라스틱제품 제조업플라스틱 제품 제조업1차 플라스틱제품 제조업플라스틱 필름 제조업(주)******백*철경상북도고령군개진면 회천로 318개진면 반운리 751회사법인5
168169고무 및 플라스틱제품 제조업플라스틱 제품 제조업1차 플라스틱제품 제조업플라스틱 시트 및 판 제조업주******이*성경상북도고령군성산면 개경포로 2118성산면 득성리 237-11회사법인1