Overview

Dataset statistics

Number of variables7
Number of observations271
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.2 KiB
Average record size in memory57.5 B

Variable types

Numeric1
Text4
Categorical2

Dataset

Description이 데이터는 대기배출시설을 설치하고 신고한 사업장에 대한 현황으로, 업체명, 대표자, 주소, 업종 등을 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=347&beforeMenuCd=DOM_000000201001001000&publicdatapk=15080575

Alerts

데이터기준일자 has constant value ""Constant
종별 is highly imbalanced (53.7%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2024-01-09 22:24:38.234718
Analysis finished2024-01-09 22:24:38.869842
Duration0.64 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct271
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean136
Minimum1
Maximum271
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.5 KiB
2024-01-10T07:24:38.931642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile14.5
Q168.5
median136
Q3203.5
95-th percentile257.5
Maximum271
Range270
Interquartile range (IQR)135

Descriptive statistics

Standard deviation78.375166
Coefficient of variation (CV)0.57628799
Kurtosis-1.2
Mean136
Median Absolute Deviation (MAD)68
Skewness0
Sum36856
Variance6142.6667
MonotonicityStrictly increasing
2024-01-10T07:24:39.039754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
180 1
 
0.4%
186 1
 
0.4%
185 1
 
0.4%
184 1
 
0.4%
183 1
 
0.4%
182 1
 
0.4%
181 1
 
0.4%
179 1
 
0.4%
2 1
 
0.4%
Other values (261) 261
96.3%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
271 1
0.4%
270 1
0.4%
269 1
0.4%
268 1
0.4%
267 1
0.4%
266 1
0.4%
265 1
0.4%
264 1
0.4%
263 1
0.4%
262 1
0.4%
Distinct262
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2024-01-10T07:24:39.212102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length13
Mean length7.6900369
Min length2

Characters and Unicode

Total characters2084
Distinct characters277
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique253 ?
Unique (%)93.4%

Sample

1st row삼남제약(주)
2nd row(주)광성화학
3rd row경기광업(주)
4th row중앙목욕탕
5th row광흥제면
ValueCountFrequency (%)
주식회사 9
 
3.0%
금산공장 3
 
1.0%
주)에스코알티에스 2
 
0.7%
주)이에스에프씨티 2
 
0.7%
아스폴리머 2
 
0.7%
제2공장 2
 
0.7%
주)휴온스네이처 2
 
0.7%
농업회사법인 2
 
0.7%
주)광성화학 2
 
0.7%
주)유성화연테크 2
 
0.7%
Other values (260) 268
90.5%
2024-01-10T07:24:39.503469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
187
 
9.0%
) 175
 
8.4%
( 175
 
8.4%
78
 
3.7%
65
 
3.1%
45
 
2.2%
38
 
1.8%
37
 
1.8%
36
 
1.7%
32
 
1.5%
Other values (267) 1216
58.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1687
81.0%
Close Punctuation 176
 
8.4%
Open Punctuation 176
 
8.4%
Space Separator 25
 
1.2%
Decimal Number 14
 
0.7%
Uppercase Letter 4
 
0.2%
Other Punctuation 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
187
 
11.1%
78
 
4.6%
65
 
3.9%
45
 
2.7%
38
 
2.3%
37
 
2.2%
36
 
2.1%
32
 
1.9%
32
 
1.9%
26
 
1.5%
Other values (251) 1111
65.9%
Decimal Number
ValueCountFrequency (%)
2 7
50.0%
1 2
 
14.3%
8 2
 
14.3%
9 2
 
14.3%
3 1
 
7.1%
Uppercase Letter
ValueCountFrequency (%)
G 1
25.0%
E 1
25.0%
S 1
25.0%
M 1
25.0%
Close Punctuation
ValueCountFrequency (%)
) 175
99.4%
] 1
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 175
99.4%
[ 1
 
0.6%
Space Separator
ValueCountFrequency (%)
25
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1687
81.0%
Common 393
 
18.9%
Latin 4
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
187
 
11.1%
78
 
4.6%
65
 
3.9%
45
 
2.7%
38
 
2.3%
37
 
2.2%
36
 
2.1%
32
 
1.9%
32
 
1.9%
26
 
1.5%
Other values (251) 1111
65.9%
Common
ValueCountFrequency (%)
) 175
44.5%
( 175
44.5%
25
 
6.4%
2 7
 
1.8%
1 2
 
0.5%
8 2
 
0.5%
9 2
 
0.5%
] 1
 
0.3%
/ 1
 
0.3%
[ 1
 
0.3%
Other values (2) 2
 
0.5%
Latin
ValueCountFrequency (%)
G 1
25.0%
E 1
25.0%
S 1
25.0%
M 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1687
81.0%
ASCII 397
 
19.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
187
 
11.1%
78
 
4.6%
65
 
3.9%
45
 
2.7%
38
 
2.3%
37
 
2.2%
36
 
2.1%
32
 
1.9%
32
 
1.9%
26
 
1.5%
Other values (251) 1111
65.9%
ASCII
ValueCountFrequency (%)
) 175
44.1%
( 175
44.1%
25
 
6.3%
2 7
 
1.8%
1 2
 
0.5%
8 2
 
0.5%
9 2
 
0.5%
] 1
 
0.3%
/ 1
 
0.3%
[ 1
 
0.3%
Other values (6) 6
 
1.5%
Distinct212
Distinct (%)78.2%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2024-01-10T07:24:39.762348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length3
Mean length3.3468635
Min length3

Characters and Unicode

Total characters907
Distinct characters153
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique199 ?
Unique (%)73.4%

Sample

1st row대표이사
2nd row김광래
3rd row권희문
4th row김민수
5th row최광식
ValueCountFrequency (%)
대표이사 46
 
16.6%
장호윤 3
 
1.1%
이병호 3
 
1.1%
송재인 2
 
0.7%
임종득 2
 
0.7%
김용기 2
 
0.7%
노민성 2
 
0.7%
송석진 2
 
0.7%
신상오 2
 
0.7%
김정림 2
 
0.7%
Other values (207) 211
76.2%
2024-01-10T07:24:40.124532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
84
 
9.3%
50
 
5.5%
48
 
5.3%
47
 
5.2%
35
 
3.9%
27
 
3.0%
17
 
1.9%
17
 
1.9%
15
 
1.7%
15
 
1.7%
Other values (143) 552
60.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 892
98.3%
Space Separator 9
 
1.0%
Open Punctuation 2
 
0.2%
Close Punctuation 2
 
0.2%
Decimal Number 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
84
 
9.4%
50
 
5.6%
48
 
5.4%
47
 
5.3%
35
 
3.9%
27
 
3.0%
17
 
1.9%
17
 
1.9%
15
 
1.7%
15
 
1.7%
Other values (139) 537
60.2%
Space Separator
ValueCountFrequency (%)
9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Decimal Number
ValueCountFrequency (%)
1 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 892
98.3%
Common 15
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
84
 
9.4%
50
 
5.6%
48
 
5.4%
47
 
5.3%
35
 
3.9%
27
 
3.0%
17
 
1.9%
17
 
1.9%
15
 
1.7%
15
 
1.7%
Other values (139) 537
60.2%
Common
ValueCountFrequency (%)
9
60.0%
( 2
 
13.3%
) 2
 
13.3%
1 2
 
13.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 892
98.3%
ASCII 15
 
1.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
84
 
9.4%
50
 
5.6%
48
 
5.4%
47
 
5.3%
35
 
3.9%
27
 
3.0%
17
 
1.9%
17
 
1.9%
15
 
1.7%
15
 
1.7%
Other values (139) 537
60.2%
ASCII
ValueCountFrequency (%)
9
60.0%
( 2
 
13.3%
) 2
 
13.3%
1 2
 
13.3%

주소
Text

Distinct249
Distinct (%)91.9%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2024-01-10T07:24:40.346972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length35
Mean length22.402214
Min length17

Characters and Unicode

Total characters6071
Distinct characters169
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique233 ?
Unique (%)86.0%

Sample

1st row충청남도 금산군 금산읍 인삼로 77
2nd row충청남도 금산군 추부면 마전리 176-2
3rd row충청남도 금산군 진산면 휴양림로 2224
4th row충청남도 금산군 금산읍 중도리 506
5th row충청남도 금산군 추부면 추풍로 152
ValueCountFrequency (%)
충청남도 271
19.2%
금산군 271
19.2%
추부면 101
 
7.2%
복수면 62
 
4.4%
금성면 35
 
2.5%
진산면 24
 
1.7%
다복로 23
 
1.6%
군북면 19
 
1.3%
금산읍 18
 
1.3%
용천로 17
 
1.2%
Other values (352) 567
40.3%
2024-01-10T07:24:40.648792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1213
20.0%
354
 
5.8%
348
 
5.7%
305
 
5.0%
278
 
4.6%
273
 
4.5%
271
 
4.5%
271
 
4.5%
254
 
4.2%
1 174
 
2.9%
Other values (159) 2330
38.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3817
62.9%
Space Separator 1213
 
20.0%
Decimal Number 894
 
14.7%
Dash Punctuation 88
 
1.4%
Close Punctuation 26
 
0.4%
Open Punctuation 26
 
0.4%
Other Symbol 5
 
0.1%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
354
 
9.3%
348
 
9.1%
305
 
8.0%
278
 
7.3%
273
 
7.2%
271
 
7.1%
271
 
7.1%
254
 
6.7%
163
 
4.3%
109
 
2.9%
Other values (143) 1191
31.2%
Decimal Number
ValueCountFrequency (%)
1 174
19.5%
2 127
14.2%
4 101
11.3%
5 92
10.3%
3 87
9.7%
6 87
9.7%
7 79
8.8%
8 65
 
7.3%
9 42
 
4.7%
0 40
 
4.5%
Space Separator
ValueCountFrequency (%)
1213
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 88
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3822
63.0%
Common 2247
37.0%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
354
 
9.3%
348
 
9.1%
305
 
8.0%
278
 
7.3%
273
 
7.1%
271
 
7.1%
271
 
7.1%
254
 
6.6%
163
 
4.3%
109
 
2.9%
Other values (144) 1196
31.3%
Common
ValueCountFrequency (%)
1213
54.0%
1 174
 
7.7%
2 127
 
5.7%
4 101
 
4.5%
5 92
 
4.1%
- 88
 
3.9%
3 87
 
3.9%
6 87
 
3.9%
7 79
 
3.5%
8 65
 
2.9%
Other values (4) 134
 
6.0%
Latin
ValueCountFrequency (%)
B 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3817
62.9%
ASCII 2249
37.0%
None 5
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1213
53.9%
1 174
 
7.7%
2 127
 
5.6%
4 101
 
4.5%
5 92
 
4.1%
- 88
 
3.9%
3 87
 
3.9%
6 87
 
3.9%
7 79
 
3.5%
8 65
 
2.9%
Other values (5) 136
 
6.0%
Hangul
ValueCountFrequency (%)
354
 
9.3%
348
 
9.1%
305
 
8.0%
278
 
7.3%
273
 
7.2%
271
 
7.1%
271
 
7.1%
254
 
6.7%
163
 
4.3%
109
 
2.9%
Other values (143) 1191
31.2%
None
ValueCountFrequency (%)
5
100.0%

업종
Text

Distinct71
Distinct (%)26.2%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2024-01-10T07:24:40.886632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length1
Mean length5.9409594
Min length1

Characters and Unicode

Total characters1610
Distinct characters139
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)18.5%

Sample

1st row의약품 제조업
2nd row기타 비금속광물 광업
3rd row
4th row
5th row
ValueCountFrequency (%)
제조업 76
 
19.7%
31
 
8.0%
기타 30
 
7.8%
인삼식품 9
 
2.3%
폐기물 8
 
2.1%
처리업 8
 
2.1%
자동차 7
 
1.8%
수리업 7
 
1.8%
생산업 7
 
1.8%
산업용 6
 
1.6%
Other values (119) 197
51.0%
2024-01-10T07:24:41.262016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
423
26.3%
124
 
7.7%
114
 
7.1%
93
 
5.8%
54
 
3.4%
46
 
2.9%
33
 
2.0%
33
 
2.0%
27
 
1.7%
24
 
1.5%
Other values (129) 639
39.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1186
73.7%
Space Separator 423
 
26.3%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
124
 
10.5%
114
 
9.6%
93
 
7.8%
54
 
4.6%
46
 
3.9%
33
 
2.8%
33
 
2.8%
27
 
2.3%
24
 
2.0%
22
 
1.9%
Other values (127) 616
51.9%
Space Separator
ValueCountFrequency (%)
423
100.0%
Other Punctuation
ValueCountFrequency (%)
· 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1186
73.7%
Common 424
 
26.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
124
 
10.5%
114
 
9.6%
93
 
7.8%
54
 
4.6%
46
 
3.9%
33
 
2.8%
33
 
2.8%
27
 
2.3%
24
 
2.0%
22
 
1.9%
Other values (127) 616
51.9%
Common
ValueCountFrequency (%)
423
99.8%
· 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1186
73.7%
ASCII 423
 
26.3%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
423
100.0%
Hangul
ValueCountFrequency (%)
124
 
10.5%
114
 
9.6%
93
 
7.8%
54
 
4.6%
46
 
3.9%
33
 
2.8%
33
 
2.8%
27
 
2.3%
24
 
2.0%
22
 
1.9%
Other values (127) 616
51.9%
None
ValueCountFrequency (%)
· 1
100.0%

종별
Categorical

IMBALANCE 

Distinct5
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
5종
190 
4종
71 
3종
 
8
2종
 
1
 
1

Length

Max length2
Median length2
Mean length1.99631
Min length1

Unique

Unique2 ?
Unique (%)0.7%

Sample

1st row4종
2nd row4종
3rd row4종
4th row5종
5th row5종

Common Values

ValueCountFrequency (%)
5종 190
70.1%
4종 71
 
26.2%
3종 8
 
3.0%
2종 1
 
0.4%
1
 
0.4%

Length

2024-01-10T07:24:41.370182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:24:41.450449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5종 190
70.4%
4종 71
 
26.3%
3종 8
 
3.0%
2종 1
 
0.4%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2021-04-14
271 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-04-14
2nd row2021-04-14
3rd row2021-04-14
4th row2021-04-14
5th row2021-04-14

Common Values

ValueCountFrequency (%)
2021-04-14 271
100.0%

Length

2024-01-10T07:24:41.532892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:24:41.600360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-04-14 271
100.0%

Interactions

2024-01-10T07:24:38.655797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T07:24:41.643927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종종별
연번1.0000.4810.242
업종0.4811.0000.000
종별0.2420.0001.000
2024-01-10T07:24:41.708134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번종별
연번1.0000.106
종별0.1061.000

Missing values

2024-01-10T07:24:38.746061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:24:38.831946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업체명대표자주소업종종별데이터기준일자
01삼남제약(주)대표이사충청남도 금산군 금산읍 인삼로 77의약품 제조업4종2021-04-14
12(주)광성화학김광래충청남도 금산군 추부면 마전리 176-2기타 비금속광물 광업4종2021-04-14
23경기광업(주)권희문충청남도 금산군 진산면 휴양림로 22244종2021-04-14
34중앙목욕탕김민수충청남도 금산군 금산읍 중도리 5065종2021-04-14
45광흥제면최광식충청남도 금산군 추부면 추풍로 1525종2021-04-14
56주안아스콘(주)유인식충청남도 금산군 진산면 실학로 463아스콘 제조업3종2021-04-14
67(주)삼진당박동선충청남도 금산군 금산읍 양지리 16-14종2021-04-14
78(주)EG대표이사충청남도 금산군 추부면 서대산로 4594종2021-04-14
89대륙화학공업(주) 금산공장송인혁충청남도 금산군 복수면 용진리 115-5산업용 비경화고무제품 제조업3종2021-04-14
910(주)금성방적윤용근충청남도 금산군 복수면 용진리 115-75종2021-04-14
연번업체명대표자주소업종종별데이터기준일자
261262휴모터스정화영충청남도 금산군 추부면 다복로 655자동차 종합 수리업4종2021-04-14
262263(주)휴온스네이처대표이사충청남도 금산군 금산읍 인삼광장로 19인삼식품 제조업4종2021-04-14
263264(주)신화기전대표이사충청남도 금산군 추부면 이터골길 71-35기타 절연선 및 케이블 제조업5종2021-04-14
264265(주)금풍제과박종원충청남도 금산군 추부면 다복로 646 금풍제과코코아 제품 및 과자류 제조업5종2021-04-14
265266한국아카데미하우스조현식충청남도 금산군 부리면 적벽강로 378 한국타이어아카데미하우스사회서비스 관리 행정5종2021-04-14
266267세정산업조규형충청남도 금산군 복수면 복수로 709도장및기타피막처리업5종2021-04-14
267268영광수지김덕기충청남도 금산군 군북면 군북로 828 824지정외 폐기물 처리업5종2021-04-14
268269금산군 가축분뇨공공처리시설금산군수충청남도 금산군 금산읍 신대리 54-4지방행정 집행기관5종2021-04-14
269270대주개발(주)오용대 외 1인충청남도 금산군 진산면 실학로 4635종2021-04-14
270271대산철강공업(주)한평용충청남도 금산군 추부면 추풍로 168 대산철강공업㈜기타 식품 첨가물 제조업4종2021-04-14