Overview

Dataset statistics

Number of variables: 5
Number of observations: 44
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.9 KiB
Average record size in memory: 44.0 B
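
The overview counters can be recomputed from the raw rows. The sketch below is a minimal illustration, not the profiler's own code: it uses three rows copied from the report's sample (the full file has 44), treats `None` and empty strings as missing, and counts a row as a duplicate when an identical row appeared earlier. The function name `overview` is hypothetical.

```python
# Three rows copied from the report's sample; the real dataset has 44 rows.
rows = [
    (1, "(주)하남자동차서비스", "자동차 종합 수리업", "4종", "경기도 하남시 하남대로 801 (신장동)"),
    (2, "(주)1급 현대신광서비스", "자동차 종합 수리업", "5종", "경기도 하남시 신장1로 5 (신장동)"),
    (3, "(주)대원산업", "플라스틱제품 제조업", "4종", "경기도 하남시 하남대로622번길 33 (천현동)"),
]


def overview(rows):
    """Return basic dataset counters (hypothetical helper, assumed definitions)."""
    n_obs = len(rows)
    n_vars = len(rows[0]) if rows else 0
    n_cells = n_obs * n_vars
    # Assumption: a cell is missing when it is None or an empty string.
    missing = sum(1 for row in rows for cell in row if cell is None or cell == "")
    # Assumption: every repeat of an already-seen row counts as one duplicate.
    duplicates = n_obs - len(set(rows))
    return {
        "n_variables": n_vars,
        "n_observations": n_obs,
        "missing_cells": missing,
        "missing_cells_pct": 100 * missing / n_cells if n_cells else 0.0,
        "duplicate_rows": duplicates,
        "duplicate_rows_pct": 100 * duplicates / n_obs if n_obs else 0.0,
    }


stats = overview(rows)
print(stats)
```

Applied to the full 44-row file, the same definitions should be consistent with the zero missing cells and zero duplicate rows reported above.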

Variable types

Numeric: 1
Text: 2
Categorical: 2

Dataset

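Description: Under Article 23 of the Clean Air Conservation Act (permission for and reporting of the installation of emission facilities), this office handles the reporting and management of Class 3, Class 4, and Class 5 facilities. Class 1 and Class 2 facilities are reported to and managed by Gyeonggi Province.
URL: https://www.data.go.kr/data/15080919/fileData.do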

Alerts

번호 has unique values (Unique)
주소 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 01:14:21.696626
Analysis finished: 2023-12-12 01:14:22.613688
Duration: 0.92 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호 (serial number)
Real number (ℝ)

UNIQUE

Distinct: 44
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 22.5
Minimum: 1
Maximum: 44
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 528.0 B

Quantile statistics

Minimum: 1
5-th percentile: 3.15
Q1: 11.75
Median: 22.5
Q3: 33.25
95-th percentile: 41.85
Maximum: 44
Range: 43
Interquartile range (IQR): 21.5
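
These quantiles are consistent with linear interpolation over the integers 1 to 44. The sketch below assumes that 번호 holds exactly those values (which the minimum, maximum, distinct count, and sum all suggest) and uses the standard library's `statistics.quantiles` with `method="inclusive"`. That the profiler uses the same interpolation rule is an assumption.

```python
import statistics

# Assumption: 번호 is the sequence 1..44, as the report's statistics suggest.
values = list(range(1, 45))

# "inclusive" interpolates linearly between the observed minimum and maximum.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

iqr = q3 - q1
value_range = max(values) - min(values)

print(p5, q1, median, q3, p95, value_range, iqr)
```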

Descriptive statistics

Standard deviation: 12.845233
Coefficient of variation (CV): 0.57089923
Kurtosis: -1.2
Mean: 22.5
Median Absolute Deviation (MAD): 11
Skewness: 0
Sum: 990
Variance: 165
Monotonicity: Strictly increasing
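
Most of these figures can be checked by hand with the standard library. The sketch assumes 번호 is the sequence 1 to 44 and that the variance is the sample variance (divisor n - 1); both are assumptions inferred from the reported numbers, not statements from the report.

```python
import math
import statistics

# Assumption: 번호 is the sequence 1..44.
values = list(range(1, 45))

mean = statistics.fmean(values)
variance = statistics.variance(values)   # sample variance, divisor n - 1
std = math.sqrt(variance)
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median of the distances from the median.
mad = statistics.median(abs(v - median) for v in values)
total = sum(values)
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(mean, variance, round(std, 6), round(cv, 8), mad, total, strictly_increasing)
```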
Histogram with fixed size bins (bins=44)

Common values

Value | Count | Frequency (%)
1 | 1 | 2.3%
24 | 1 | 2.3%
26 | 1 | 2.3%
27 | 1 | 2.3%
28 | 1 | 2.3%
29 | 1 | 2.3%
30 | 1 | 2.3%
31 | 1 | 2.3%
32 | 1 | 2.3%
33 | 1 | 2.3%
Other values (34) | 34 | 77.3%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 2.3%
2 | 1 | 2.3%
3 | 1 | 2.3%
4 | 1 | 2.3%
5 | 1 | 2.3%
6 | 1 | 2.3%
7 | 1 | 2.3%
8 | 1 | 2.3%
9 | 1 | 2.3%
10 | 1 | 2.3%

Maximum 10 values

Value | Count | Frequency (%)
44 | 1 | 2.3%
43 | 1 | 2.3%
42 | 1 | 2.3%
41 | 1 | 2.3%
40 | 1 | 2.3%
39 | 1 | 2.3%
38 | 1 | 2.3%
37 | 1 | 2.3%
36 | 1 | 2.3%
35 | 1 | 2.3%

사업장명 (business name)
Text

Distinct: 43
Distinct (%): 97.7%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B

Length

Max length: 19
Median length: 12
Mean length: 9.2727273
Min length: 3

Characters and Unicode

Total characters: 408
Distinct characters: 125
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
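
As an illustration of this kind of analysis, the standard library's `unicodedata.category` returns the two-letter general category of each code point. The sketch below counts categories for one business name taken from the report's sample; the helper name `category_counts` is hypothetical. The two-letter codes correspond to the long names used in this report: `Lo` is Other Letter, `Zs` is Space Separator, `Ps` and `Pe` are Open and Close Punctuation, and `Nd` is Decimal Number.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count Unicode general categories of the characters in text."""
    return Counter(unicodedata.category(ch) for ch in text)


# One value from the report's sample rows.
sample = "(주)1급 현대신광서비스"
counts = category_counts(sample)

for code, n in counts.most_common():
    print(code, n)
```

Summing such counters over every value of a column gives a per-category table like the ones in this section. Script and block lookups are not available in `unicodedata` and would need an additional data source.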

Unique

Unique: 42
Unique (%): 95.5%

Sample

1st row: (주)하남자동차서비스
2nd row: (주)1급 현대신광서비스
3rd row: (주)대원산업
4th row: 한성주물
5th row: 신장택시(유)

Value | Count | Frequency (%)
주식회사 | 5 | 7.2%
모터스 | 4 | 5.8%
서하남서비스 | 2 | 2.9%
기아오토큐 | 2 | 2.9%
한국가스공사 | 2 | 2.9%
미사 | 2 | 2.9%
경기지역본부 | 2 | 2.9%
동부자동차공업사 | 1 | 1.4%
카독크 | 1 | 1.4%
서하남공업사 | 1 | 1.4%
Other values (47) | 47 | 68.1%

Most occurring characters

ValueCountFrequency (%)
25
 
6.1%
25
 
6.1%
22
 
5.4%
) 16
 
3.9%
( 16
 
3.9%
13
 
3.2%
13
 
3.2%
12
 
2.9%
12
 
2.9%
9
 
2.2%
Other values (115) 245
60.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 343 | 84.1%
Space Separator | 25 | 6.1%
Close Punctuation | 16 | 3.9%
Open Punctuation | 16 | 3.9%
Decimal Number | 2 | 0.5%
Other Symbol | 2 | 0.5%
Lowercase Letter | 2 | 0.5%
Uppercase Letter | 1 | 0.2%
Dash Punctuation | 1 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
 
7.3%
22
 
6.4%
13
 
3.8%
13
 
3.8%
12
 
3.5%
12
 
3.5%
9
 
2.6%
9
 
2.6%
9
 
2.6%
8
 
2.3%
Other values (106) 211
61.5%

Lowercase Letter
Value | Count | Frequency (%)
p | 1 | 50.0%
s | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 25 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 16 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 16 | 100.0%

Decimal Number
Value | Count | Frequency (%)
1 | 2 | 100.0%

Other Symbol
ValueCountFrequency (%)
2
100.0%

Uppercase Letter
Value | Count | Frequency (%)
K | 1 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 345 | 84.6%
Common | 60 | 14.7%
Latin | 3 | 0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
 
7.2%
22
 
6.4%
13
 
3.8%
13
 
3.8%
12
 
3.5%
12
 
3.5%
9
 
2.6%
9
 
2.6%
9
 
2.6%
8
 
2.3%
Other values (107) 213
61.7%

Common
Value | Count | Frequency (%)
(space) | 25 | 41.7%
) | 16 | 26.7%
( | 16 | 26.7%
1 | 2 | 3.3%
- | 1 | 1.7%

Latin
Value | Count | Frequency (%)
K | 1 | 33.3%
p | 1 | 33.3%
s | 1 | 33.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 343 | 84.1%
ASCII | 63 | 15.4%
None | 2 | 0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
25
 
7.3%
22
 
6.4%
13
 
3.8%
13
 
3.8%
12
 
3.5%
12
 
3.5%
9
 
2.6%
9
 
2.6%
9
 
2.6%
8
 
2.3%
Other values (106) 211
61.5%

ASCII
Value | Count | Frequency (%)
(space) | 25 | 39.7%
) | 16 | 25.4%
( | 16 | 25.4%
1 | 2 | 3.2%
K | 1 | 1.6%
- | 1 | 1.6%
p | 1 | 1.6%
s | 1 | 1.6%

None
ValueCountFrequency (%)
2
100.0%

사업장업종 (business type)
Categorical

Distinct: 17
Distinct (%): 38.6%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B

Top categories: 자동차 종합 수리업 (17), 자동차 수리업, 도장 및 기타 피막처리업, 레미콘 제조업, 기타 대형 종합 소매업; other values (12)

Length

Max length: 20
Median length: 18
Mean length: 9.7045455
Min length: 3

Unique

Unique: 12
Unique (%): 27.3%

Sample

1st row: 자동차 종합 수리업
2nd row: 자동차 종합 수리업
3rd row: 플라스틱제품 제조업
4th row: 주물제조
5th row: 택시 운송업

Common Values

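Value | Count | Frequency (%)
자동차 종합 수리업 (general automobile repair) | 17 | 38.6%
자동차 수리업 (automobile repair) | 8 | 18.2%
도장 및 기타 피막처리업 (painting and other coating treatment) | 3 | 6.8%
레미콘 제조업 (ready-mixed concrete manufacturing) | 2 | 4.5%
기타 대형 종합 소매업 (other large general retail) | 2 | 4.5%
모조 귀금속 및 모조 장신용품 제조업 (imitation jewelry and imitation accessories manufacturing) | 1 | 2.3%
가스제조 및 배관 공급업 (gas manufacturing and pipeline supply) | 1 | 2.3%
가스 제조 및 배관공급업 (gas manufacturing and pipeline supply) | 1 | 2.3%
기타 스포츠 서비스업 (other sports services) | 1 | 2.3%
골프장 운영업 (golf course operation) | 1 | 2.3%
Other values (7) | 7 | 15.9%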

Length

Histogram of lengths of the category
ValueCountFrequency (%)
자동차 26
20.5%
수리업 26
20.5%
종합 20
15.7%
7
 
5.5%
기타 6
 
4.7%
제조업 5
 
3.9%
도장 3
 
2.4%
피막처리업 3
 
2.4%
대형 3
 
2.4%
소매업 3
 
2.4%
Other values (22) 25
19.7%

종별 (facility class)
Categorical

Distinct: 2
Distinct (%): 4.5%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B

Categories: 5종 (30), 4종 (14)

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 4종
2nd row: 5종
3rd row: 4종
4th row: 4종
5th row: 5종

Common Values

Value | Count | Frequency (%)
5종 (Class 5) | 30 | 68.2%
4종 (Class 4) | 14 | 31.8%
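
The frequency column is each count divided by the number of observations. A minimal sketch using the two counts reported above:

```python
from collections import Counter

# Counts taken from the Common Values table for 종별.
counts = Counter({"5종": 30, "4종": 14})
total = sum(counts.values())

# Share of each class as a percentage, rounded to one decimal place.
shares = {value: round(100 * n / total, 1) for value, n in counts.most_common()}

print(total, shares)
```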

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
5종 | 30 | 68.2%
4종 | 14 | 31.8%

주소 (address)
Text

UNIQUE

Distinct: 44
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B

Length

Max length: 48
Median length: 35
Mean length: 28.159091
Min length: 18

Characters and Unicode

Total characters: 1239
Distinct characters: 90
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 44
Unique (%): 100.0%

Sample

1st row: 경기도 하남시 하남대로 801 (신장동)
2nd row: 경기도 하남시 신장1로 5 (신장동)
3rd row: 경기도 하남시 하남대로622번길 33 (천현동)
4th row: 경기도 하남시 검단남로26번길 33 (하산곡동)
5th row: 경기도 하남시 대청로21번길 31 (신장동)

Value | Count | Frequency (%)
경기도 | 44 | 16.7%
하남시 | 44 | 16.7%
광암동 | 14 | 5.3%
신장동 | 9 | 3.4%
초이동 | 8 | 3.0%
8 | 7 | 2.7%
초광산단동로 | 6 | 2.3%
5 | 6 | 2.3%
초광산단동로6번길 | 5 | 1.9%
5층 | 4 | 1.5%
Other values (81) | 116 | 44.1%
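
A word-frequency table of this kind can be approximated by splitting each address into tokens and counting them. The sketch below uses the five sample addresses from this section. The tokenizer is an assumption: it splits on whitespace and drops parentheses, which is motivated by tokens such as 신장동 appearing above without them, but the profiler's actual tokenization rules are not stated in the report.

```python
import re
from collections import Counter

# The five sample rows shown for 주소.
addresses = [
    "경기도 하남시 하남대로 801 (신장동)",
    "경기도 하남시 신장1로 5 (신장동)",
    "경기도 하남시 하남대로622번길 33 (천현동)",
    "경기도 하남시 검단남로26번길 33 (하산곡동)",
    "경기도 하남시 대청로21번길 31 (신장동)",
]


def tokens(text):
    """Split on whitespace and parentheses (assumed tokenization rule)."""
    return re.findall(r"[^\s()]+", text)


word_counts = Counter(tok for address in addresses for tok in tokens(address))

for word, n in word_counts.most_common(4):
    print(word, n)
```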

Most occurring characters

ValueCountFrequency (%)
243
19.6%
57
 
4.6%
57
 
4.6%
55
 
4.4%
45
 
3.6%
44
 
3.6%
44
 
3.6%
44
 
3.6%
( 42
 
3.4%
42
 
3.4%
Other values (80) 566
45.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 749 | 60.5%
Space Separator | 243 | 19.6%
Decimal Number | 155 | 12.5%
Open Punctuation | 42 | 3.4%
Close Punctuation | 42 | 3.4%
Dash Punctuation | 4 | 0.3%
Math Symbol | 2 | 0.2%
Uppercase Letter | 2 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
57
 
7.6%
57
 
7.6%
55
 
7.3%
45
 
6.0%
44
 
5.9%
44
 
5.9%
44
 
5.9%
42
 
5.6%
30
 
4.0%
29
 
3.9%
Other values (63) 302
40.3%

Decimal Number
Value | Count | Frequency (%)
1 | 30 | 19.4%
5 | 23 | 14.8%
2 | 21 | 13.5%
8 | 16 | 10.3%
4 | 15 | 9.7%
3 | 15 | 9.7%
6 | 13 | 8.4%
0 | 10 | 6.5%
7 | 9 | 5.8%
9 | 3 | 1.9%

Uppercase Letter
Value | Count | Frequency (%)
B | 1 | 50.0%
F | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 243 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 42 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 42 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 4 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 749 | 60.5%
Common | 488 | 39.4%
Latin | 2 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
57
 
7.6%
57
 
7.6%
55
 
7.3%
45
 
6.0%
44
 
5.9%
44
 
5.9%
44
 
5.9%
42
 
5.6%
30
 
4.0%
29
 
3.9%
Other values (63) 302
40.3%

Common
Value | Count | Frequency (%)
(space) | 243 | 49.8%
( | 42 | 8.6%
) | 42 | 8.6%
1 | 30 | 6.1%
5 | 23 | 4.7%
2 | 21 | 4.3%
8 | 16 | 3.3%
4 | 15 | 3.1%
3 | 15 | 3.1%
6 | 13 | 2.7%
Other values (5) | 28 | 5.7%

Latin
Value | Count | Frequency (%)
B | 1 | 50.0%
F | 1 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 749 | 60.5%
ASCII | 490 | 39.5%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 243 | 49.6%
( | 42 | 8.6%
) | 42 | 8.6%
1 | 30 | 6.1%
5 | 23 | 4.7%
2 | 21 | 4.3%
8 | 16 | 3.3%
4 | 15 | 3.1%
3 | 15 | 3.1%
6 | 13 | 2.7%
Other values (7) | 30 | 6.1%

Hangul
ValueCountFrequency (%)
57
 
7.6%
57
 
7.6%
55
 
7.3%
45
 
6.0%
44
 
5.9%
44
 
5.9%
44
 
5.9%
42
 
5.6%
30
 
4.0%
29
 
3.9%
Other values (63) 302
40.3%

Interactions


Correlations

 | 번호 | 사업장명 | 사업장업종 | 종별 | 주소
번호 | 1.000 | 0.915 | 0.704 | 0.208 | 1.000
사업장명 | 0.915 | 1.000 | 0.996 | 0.000 | 1.000
사업장업종 | 0.704 | 0.996 | 1.000 | 0.595 | 1.000
종별 | 0.208 | 0.000 | 0.595 | 1.000 | 1.000
주소 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000

 | 종별 | 사업장업종
종별 | 1.000 | 0.425
사업장업종 | 0.425 | 1.000

 | 번호 | 사업장업종 | 종별
번호 | 1.000 | 0.311 | 0.126
사업장업종 | 0.311 | 1.000 | 0.425
종별 | 0.126 | 0.425 | 1.000
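
The report does not say which association measure produced each matrix. One common choice for a pair of categorical variables such as 종별 and 사업장업종 is Cramér's V, so the sketch below shows the uncorrected form of that statistic as an illustration only. The function name `cramers_v` and both contingency tables are hypothetical; they are not derived from this dataset, and the code is not expected to reproduce the values above.

```python
import math


def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    if n == 0:
        return 0.0
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0


# Hypothetical tables: perfect association and complete independence.
perfect = [[10, 0], [0, 10]]
independent = [[5, 5], [5, 5]]

print(cramers_v(perfect), cramers_v(independent))
```

The statistic ranges from 0 (no association) to 1 (one variable determines the other). Some implementations apply a bias correction, which lowers the value for small samples like this one.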

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completeness.

Sample

First rows

index | 번호 | 사업장명 | 사업장업종 | 종별 | 주소
0 | 1 | (주)하남자동차서비스 | 자동차 종합 수리업 | 4종 | 경기도 하남시 하남대로 801 (신장동)
1 | 2 | (주)1급 현대신광서비스 | 자동차 종합 수리업 | 5종 | 경기도 하남시 신장1로 5 (신장동)
2 | 3 | (주)대원산업 | 플라스틱제품 제조업 | 4종 | 경기도 하남시 하남대로622번길 33 (천현동)
3 | 4 | 한성주물 | 주물제조 | 4종 | 경기도 하남시 검단남로26번길 33 (하산곡동)
4 | 5 | 신장택시(유) | 택시 운송업 | 5종 | 경기도 하남시 대청로21번길 31 (신장동)
5 | 6 | 동부자동차공업사 | 자동차 전문 수리업 | 5종 | 경기도 하남시 하남대로787번길 6 (신장동)
6 | 7 | 보람범퍼 | 귀금속 장신구 및 관련제품 제조업 | 4종 | 경기도 하남시 하남대로232번길 31 (상산곡동)
7 | 8 | 주심유황참숯가마 | 도장 및 기타 피막처리업 | 4종 | 경기도 하남시 초광로 169 (초이동)
8 | 9 | 하남종합서비스기아오토큐(주) | 숯가마 | 5종 | 경기도 하남시 하남대로802번길 5-6 (신장동)
9 | 10 | (주)을지전기 | 자동차 수리업 | 5종 | 경기도 하남시 감일로15번길 54 (감일동)

Last rows

index | 번호 | 사업장명 | 사업장업종 | 종별 | 주소
34 | 35 | 주식회사 덴판도 | 자동차 수리업 | 5종 | 경기도 하남시 초광산단동로6번길 8 디엠모터스 주식회사 2층 (광암동)
35 | 36 | (주)이마트 하남점 | 자동차 수리업 | 4종 | 경기도 하남시 덕풍서로 70 하남풍산지구 이마트 (덕풍동)
36 | 37 | (주)라인디자인모형 | 대형 종합 소매업 | 5종 | 경기도 하남시 미사강변서로 25 미사 테스타타워 지식산업센터 2층 224호 (풍산동)
37 | 38 | 더클래스효성 주식회사 하남지점 | 자동차 종합 수리업 | 5종 | 경기도 하남시 감초로 188 5층 (초이동)
38 | 39 | 서하남서비스 기아오토큐 | 자동차 종합 수리업 | 4종 | 경기도 하남시 광암동 401-1
39 | 40 | (주)테브코리아 | 자동차 종합 수리업 | 5종 | 경기도 하남시 초광산단동로 5 지하1층 (광암동)
40 | 41 | 에이투지 | 자동차 종합 수리업 | 5종 | 경기도 하남시 초광산단동로 5 5층 (광암동)
41 | 42 | 초이 현대 모터스 | 자동차 종합 수리업 | 5종 | 경기도 하남시 초광산단동로 5 4층 (광암동)
42 | 43 | 럭키 카독크 | 자동차 종합 수리업 | 5종 | 경기도 하남시 초광산단동로 5 2층 (광암동)
43 | 44 | 리드라인 모터스 | 자동차 종합 수리업 | 5종 | 경기도 하남시 초광산단동로 5 3층 (광암동)