Overview

Dataset statistics

Number of variables: 7
Number of observations: 458
Missing cells: 36
Missing cells (%): 1.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 25.6 KiB
Average record size in memory: 57.3 B
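
The percentages in this table follow from the counts by simple arithmetic. A minimal Python sketch, using only figures reported in this section (the variable names are illustrative):

```python
# Figures copied from the Dataset statistics table.
n_variables = 7
n_observations = 458
missing_cells = 36
total_size_kib = 25.6  # already rounded to 0.1 KiB in the report

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Approximate only, because the reported total size is itself rounded.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: about {avg_record_bytes:.1f} B")
```

The recomputed record size can differ slightly from the reported 57.3 B, which is expected given the rounding of the total size.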

Variable types

Numeric: 1
Categorical: 3
Text: 3

Dataset

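Description: Data on noise- and vibration-emitting businesses in Namyangju-si, providing fields such as business name, address, industry (manufacturing, sanitary containers, laundry, etc.), and management grade (excellent management).
Author: Namyangju-si, Gyeonggi-do (경기도 남양주시)
URL: https://www.data.go.kr/data/3044899/fileData.do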

Alerts

관할기관 has constant value "남양주시" (Constant)
데이터기준일 has constant value "2023-01-01" (Constant)
연번 is highly overall correlated with 관리등급 (High correlation)
관리등급 is highly overall correlated with 연번 (High correlation)
업종 has 36 (7.9%) missing values (Missing)
연번 has unique values (Unique)
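
The constant, unique, and missing alerts can be read as simple per-column rules. The sketch below illustrates such rules on a tiny hypothetical table; it is not ydata-profiling's actual implementation, and the function name `simple_alerts` and the four sample rows are assumptions made for the example.

```python
def simple_alerts(column_name, values):
    """Return alert strings for one column; None marks a missing cell."""
    alerts = []
    present = [v for v in values if v is not None]
    n_missing = len(values) - len(present)
    distinct = set(present)

    if n_missing:
        pct = 100 * n_missing / len(values)
        alerts.append(f"{column_name} has {n_missing} ({pct:.1f}%) missing values")
    if len(distinct) == 1:
        alerts.append(f'{column_name} has constant value "{present[0]}"')
    elif present and len(distinct) == len(values):
        alerts.append(f"{column_name} has unique values")
    return alerts


# Hypothetical four-row excerpt shaped like this dataset.
columns = {
    "연번": [1, 2, 3, 4],
    "관할기관": ["남양주시"] * 4,
    "업종": ["제조업", None, "도장", "제조업"],
}
for name, values in columns.items():
    for alert in simple_alerts(name, values):
        print(alert)
```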

Reproduction

Analysis started: 2023-12-12 12:56:52.251721
Analysis finished: 2023-12-12 12:56:53.082975
Duration: 0.83 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
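
A report of this kind is typically produced by loading the source file into pandas and passing the DataFrame to ydata-profiling. The sketch below shows that flow under stated assumptions: the `cp949` encoding, the output file name, and the report title are hypothetical, and the settings of the original run (stored in config.json) are not reproduced here.

```python
def build_profile(csv_path, output_html="report.html", encoding="cp949"):
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported lazily so the function can be defined without the libraries installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the downloaded file.
    df = pd.read_csv(csv_path, encoding=encoding)
    profile = ProfileReport(df, title="Dataset profile")
    profile.to_file(output_html)
    return profile
```

Calling `build_profile` with the path of the CSV downloaded from the URL listed under Dataset would then write the report to `report.html`.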

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 458
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 229.5
Minimum: 1
Maximum: 458
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 23.85
Q1: 115.25
Median: 229.5
Q3: 343.75
95-th percentile: 435.15
Maximum: 458
Range: 457
Interquartile range (IQR): 228.5

Descriptive statistics

Standard deviation: 132.35747
Coefficient of variation (CV): 0.576721
Kurtosis: -1.2
Mean: 229.5
Median Absolute Deviation (MAD): 114.5
Skewness: 0
Sum: 105111
Variance: 17518.5
Monotonicity: Strictly increasing
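
These statistics are what one would expect if 연번 is simply the row counter 1, 2, ..., 458, which the reported minimum, maximum, distinct count, and strict monotonicity suggest. A minimal sketch that recomputes them with Python's standard library under that assumption (the `inclusive` quantile method is chosen here because it reproduces the reported quartiles; the report does not state which method it uses):

```python
import statistics

values = list(range(1, 459))  # assumed contents of 연번: 1..458

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (divides by n - 1)
std_dev = statistics.stdev(values)
cv = std_dev / mean
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
# Median absolute deviation around the median.
mad = statistics.median(abs(v - median) for v in values)

print(f"sum={sum(values)} mean={mean} variance={variance}")
print(f"std={std_dev:.5f} cv={cv:.6f}")
print(f"Q1={q1} median={median} Q3={q3} IQR={q3 - q1} MAD={mad}")
```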
Histogram with fixed size bins (bins=50)
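
A histogram with fixed-size bins divides the range between the minimum and maximum into equal-width intervals and counts the values that fall in each. A minimal sketch of that binning, assuming 연번 holds the integers 1 to 458; `fixed_width_histogram` is an illustrative helper, not the plotting code used by the report:

```python
def fixed_width_histogram(values, bins):
    """Count values in `bins` equal-width intervals spanning [min, max]."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins
    counts = [0] * bins
    if width == 0:
        # Constant column: everything lands in the first bin.
        counts[0] = len(values)
        return counts
    for v in values:
        # The maximum value belongs to the last bin rather than a new one.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    return counts


counts = fixed_width_histogram(list(range(1, 459)), bins=50)
print(counts)
```

Because the assumed values are evenly spaced, every bin holds either 9 or 10 rows, which corresponds to a flat histogram.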
Value | Count | Frequency (%)
1 | 1 | 0.2%
316 | 1 | 0.2%
314 | 1 | 0.2%
313 | 1 | 0.2%
312 | 1 | 0.2%
311 | 1 | 0.2%
310 | 1 | 0.2%
309 | 1 | 0.2%
308 | 1 | 0.2%
307 | 1 | 0.2%
Other values (448) | 448 | 97.8%

Value | Count | Frequency (%)
1 | 1 | 0.2%
2 | 1 | 0.2%
3 | 1 | 0.2%
4 | 1 | 0.2%
5 | 1 | 0.2%
6 | 1 | 0.2%
7 | 1 | 0.2%
8 | 1 | 0.2%
9 | 1 | 0.2%
10 | 1 | 0.2%

Value | Count | Frequency (%)
458 | 1 | 0.2%
457 | 1 | 0.2%
456 | 1 | 0.2%
455 | 1 | 0.2%
454 | 1 | 0.2%
453 | 1 | 0.2%
452 | 1 | 0.2%
451 | 1 | 0.2%
450 | 1 | 0.2%
449 | 1 | 0.2%

관할기관
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.7 KiB
남양주시: 458

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 남양주시
2nd row: 남양주시
3rd row: 남양주시
4th row: 남양주시
5th row: 남양주시

Common Values

Value | Count | Frequency (%)
남양주시 | 458 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
남양주시 | 458 | 100.0%

사업장명
Text

Distinct: 457
Distinct (%): 99.8%
Missing: 0
Missing (%): 0.0%
Memory size: 3.7 KiB

Length

Max length: 18
Median length: 17
Mean length: 5.419214
Min length: 2

Characters and Unicode

Total characters: 2482
Distinct characters: 344
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
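
As an illustration of how such character properties are obtained, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. The five business names are taken from this report's sample rows; the tally covers only those five names, not the full column.

```python
import unicodedata
from collections import Counter

# Five business names taken from this report's sample rows.
names = ["㈜동림메라민", "한미산업", "태양석재(코리아산업)", "세현엔지니어링", "삼창금속공업사"]

# General category codes: Lo = Other Letter, So = Other Symbol,
# Ps = Open Punctuation, Pe = Close Punctuation.
category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)
for code, count in category_counts.most_common():
    print(code, count)
```

Hangul syllables fall under Other Letter, while the enclosed character ㈜ is classified as Other Symbol.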

Unique

Unique: 456
Unique (%): 99.6%

Sample

1st row: ㈜동림메라민
2nd row: 한미산업
3rd row: 태양석재(코리아산업)
4th row: 세현엔지니어링
5th row: 삼창금속공업사

Value | Count | Frequency (%)
주식회사 | 14 | 2.9%
㈜한일종합주방 | 2 | 0.4%
남양주지점 | 2 | 0.4%
농업회사법인 | 2 | 0.4%
제2공장 | 2 | 0.4%
신영포엠 | 1 | 0.2%
㈜청암 | 1 | 0.2%
청원유리 | 1 | 0.2%
대흥도장 | 1 | 0.2%
동광제재 | 1 | 0.2%
Other values (459) | 459 | 94.4%

Most occurring characters

ValueCountFrequency (%)
152
 
6.1%
90
 
3.6%
80
 
3.2%
69
 
2.8%
62
 
2.5%
47
 
1.9%
47
 
1.9%
39
 
1.6%
38
 
1.5%
36
 
1.5%
Other values (334) 1822
73.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2211 | 89.1%
Other Symbol | 152 | 6.1%
Uppercase Letter | 52 | 2.1%
Space Separator | 28 | 1.1%
Open Punctuation | 12 | 0.5%
Close Punctuation | 12 | 0.5%
Decimal Number | 10 | 0.4%
Other Punctuation | 2 | 0.1%
Lowercase Letter | 2 | 0.1%
Dash Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
90
 
4.1%
80
 
3.6%
69
 
3.1%
62
 
2.8%
47
 
2.1%
47
 
2.1%
39
 
1.8%
38
 
1.7%
36
 
1.6%
32
 
1.4%
Other values (301) 1671
75.6%
Uppercase Letter
ValueCountFrequency (%)
N 6
11.5%
S 5
 
9.6%
E 5
 
9.6%
C 4
 
7.7%
O 4
 
7.7%
A 4
 
7.7%
I 3
 
5.8%
P 3
 
5.8%
T 3
 
5.8%
V 2
 
3.8%
Other values (9) 13
25.0%
Decimal Number
ValueCountFrequency (%)
2 8
80.0%
1 1
 
10.0%
3 1
 
10.0%
Open Punctuation
ValueCountFrequency (%)
( 11
91.7%
[ 1
 
8.3%
Close Punctuation
ValueCountFrequency (%)
) 11
91.7%
] 1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
. 1
50.0%
& 1
50.0%
Lowercase Letter
ValueCountFrequency (%)
a 1
50.0%
b 1
50.0%
Other Symbol
ValueCountFrequency (%)
152
100.0%
Space Separator
ValueCountFrequency (%)
28
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2363 | 95.2%
Common | 65 | 2.6%
Latin | 54 | 2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
152
 
6.4%
90
 
3.8%
80
 
3.4%
69
 
2.9%
62
 
2.6%
47
 
2.0%
47
 
2.0%
39
 
1.7%
38
 
1.6%
36
 
1.5%
Other values (302) 1703
72.1%
Latin
ValueCountFrequency (%)
N 6
 
11.1%
S 5
 
9.3%
E 5
 
9.3%
C 4
 
7.4%
O 4
 
7.4%
A 4
 
7.4%
I 3
 
5.6%
P 3
 
5.6%
T 3
 
5.6%
V 2
 
3.7%
Other values (11) 15
27.8%
Common
ValueCountFrequency (%)
28
43.1%
( 11
 
16.9%
) 11
 
16.9%
2 8
 
12.3%
1 1
 
1.5%
- 1
 
1.5%
] 1
 
1.5%
[ 1
 
1.5%
. 1
 
1.5%
3 1
 
1.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2211 | 89.1%
None | 152 | 6.1%
ASCII | 119 | 4.8%

Most frequent character per block

None
ValueCountFrequency (%)
152
100.0%
Hangul
ValueCountFrequency (%)
90
 
4.1%
80
 
3.6%
69
 
3.1%
62
 
2.8%
47
 
2.1%
47
 
2.1%
39
 
1.8%
38
 
1.7%
36
 
1.6%
32
 
1.4%
Other values (301) 1671
75.6%
ASCII
ValueCountFrequency (%)
28
23.5%
( 11
 
9.2%
) 11
 
9.2%
2 8
 
6.7%
N 6
 
5.0%
S 5
 
4.2%
E 5
 
4.2%
C 4
 
3.4%
O 4
 
3.4%
A 4
 
3.4%
Other values (22) 33
27.7%

소재지 주소
Text

Distinct: 443
Distinct (%): 96.7%
Missing: 0
Missing (%): 0.0%
Memory size: 3.7 KiB

Length

Max length: 44
Median length: 39
Mean length: 24.851528
Min length: 17

Characters and Unicode

Total characters: 11382
Distinct characters: 143
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 431
Unique (%): 94.1%

Sample

1st row: 경기도 남양주시 진접읍 부마로 234
2nd row: 경기도 남양주시 진접읍 부마로80번길 2-1
3rd row: 경기도 남양주시 진접읍 양진로926번길 46
4th row: 경기도 남양주시 진접읍 경복대로271번길 38
5th row: 경기도 남양주시 오남읍 양지로319번길 17-10

Value | Count | Frequency (%)
경기도 | 458 | 19.7%
남양주시 | 458 | 19.7%
진접읍 | 154 | 6.6%
화도읍 | 115 | 4.9%
수동면 | 72 | 3.1%
오남읍 | 46 | 2.0%
진건읍 | 29 | 1.2%
소래비로 | 22 | 0.9%
비룡로 | 19 | 0.8%
와부읍 | 19 | 0.8%
Other values (551) | 937 | 40.2%

Most occurring characters

ValueCountFrequency (%)
1879
 
16.5%
574
 
5.0%
529
 
4.6%
506
 
4.4%
505
 
4.4%
472
 
4.1%
1 467
 
4.1%
465
 
4.1%
458
 
4.0%
438
 
3.8%
Other values (133) 5089
44.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 6908 | 60.7%
Decimal Number | 2256 | 19.8%
Space Separator | 1879 | 16.5%
Dash Punctuation | 253 | 2.2%
Other Punctuation | 28 | 0.2%
Open Punctuation | 25 | 0.2%
Close Punctuation | 25 | 0.2%
Uppercase Letter | 6 | 0.1%
Math Symbol | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
574
 
8.3%
529
 
7.7%
506
 
7.3%
505
 
7.3%
472
 
6.8%
465
 
6.7%
458
 
6.6%
438
 
6.3%
364
 
5.3%
292
 
4.2%
Other values (115) 2305
33.4%
Decimal Number
ValueCountFrequency (%)
1 467
20.7%
2 329
14.6%
3 245
10.9%
4 230
10.2%
5 178
 
7.9%
7 172
 
7.6%
0 161
 
7.1%
6 161
 
7.1%
8 161
 
7.1%
9 152
 
6.7%
Uppercase Letter
ValueCountFrequency (%)
B 4
66.7%
F 2
33.3%
Space Separator
ValueCountFrequency (%)
1879
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 253
100.0%
Other Punctuation
ValueCountFrequency (%)
, 28
100.0%
Open Punctuation
ValueCountFrequency (%)
( 25
100.0%
Close Punctuation
ValueCountFrequency (%)
) 25
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 6908 | 60.7%
Common | 4468 | 39.3%
Latin | 6 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
574
 
8.3%
529
 
7.7%
506
 
7.3%
505
 
7.3%
472
 
6.8%
465
 
6.7%
458
 
6.6%
438
 
6.3%
364
 
5.3%
292
 
4.2%
Other values (115) 2305
33.4%
Common
ValueCountFrequency (%)
1879
42.1%
1 467
 
10.5%
2 329
 
7.4%
- 253
 
5.7%
3 245
 
5.5%
4 230
 
5.1%
5 178
 
4.0%
7 172
 
3.8%
0 161
 
3.6%
6 161
 
3.6%
Other values (6) 393
 
8.8%
Latin
ValueCountFrequency (%)
B 4
66.7%
F 2
33.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 6908 | 60.7%
ASCII | 4474 | 39.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1879
42.0%
1 467
 
10.4%
2 329
 
7.4%
- 253
 
5.7%
3 245
 
5.5%
4 230
 
5.1%
5 178
 
4.0%
7 172
 
3.8%
0 161
 
3.6%
6 161
 
3.6%
Other values (8) 399
 
8.9%
Hangul
ValueCountFrequency (%)
574
 
8.3%
529
 
7.7%
506
 
7.3%
505
 
7.3%
472
 
6.8%
465
 
6.7%
458
 
6.6%
438
 
6.3%
364
 
5.3%
292
 
4.2%
Other values (115) 2305
33.4%

업종
Text

MISSING 

Distinct: 267
Distinct (%): 63.3%
Missing: 36
Missing (%): 7.9%
Memory size: 3.7 KiB

Length

Max length: 34
Median length: 24
Mean length: 10.348341
Min length: 2

Characters and Unicode

Total characters: 4367
Distinct characters: 221
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 223
Unique (%): 52.8%

Sample

1st row: 표면가공목재및특수제재목제조업
2nd row: 기타건축용플라스틱조립제품제조
3rd row: 석재성형가공품제조업
4th row: 화물자동차및기타특수자동차제조업
5th row: 알루미늄금속주물주조업
ValueCountFrequency (%)
79
 
10.4%
제조업 53
 
7.0%
기타목재가구제조업 32
 
4.2%
기타 25
 
3.3%
도장 25
 
3.3%
기타피막처리업 20
 
2.6%
일반철물제조업 20
 
2.6%
금속가구제조업 17
 
2.2%
도장및기타피막처리업 11
 
1.5%
도금 8
 
1.1%
Other values (328) 466
61.6%

Most occurring characters

ValueCountFrequency (%)
427
 
9.8%
374
 
8.6%
337
 
7.7%
334
 
7.6%
193
 
4.4%
143
 
3.3%
141
 
3.2%
127
 
2.9%
123
 
2.8%
111
 
2.5%
Other values (211) 2057
47.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3982 | 91.2%
Space Separator | 334 | 7.6%
Other Punctuation | 23 | 0.5%
Decimal Number | 9 | 0.2%
Open Punctuation | 8 | 0.2%
Close Punctuation | 8 | 0.2%
Lowercase Letter | 3 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
427
 
10.7%
374
 
9.4%
337
 
8.5%
193
 
4.8%
143
 
3.6%
141
 
3.5%
127
 
3.2%
123
 
3.1%
111
 
2.8%
110
 
2.8%
Other values (198) 1896
47.6%
Decimal Number
ValueCountFrequency (%)
9 3
33.3%
2 2
22.2%
8 1
 
11.1%
4 1
 
11.1%
1 1
 
11.1%
7 1
 
11.1%
Lowercase Letter
ValueCountFrequency (%)
c 1
33.3%
v 1
33.3%
p 1
33.3%
Space Separator
ValueCountFrequency (%)
334
100.0%
Other Punctuation
ValueCountFrequency (%)
, 23
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3982 | 91.2%
Common | 382 | 8.7%
Latin | 3 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
427
 
10.7%
374
 
9.4%
337
 
8.5%
193
 
4.8%
143
 
3.6%
141
 
3.5%
127
 
3.2%
123
 
3.1%
111
 
2.8%
110
 
2.8%
Other values (198) 1896
47.6%
Common
ValueCountFrequency (%)
334
87.4%
, 23
 
6.0%
( 8
 
2.1%
) 8
 
2.1%
9 3
 
0.8%
2 2
 
0.5%
8 1
 
0.3%
4 1
 
0.3%
1 1
 
0.3%
7 1
 
0.3%
Latin
ValueCountFrequency (%)
c 1
33.3%
v 1
33.3%
p 1
33.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3982 | 91.2%
ASCII | 385 | 8.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
427
 
10.7%
374
 
9.4%
337
 
8.5%
193
 
4.8%
143
 
3.6%
141
 
3.5%
127
 
3.2%
123
 
3.1%
111
 
2.8%
110
 
2.8%
Other values (198) 1896
47.6%
ASCII
ValueCountFrequency (%)
334
86.8%
, 23
 
6.0%
( 8
 
2.1%
) 8
 
2.1%
9 3
 
0.8%
2 2
 
0.5%
c 1
 
0.3%
v 1
 
0.3%
p 1
 
0.3%
8 1
 
0.3%
Other values (3) 3
 
0.8%

관리등급
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 3.7 KiB
우수: 278
일반: 180

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반
2nd row: 일반
3rd row: 일반
4th row: 일반
5th row: 일반

Common Values

Value | Count | Frequency (%)
우수 | 278 | 60.7%
일반 | 180 | 39.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
우수 | 278 | 60.7%
일반 | 180 | 39.3%

데이터기준일
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.7 KiB
2023-01-01: 458

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-01-01
2nd row: 2023-01-01
3rd row: 2023-01-01
4th row: 2023-01-01
5th row: 2023-01-01

Common Values

Value | Count | Frequency (%)
2023-01-01 | 458 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2023-01-01 | 458 | 100.0%

Interactions


Correlations

Variable | 연번 | 관리등급
연번 | 1.000 | 0.980
관리등급 | 0.980 | 1.000

Variable | 연번 | 관리등급
연번 | 1.000 | 0.869
관리등급 | 0.869 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 연번 | 관할기관 | 사업장명 | 소재지 주소 | 업종 | 관리등급 | 데이터기준일
0 | 1 | 남양주시 | ㈜동림메라민 | 경기도 남양주시 진접읍 부마로 234 | 표면가공목재및특수제재목제조업 | 일반 | 2023-01-01
1 | 2 | 남양주시 | 한미산업 | 경기도 남양주시 진접읍 부마로80번길 2-1 | 기타건축용플라스틱조립제품제조 | 일반 | 2023-01-01
2 | 3 | 남양주시 | 태양석재(코리아산업) | 경기도 남양주시 진접읍 양진로926번길 46 | 석재성형가공품제조업 | 일반 | 2023-01-01
3 | 4 | 남양주시 | 세현엔지니어링 | 경기도 남양주시 진접읍 경복대로271번길 38 | 화물자동차및기타특수자동차제조업 | 일반 | 2023-01-01
4 | 5 | 남양주시 | 삼창금속공업사 | 경기도 남양주시 오남읍 양지로319번길 17-10 | 알루미늄금속주물주조업 | 일반 | 2023-01-01
5 | 6 | 남양주시 | ㈜성원종합유리 | 경기도 남양주시 퇴계원읍 퇴계원로25번길 14 (퇴계원리, 동국안전유리) | 판유가가공제품제조 | 일반 | 2023-01-01
6 | 7 | 남양주시 | 전일석건 | 경기도 남양주시 진접읍 부마로106번길 81 | 석재성형가공품제조 | 일반 | 2023-01-01
7 | 8 | 남양주시 | 중앙하이텍㈜ | 경기도 남양주시 오남읍 진건오남로새능길 14 | 기타 비철금속제련,정련 및 합금제조업 | 일반 | 2023-01-01
8 | 9 | 남양주시 | 현대조합기계 | 경기도 남양주시 진접읍 진벌로199번길 21-2 | 식품산업용 가열 및 냉각기제조업 | 일반 | 2023-01-01
9 | 10 | 남양주시 | ㈜캐스터 | 경기도 남양주시 진접읍 경복대로 334-27 | 일반철물제조업 | 일반 | 2023-01-01

(index) | 연번 | 관할기관 | 사업장명 | 소재지 주소 | 업종 | 관리등급 | 데이터기준일
448 | 449 | 남양주시 | 주식회사 미래에스티엘 | 경기도 남양주시 화도읍 폭포로 487, 489 | 금속가구제조업 | 우수 | 2023-01-01
449 | 450 | 남양주시 | 정명제재소 | 경기도 남양주시 화도읍 폭포로170 | 목재가구제조업 | 우수 | 2023-01-01
450 | 451 | 남양주시 | 블루텍정공㈜ | 경기도 남양주시 화도읍 폭포로215 | 특장차제조 | 우수 | 2023-01-01
451 | 452 | 남양주시 | 원일퍼니처 | 경기도 남양주시 화도읍 폭포로242번길 15 | 목재가구제조 | 우수 | 2023-01-01
452 | 453 | 남양주시 | 성삼기업 | 경기도 남양주시 화도읍 폭포로242번안길 6 | 기타 신발제조업 | 우수 | 2023-01-01
453 | 454 | 남양주시 | ㈜두봄 | 경기도 남양주시 화도읍 폭포로242번안길 60 | 기타목재가구제조업 | 우수 | 2023-01-01
454 | 455 | 남양주시 | 제일교구 | 경기도 남양주시 화도읍 폭포로358번길 17-17 | 금속가구제조업 | 우수 | 2023-01-01
455 | 456 | 남양주시 | ING금속 | 경기도 남양주시 화도읍 폭포로358번안길 30-7 | 기타목재가구제조업 | 우수 | 2023-01-01
456 | 457 | 남양주시 | 효성산업 | 경기도 남양주시 화도읍 폭포로515 | 목재가공제조업 | 우수 | 2023-01-01
457 | 458 | 남양주시 | ㈜재성목재 | 경기도 남양주시 화도읍 폭포로85 | 일반제재업 | 우수 | 2023-01-01