Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory722.7 KiB
Average record size in memory74.0 B

Variable types

Numeric2
Categorical4
Text1
DateTime1

Dataset

Description소음유형 중 환경소음자동측정망을 통해 전송한 데이터를 지역, 법적구분, 용도구분 등의 형태로 제공하는 일평균 데이터
Author한국환경공단
URLhttps://www.data.go.kr/data/3042013/fileData.do

Alerts

지역 has constant value ""Constant
용도구분 is highly overall correlated with 법적High correlation
법적 is highly overall correlated with 용도구분High correlation
번호 is highly overall correlated with 도시High correlation
도시 is highly overall correlated with 번호High correlation
번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:17:32.232314
Analysis finished2023-12-12 08:17:33.583492
Duration1.35 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10054.468
Minimum1
Maximum19802
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:17:33.683450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile915.9
Q14429.5
median11072.5
Q315456.25
95-th percentile18943.05
Maximum19802
Range19801
Interquartile range (IQR)11026.75

Descriptive statistics

Standard deviation6059.8986
Coefficient of variation (CV)0.60270701
Kurtosis-1.3867873
Mean10054.468
Median Absolute Deviation (MAD)5560
Skewness-0.059392188
Sum1.0054468 × 108
Variance36722371
MonotonicityNot monotonic
2023-12-12T17:17:33.871861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
19112 1
 
< 0.1%
10281 1
 
< 0.1%
12885 1
 
< 0.1%
8315 1
 
< 0.1%
13958 1
 
< 0.1%
3208 1
 
< 0.1%
5294 1
 
< 0.1%
5333 1
 
< 0.1%
9386 1
 
< 0.1%
10335 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
13 1
< 0.1%
14 1
< 0.1%
21 1
< 0.1%
ValueCountFrequency (%)
19802 1
< 0.1%
19800 1
< 0.1%
19798 1
< 0.1%
19795 1
< 0.1%
19794 1
< 0.1%
19793 1
< 0.1%
19792 1
< 0.1%
19790 1
< 0.1%
19789 1
< 0.1%
19788 1
< 0.1%

도시
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
서울특별시
1363 
대전광역시
855 
울산광역시
828 
세종특별자치시
822 
광주광역시
808 
Other values (11)
5324 

Length

Max length7
Median length5
Mean length5.1352
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경남창원시
2nd row울산광역시
3rd row경기수원시
4th row경남창원시
5th row강원원주시

Common Values

ValueCountFrequency (%)
서울특별시 1363
13.6%
대전광역시 855
 
8.6%
울산광역시 828
 
8.3%
세종특별자치시 822
 
8.2%
광주광역시 808
 
8.1%
대구광역시 732
 
7.3%
인천광역시 627
 
6.3%
경기수원시 611
 
6.1%
충북청주시 554
 
5.5%
경남창원시 510
 
5.1%
Other values (6) 2290
22.9%

Length

2023-12-12T17:17:34.063506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울특별시 1363
13.6%
대전광역시 855
 
8.6%
울산광역시 828
 
8.3%
세종특별자치시 822
 
8.2%
광주광역시 808
 
8.1%
대구광역시 732
 
7.3%
인천광역시 627
 
6.3%
경기수원시 611
 
6.1%
충북청주시 554
 
5.5%
경남창원시 510
 
5.1%
Other values (6) 2290
22.9%

법적
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
5142 
4198 
600 
 
60

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
5142
51.4%
4198
42.0%
600
 
6.0%
60
 
0.6%

Length

2023-12-12T17:17:34.231818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:17:34.363105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5142
51.4%
4198
42.0%
600
 
6.0%
60
 
0.6%

용도구분
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
일반주거지역
3433 
종합병원지역
1990 
학교지역
1886 
준주거지역
1709 
상업지역
600 
Other values (2)
382 

Length

Max length6
Median length6
Mean length5.3319
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전용주거지역
2nd row학교지역
3rd row상업지역
4th row학교지역
5th row종합병원지역

Common Values

ValueCountFrequency (%)
일반주거지역 3433
34.3%
종합병원지역 1990
19.9%
학교지역 1886
18.9%
준주거지역 1709
17.1%
상업지역 600
 
6.0%
전용주거지역 322
 
3.2%
일반공업지역 60
 
0.6%

Length

2023-12-12T17:17:34.501207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:17:34.665232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반주거지역 3433
34.3%
종합병원지역 1990
19.9%
학교지역 1886
18.9%
준주거지역 1709
17.1%
상업지역 600
 
6.0%
전용주거지역 322
 
3.2%
일반공업지역 60
 
0.6%
Distinct73
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T17:17:34.953576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length6.5574
Min length3

Characters and Unicode

Total characters65574
Distinct characters207
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row삼화페인트
2nd row무룡초교 뒤
3rd row호텔엠베서더
4th row반송중학교
5th row대륙마트
ValueCountFrequency (%)
1625
 
12.0%
정문 552
 
4.1%
건너편 270
 
2.0%
논곡초등학교앞 233
 
1.7%
일곡중학교 211
 
1.6%
동대전산정현교회 207
 
1.5%
estn당구클럽 201
 
1.5%
계룡아파트 198
 
1.5%
새뜸중학교 196
 
1.4%
열매마을7단지 195
 
1.4%
Other values (76) 9643
71.3%
2023-12-12T17:17:35.424561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3531
 
5.4%
2666
 
4.1%
2022
 
3.1%
1734
 
2.6%
1717
 
2.6%
1103
 
1.7%
1081
 
1.6%
1033
 
1.6%
961
 
1.5%
942
 
1.4%
Other values (197) 48784
74.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 58506
89.2%
Space Separator 3531
 
5.4%
Uppercase Letter 1380
 
2.1%
Decimal Number 1332
 
2.0%
Open Punctuation 334
 
0.5%
Close Punctuation 334
 
0.5%
Other Punctuation 157
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2666
 
4.6%
2022
 
3.5%
1734
 
3.0%
1717
 
2.9%
1103
 
1.9%
1081
 
1.8%
1033
 
1.8%
961
 
1.6%
942
 
1.6%
936
 
1.6%
Other values (180) 44311
75.7%
Decimal Number
ValueCountFrequency (%)
1 348
26.1%
7 331
24.8%
5 168
12.6%
6 148
11.1%
3 148
11.1%
0 136
 
10.2%
2 53
 
4.0%
Uppercase Letter
ValueCountFrequency (%)
G 288
20.9%
L 288
20.9%
T 201
14.6%
N 201
14.6%
S 201
14.6%
E 201
14.6%
Space Separator
ValueCountFrequency (%)
3531
100.0%
Open Punctuation
ValueCountFrequency (%)
( 334
100.0%
Close Punctuation
ValueCountFrequency (%)
) 334
100.0%
Other Punctuation
ValueCountFrequency (%)
· 157
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 58506
89.2%
Common 5688
 
8.7%
Latin 1380
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2666
 
4.6%
2022
 
3.5%
1734
 
3.0%
1717
 
2.9%
1103
 
1.9%
1081
 
1.8%
1033
 
1.8%
961
 
1.6%
942
 
1.6%
936
 
1.6%
Other values (180) 44311
75.7%
Common
ValueCountFrequency (%)
3531
62.1%
1 348
 
6.1%
( 334
 
5.9%
) 334
 
5.9%
7 331
 
5.8%
5 168
 
3.0%
· 157
 
2.8%
6 148
 
2.6%
3 148
 
2.6%
0 136
 
2.4%
Latin
ValueCountFrequency (%)
G 288
20.9%
L 288
20.9%
T 201
14.6%
N 201
14.6%
S 201
14.6%
E 201
14.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 58506
89.2%
ASCII 6911
 
10.5%
None 157
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3531
51.1%
1 348
 
5.0%
( 334
 
4.8%
) 334
 
4.8%
7 331
 
4.8%
G 288
 
4.2%
L 288
 
4.2%
T 201
 
2.9%
N 201
 
2.9%
S 201
 
2.9%
Other values (6) 854
 
12.4%
Hangul
ValueCountFrequency (%)
2666
 
4.6%
2022
 
3.5%
1734
 
3.0%
1717
 
2.9%
1103
 
1.9%
1081
 
1.8%
1033
 
1.8%
961
 
1.6%
942
 
1.6%
936
 
1.6%
Other values (180) 44311
75.7%
None
ValueCountFrequency (%)
· 157
100.0%

지역
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
도로
10000 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row도로
2nd row도로
3rd row도로
4th row도로
5th row도로

Common Values

ValueCountFrequency (%)
도로 10000
100.0%

Length

2023-12-12T17:17:35.586445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:17:35.726887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
도로 10000
100.0%
Distinct365
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2022-01-01 00:00:00
Maximum2022-12-31 00:00:00
2023-12-12T17:17:35.849659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:36.012594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

소음도
Real number (ℝ)

Distinct1824
Distinct (%)18.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean67.113625
Minimum55.7
Maximum81.9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:17:36.185276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum55.7
5-th percentile59.6895
Q163.65
median67.77
Q370.64
95-th percentile73.17
Maximum81.9
Range26.2
Interquartile range (IQR)6.99

Descriptive statistics

Standard deviation4.334157
Coefficient of variation (CV)0.064579391
Kurtosis-0.7551761
Mean67.113625
Median Absolute Deviation (MAD)3.44
Skewness-0.18043396
Sum671136.25
Variance18.784917
MonotonicityNot monotonic
2023-12-12T17:17:36.365993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
67.95 20
 
0.2%
68.73 19
 
0.2%
67.93 18
 
0.2%
68.53 18
 
0.2%
68.06 18
 
0.2%
68.33 17
 
0.2%
68.26 17
 
0.2%
71.56 17
 
0.2%
68.21 17
 
0.2%
67.91 17
 
0.2%
Other values (1814) 9822
98.2%
ValueCountFrequency (%)
55.7 1
< 0.1%
56.16 1
< 0.1%
56.24 1
< 0.1%
56.39 1
< 0.1%
56.46 1
< 0.1%
56.5 1
< 0.1%
56.51 1
< 0.1%
56.54 1
< 0.1%
56.55 1
< 0.1%
56.61 1
< 0.1%
ValueCountFrequency (%)
81.9 1
< 0.1%
79.13 1
< 0.1%
78.57 1
< 0.1%
77.85 1
< 0.1%
77.69 1
< 0.1%
77.66 1
< 0.1%
77.64 1
< 0.1%
77.59 1
< 0.1%
77.56 1
< 0.1%
77.46 1
< 0.1%

Interactions

2023-12-12T17:17:33.063440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:32.790516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:33.196121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:32.908022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:17:36.480510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호도시법적용도구분측정지점소음도
번호1.0000.9770.4610.5360.9990.696
도시0.9771.0000.7240.6981.0000.681
법적0.4610.7241.0001.0001.0000.511
용도구분0.5360.6981.0001.0001.0000.575
측정지점0.9991.0001.0001.0001.0000.924
소음도0.6960.6810.5110.5750.9241.000
2023-12-12T17:17:36.600971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도시용도구분법적
도시1.0000.4150.450
용도구분0.4151.0001.000
법적0.4501.0001.000
2023-12-12T17:17:37.044179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호소음도도시법적용도구분
번호1.000-0.1000.8420.2930.307
소음도-0.1001.0000.3300.3310.337
도시0.8420.3301.0000.4500.415
법적0.2930.3310.4501.0001.000
용도구분0.3070.3370.4151.0001.000

Missing values

2023-12-12T17:17:33.342323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:17:33.517908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호도시법적용도구분측정지점지역측정일소음도
1674819112경남창원시전용주거지역삼화페인트도로2022-05-2862.57
868011044울산광역시학교지역무룡초교 뒤도로2022-12-1973.78
1238514749경기수원시상업지역호텔엠베서더도로2022-12-1370.35
1637018734경남창원시학교지역반송중학교도로2022-02-0864.2
1330815672강원원주시종합병원지역대륙마트도로2022-05-2070.81
1024112605세종특별자치시학교지역새뜸중학교도로2022-05-2559.49
47524753인천광역시준주거지역서해상가도로2022-12-1171.31
835836서울특별시준주거지역기업은행도로2022-02-0771.66
1424716611충북청주시종합병원지역청주 흰돌감리교회 앞도로2022-08-2769.08
927811642울산광역시상업지역우체국도로2022-08-1967.54
번호도시법적용도구분측정지점지역측정일소음도
986312227울산광역시일반주거지역패밀리아파트도로2022-12-0560.68
1350615870강원원주시일반주거지역제일주유소도로2022-03-0670.93
33483349대구광역시종합병원지역새동산약국앞도로2022-05-1271.71
12181219서울특별시학교지역불광동도로2022-05-2865.16
79269516대전광역시일반주거지역열매마을7단지 정문도로2022-08-2364.17
22082209서울특별시학교지역숙명여고 앞도로2022-02-2672.54
1581018174전북전주시일반주거지역전주요단교회 앞도로2022-01-2567.48
1491617280충북청주시준주거지역운천·신봉동사무소 앞도로2022-08-0570.62
1722319587경남창원시종합병원지역창원병원 정문도로2022-12-2871.66
1137013734경기수원시상업지역화서동도로2022-03-1171.97