Overview

Dataset statistics

Number of variables6
Number of observations49
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.6 KiB
Average record size in memory54.7 B

Variable types

Numeric3
Text2
Categorical1

Dataset

Description샘플 데이터
Author지디에스컨설팅그룹
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=2a2d9710-2e00-11ea-9713-eb3e5186fb38

Alerts

전국 미세먼지 수치 has constant value ""Constant
화력발전소 미세먼지 수치 is highly overall correlated with 화력발전소 미세먼지 비율High correlation
화력발전소 미세먼지 비율 is highly overall correlated with 화력발전소 미세먼지 수치High correlation
화력발전소 고유번호 has unique valuesUnique
화력발전소 명 has unique valuesUnique

Reproduction

Analysis started2023-12-10 12:33:02.640157
Analysis finished2023-12-10 12:33:05.292021
Duration2.65 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

화력발전소 고유번호
Real number (ℝ)

UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25
Minimum1
Maximum49
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2023-12-10T21:33:05.521482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.4
Q113
median25
Q337
95-th percentile46.6
Maximum49
Range48
Interquartile range (IQR)24

Descriptive statistics

Standard deviation14.28869
Coefficient of variation (CV)0.57154761
Kurtosis-1.2
Mean25
Median Absolute Deviation (MAD)12
Skewness0
Sum1225
Variance204.16667
MonotonicityStrictly increasing
2023-12-10T21:33:05.800949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
1 1
 
2.0%
38 1
 
2.0%
28 1
 
2.0%
29 1
 
2.0%
30 1
 
2.0%
31 1
 
2.0%
32 1
 
2.0%
33 1
 
2.0%
34 1
 
2.0%
35 1
 
2.0%
Other values (39) 39
79.6%
ValueCountFrequency (%)
1 1
2.0%
2 1
2.0%
3 1
2.0%
4 1
2.0%
5 1
2.0%
6 1
2.0%
7 1
2.0%
8 1
2.0%
9 1
2.0%
10 1
2.0%
ValueCountFrequency (%)
49 1
2.0%
48 1
2.0%
47 1
2.0%
46 1
2.0%
45 1
2.0%
44 1
2.0%
43 1
2.0%
42 1
2.0%
41 1
2.0%
40 1
2.0%

화력발전소 명
Text

UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
2023-12-10T21:33:06.174245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length8.0204082
Min length4

Characters and Unicode

Total characters393
Distinct characters66
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)100.0%

Sample

1st row광양복합화력발전소
2nd row나주열병합발전소
3rd row당진복합화력발전소
4th row북평화력발전소
5th row대산복합화력발전소
ValueCountFrequency (%)
광양복합화력발전소 1
 
2.0%
신인천복합화력발전소 1
 
2.0%
남제주화력발전소 1
 
2.0%
영월복합화력발전소 1
 
2.0%
삼척그린파워발전소 1
 
2.0%
안동복합화력발전소 1
 
2.0%
삼천포발전본부 1
 
2.0%
영흥발전본부 1
 
2.0%
분당발전본부 1
 
2.0%
영동발전본부 1
 
2.0%
Other values (39) 39
79.6%
2023-12-10T21:33:06.823614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
48
 
12.2%
48
 
12.2%
43
 
10.9%
31
 
7.9%
31
 
7.9%
24
 
6.1%
17
 
4.3%
13
 
3.3%
7
 
1.8%
7
 
1.8%
Other values (56) 124
31.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 390
99.2%
Uppercase Letter 3
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
48
 
12.3%
48
 
12.3%
43
 
11.0%
31
 
7.9%
31
 
7.9%
24
 
6.2%
17
 
4.4%
13
 
3.3%
7
 
1.8%
7
 
1.8%
Other values (53) 121
31.0%
Uppercase Letter
ValueCountFrequency (%)
G 1
33.3%
N 1
33.3%
L 1
33.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 390
99.2%
Latin 3
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
48
 
12.3%
48
 
12.3%
43
 
11.0%
31
 
7.9%
31
 
7.9%
24
 
6.2%
17
 
4.4%
13
 
3.3%
7
 
1.8%
7
 
1.8%
Other values (53) 121
31.0%
Latin
ValueCountFrequency (%)
G 1
33.3%
N 1
33.3%
L 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 390
99.2%
ASCII 3
 
0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
48
 
12.3%
48
 
12.3%
43
 
11.0%
31
 
7.9%
31
 
7.9%
24
 
6.2%
17
 
4.4%
13
 
3.3%
7
 
1.8%
7
 
1.8%
Other values (53) 121
31.0%
ASCII
ValueCountFrequency (%)
G 1
33.3%
N 1
33.3%
L 1
33.3%
Distinct48
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
2023-12-10T21:33:07.458803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length24
Mean length20.183673
Min length15

Characters and Unicode

Total characters989
Distinct characters130
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique47 ?
Unique (%)95.9%

Sample

1st row전라남도 광양시 제철로 2148-567
2nd row전라남도 나주시 산포면 신도산단길 65 (신도리 1304)
3rd row충남 당진시 송악읍 부곡공단로 241
4th row강원도 동해시 공단 2로 15-5(구호동)
5th row충남 서산시 대산읍 독곶1로 82
ValueCountFrequency (%)
경기도 13
 
5.6%
강원도 6
 
2.6%
전라남도 5
 
2.1%
충청남도 5
 
2.1%
인천광역시 5
 
2.1%
서구 4
 
1.7%
남구 3
 
1.3%
분당로 2
 
0.9%
201 2
 
0.9%
분당구 2
 
0.9%
Other values (166) 186
79.8%
2023-12-10T21:33:08.526420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
184
 
18.6%
45
 
4.6%
40
 
4.0%
36
 
3.6%
1 28
 
2.8%
5 25
 
2.5%
3 23
 
2.3%
2 21
 
2.1%
21
 
2.1%
19
 
1.9%
Other values (120) 547
55.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 611
61.8%
Space Separator 184
 
18.6%
Decimal Number 176
 
17.8%
Dash Punctuation 11
 
1.1%
Open Punctuation 3
 
0.3%
Close Punctuation 3
 
0.3%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
45
 
7.4%
40
 
6.5%
36
 
5.9%
21
 
3.4%
19
 
3.1%
19
 
3.1%
17
 
2.8%
17
 
2.8%
15
 
2.5%
14
 
2.3%
Other values (105) 368
60.2%
Decimal Number
ValueCountFrequency (%)
1 28
15.9%
5 25
14.2%
3 23
13.1%
2 21
11.9%
7 18
10.2%
0 16
9.1%
4 14
8.0%
6 12
6.8%
9 12
6.8%
8 7
 
4.0%
Space Separator
ValueCountFrequency (%)
184
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 611
61.8%
Common 378
38.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
45
 
7.4%
40
 
6.5%
36
 
5.9%
21
 
3.4%
19
 
3.1%
19
 
3.1%
17
 
2.8%
17
 
2.8%
15
 
2.5%
14
 
2.3%
Other values (105) 368
60.2%
Common
ValueCountFrequency (%)
184
48.7%
1 28
 
7.4%
5 25
 
6.6%
3 23
 
6.1%
2 21
 
5.6%
7 18
 
4.8%
0 16
 
4.2%
4 14
 
3.7%
6 12
 
3.2%
9 12
 
3.2%
Other values (5) 25
 
6.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 611
61.8%
ASCII 378
38.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
184
48.7%
1 28
 
7.4%
5 25
 
6.6%
3 23
 
6.1%
2 21
 
5.6%
7 18
 
4.8%
0 16
 
4.2%
4 14
 
3.7%
6 12
 
3.2%
9 12
 
3.2%
Other values (5) 25
 
6.6%
Hangul
ValueCountFrequency (%)
45
 
7.4%
40
 
6.5%
36
 
5.9%
21
 
3.4%
19
 
3.1%
19
 
3.1%
17
 
2.8%
17
 
2.8%
15
 
2.5%
14
 
2.3%
Other values (105) 368
60.2%

화력발전소 미세먼지 수치
Real number (ℝ)

HIGH CORRELATION 

Distinct37
Distinct (%)75.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33.022429
Minimum27.337
Maximum43.974
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2023-12-10T21:33:08.769662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum27.337
5-th percentile29.1454
Q131.022
median32.652
Q333.863
95-th percentile38.0882
Maximum43.974
Range16.637
Interquartile range (IQR)2.841

Descriptive statistics

Standard deviation3.0632701
Coefficient of variation (CV)0.09276332
Kurtosis2.8909105
Mean33.022429
Median Absolute Deviation (MAD)1.553
Skewness1.3283068
Sum1618.099
Variance9.3836237
MonotonicityNot monotonic
2023-12-10T21:33:08.993187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
32.652 7
 
14.3%
35.825 2
 
4.1%
33.172 2
 
4.1%
30.216 2
 
4.1%
32.667 2
 
4.1%
30.353 2
 
4.1%
30.232 2
 
4.1%
31.099 1
 
2.0%
32.842 1
 
2.0%
36.217 1
 
2.0%
Other values (27) 27
55.1%
ValueCountFrequency (%)
27.337 1
2.0%
28.93 1
2.0%
28.977 1
2.0%
29.398 1
2.0%
30.216 2
4.1%
30.232 2
4.1%
30.353 2
4.1%
30.77 1
2.0%
30.791 1
2.0%
31.022 1
2.0%
ValueCountFrequency (%)
43.974 1
2.0%
40.998 1
2.0%
38.701 1
2.0%
37.169 1
2.0%
36.914 1
2.0%
36.85 1
2.0%
36.217 1
2.0%
35.825 2
4.1%
35.729 1
2.0%
34.899 1
2.0%

전국 미세먼지 수치
Categorical

CONSTANT 

Distinct1
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
33.022
49 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row33.022
2nd row33.022
3rd row33.022
4th row33.022
5th row33.022

Common Values

ValueCountFrequency (%)
33.022 49
100.0%

Length

2023-12-10T21:33:09.225383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:33:09.372403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
33.022 49
100.0%

화력발전소 미세먼지 비율
Real number (ℝ)

HIGH CORRELATION 

Distinct34
Distinct (%)69.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.0001224
Minimum0.828
Maximum1.332
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2023-12-10T21:33:09.525996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.828
5-th percentile0.8828
Q10.939
median0.989
Q31.025
95-th percentile1.1536
Maximum1.332
Range0.504
Interquartile range (IQR)0.086

Descriptive statistics

Standard deviation0.092818513
Coefficient of variation (CV)0.092807149
Kurtosis2.894443
Mean1.0001224
Median Absolute Deviation (MAD)0.047
Skewness1.3297013
Sum49.006
Variance0.0086152764
MonotonicityNot monotonic
2023-12-10T21:33:09.733052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
0.989 9
 
18.4%
1.005 2
 
4.1%
0.995 2
 
4.1%
0.915 2
 
4.1%
0.932 2
 
4.1%
1.085 2
 
4.1%
0.919 2
 
4.1%
0.916 2
 
4.1%
1.097 1
 
2.0%
1.332 1
 
2.0%
Other values (24) 24
49.0%
ValueCountFrequency (%)
0.828 1
2.0%
0.876 1
2.0%
0.878 1
2.0%
0.89 1
2.0%
0.915 2
4.1%
0.916 2
4.1%
0.919 2
4.1%
0.932 2
4.1%
0.939 1
2.0%
0.942 1
2.0%
ValueCountFrequency (%)
1.332 1
2.0%
1.242 1
2.0%
1.172 1
2.0%
1.126 1
2.0%
1.118 1
2.0%
1.116 1
2.0%
1.097 1
2.0%
1.085 2
4.1%
1.082 1
2.0%
1.057 1
2.0%

Interactions

2023-12-10T21:33:04.167255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:33:03.119824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:33:03.600490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:33:04.332383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:33:03.308365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:33:03.844105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:33:04.503644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:33:03.437166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:33:04.005410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T21:33:09.880457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
화력발전소 고유번호화력발전소 명화력발전소 주소화력발전소 미세먼지 수치화력발전소 미세먼지 비율
화력발전소 고유번호1.0001.0000.9170.1920.192
화력발전소 명1.0001.0001.0001.0001.000
화력발전소 주소0.9171.0001.0001.0001.000
화력발전소 미세먼지 수치0.1921.0001.0001.0001.000
화력발전소 미세먼지 비율0.1921.0001.0001.0001.000
2023-12-10T21:33:10.042132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
화력발전소 고유번호화력발전소 미세먼지 수치화력발전소 미세먼지 비율
화력발전소 고유번호1.000-0.185-0.198
화력발전소 미세먼지 수치-0.1851.0000.998
화력발전소 미세먼지 비율-0.1980.9981.000

Missing values

2023-12-10T21:33:04.763479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T21:33:05.128945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

화력발전소 고유번호화력발전소 명화력발전소 주소화력발전소 미세먼지 수치전국 미세먼지 수치화력발전소 미세먼지 비율
01광양복합화력발전소전라남도 광양시 제철로 2148-56731.75933.0220.962
12나주열병합발전소전라남도 나주시 산포면 신도산단길 65 (신도리 1304)28.97733.0220.878
23당진복합화력발전소충남 당진시 송악읍 부곡공단로 24136.8533.0221.116
34북평화력발전소강원도 동해시 공단 2로 15-5(구호동)30.23233.0220.916
45대산복합화력발전소충남 서산시 대산읍 독곶1로 8231.98933.0220.969
56동두천복합화력발전소경기도 동두천시 광암동 25633.91633.0221.027
67부천열병합발전소경기도 부천시 오정구 삼정동 363-337.16933.0221.126
78분당복합화력발전소경기도 성남시 분당구 분당동 분당로 33633.17233.0221.005
89안산복합화력발전소경기도 안산시 단원구 원시동 83938.70133.0221.172
910안양열병합발전소경기도 안양시 동안구 평안동 부림로 10036.91433.0221.118
화력발전소 고유번호화력발전소 명화력발전소 주소화력발전소 미세먼지 수치전국 미세먼지 수치화력발전소 미세먼지 비율
3940동해화력발전소강원도 동해시 공단9로 14530.23233.0220.916
4041일산열병합발전소경기도 고양시 일산동구 경의로 20143.97433.0221.332
4142보령화력발전소충청남도 보령시 오천면 오천해안로 89-3732.66733.0220.989
4243인천복합화력발전소인천광역시 서구 중봉대로405번길 41130.21633.0220.915
4344서울화력발전소서울특별시 마포구 토정로 5636.21733.0221.097
4445서천화력발전소충청남도 서천시 서면 서인로235번길 8531.09933.0220.942
4546제주화력발전소제주특별자치도 제주시 원당로 13332.84233.0220.995
4647원주그린열병합발전소강원도 원주시 지정면 신평로 231.69233.0220.96
4748세종천연가스발전소세종특별자치시 금송로 62529.39833.0220.89
4849신보령화력발전소충청남도 보령시 주교면 송도길 20132.66733.0220.989