Overview

Dataset statistics

Number of variables: 6
Number of observations: 49
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.6 KiB
Average record size in memory: 54.7 B
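
The two memory figures are linked by the row count: the average record size is the total footprint divided by the number of observations. A minimal sketch of that arithmetic, using only the figures shown in this report (the variable names are illustrative and not part of ydata-profiling):

```python
# Figures copied from the "Dataset statistics" table.
observations = 49
avg_record_bytes = 54.7  # "Average record size in memory"

# Total footprint implied by the per-record average.
total_bytes = observations * avg_record_bytes
total_kib = total_bytes / 1024  # 1 KiB = 1024 bytes

print(f"{total_bytes:.1f} B = {total_kib:.1f} KiB")
```

The implied total is roughly 2680 B, which rounds to the reported 2.6 KiB, so the two figures agree within rounding.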

Variable types

Numeric: 3
Text: 2
Categorical: 1

Dataset

Description: Sample data
Author: 지디에스컨설팅그룹 (GDS Consulting Group)
URL: https://www.bigdata-environment.kr/user/data_market/detail.do?id=2a2d9710-2e00-11ea-9713-eb3e5186fb38

Alerts

전국미세먼지지수 has constant value "16.867" (Constant)
화력발전소미세먼지지수 is highly overall correlated with 화력발전소미세먼지율 (High correlation)
화력발전소미세먼지율 is highly overall correlated with 화력발전소미세먼지지수 (High correlation)
화력발전소번호 has unique values (Unique)
화력발전소명 has unique values (Unique)
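
The Constant and Unique alerts come down to counting distinct values per column. The sketch below re-implements that rule on six rows copied from this report's sample tables. It is a simplified illustration, not ydata-profiling's own code, and `column_alerts` is a hypothetical helper name.

```python
# Rows 1, 2, 4, 5, 10 and 40 from the report's sample tables;
# the address column is omitted for brevity.
COLUMNS = ("화력발전소번호", "화력발전소명", "화력발전소미세먼지지수",
           "전국미세먼지지수", "화력발전소미세먼지율")
ROWS = [
    (1, "광양복합화력발전소", 16.615, "16.867", 0.985),
    (2, "나주열병합발전소", 15.099, "16.867", 0.895),
    (4, "북평화력발전소", 14.348, "16.867", 0.851),
    (5, "대산복합화력발전소", 18.466, "16.867", 1.095),
    (10, "안양열병합발전소", 18.477, "16.867", 1.095),
    (40, "동해화력발전소", 14.348, "16.867", 0.851),
]


def column_alerts(columns, rows):
    """Return {column: alert} for columns that are constant or all-distinct."""
    alerts = {}
    for position, name in enumerate(columns):
        values = [row[position] for row in rows]
        distinct = len(set(values))
        if distinct == 1:
            alerts[name] = "Constant"
        elif distinct == len(values):
            alerts[name] = "Unique"
    return alerts


alerts = column_alerts(COLUMNS, ROWS)
for name, alert in alerts.items():
    print(f"{name}: {alert}")
```

On these rows the helper flags 화력발전소번호 and 화력발전소명 as Unique and 전국미세먼지지수 as Constant, matching the alerts listed here. The two remaining numeric columns contain repeated values and get no flag.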

Reproduction

Analysis started: 2023-12-10 12:32:54.382241
Analysis finished: 2023-12-10 12:32:57.210867
Duration: 2.83 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

화력발전소번호
Real number (ℝ)

UNIQUE 

Distinct: 49
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 25
Minimum: 1
Maximum: 49
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 573.0 B

Quantile statistics

Minimum: 1
5-th percentile: 3.4
Q1: 13
Median: 25
Q3: 37
95-th percentile: 46.6
Maximum: 49
Range: 48
Interquartile range (IQR): 24

Descriptive statistics

Standard deviation: 14.28869
Coefficient of variation (CV): 0.57154761
Kurtosis: -1.2
Mean: 25
Median Absolute Deviation (MAD): 12
Skewness: 0
Sum: 1225
Variance: 204.16667
Monotonicity: Strictly increasing
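
Because 화력발전소번호 is simply the integers 1 to 49, most of the quantile and descriptive figures can be re-derived with Python's standard library. The sketch assumes linear-interpolation quantiles (the `"inclusive"` method) and the sample (n − 1) variance. These choices reproduce the reported values here, but they are assumptions about how the report computed them.

```python
import statistics

values = list(range(1, 50))  # 1, 2, ..., 49

mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std = statistics.stdev(values)           # square root of the sample variance
cv = std / mean                          # coefficient of variation

# Median absolute deviation: median of |x - median|.
mad = statistics.median(abs(v - median) for v in values)

# Quartiles and 5th/95th percentiles with linear interpolation.
q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")
ventiles = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = ventiles[0], ventiles[-1]

print(mean, median, round(variance, 5), round(std, 5), round(cv, 8))
print(p5, q1, q3, p95, q3 - q1, mad)
```

The variance also follows from the closed form n(n + 1)/12 for the sample variance of 1..n, which gives 49 × 50 / 12 ≈ 204.16667.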
Histogram with fixed size bins (bins=49)

Common values

Value | Count | Frequency (%)
1 | 1 | 2.0%
38 | 1 | 2.0%
28 | 1 | 2.0%
29 | 1 | 2.0%
30 | 1 | 2.0%
31 | 1 | 2.0%
32 | 1 | 2.0%
33 | 1 | 2.0%
34 | 1 | 2.0%
35 | 1 | 2.0%
Other values (39) | 39 | 79.6%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 2.0%
2 | 1 | 2.0%
3 | 1 | 2.0%
4 | 1 | 2.0%
5 | 1 | 2.0%
6 | 1 | 2.0%
7 | 1 | 2.0%
8 | 1 | 2.0%
9 | 1 | 2.0%
10 | 1 | 2.0%

Maximum 10 values

Value | Count | Frequency (%)
49 | 1 | 2.0%
48 | 1 | 2.0%
47 | 1 | 2.0%
46 | 1 | 2.0%
45 | 1 | 2.0%
44 | 1 | 2.0%
43 | 1 | 2.0%
42 | 1 | 2.0%
41 | 1 | 2.0%
40 | 1 | 2.0%

화력발전소명
Text

UNIQUE 

Distinct: 49
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 524.0 B

Length

Max length: 10
Median length: 9
Mean length: 8.0204082
Min length: 4

Characters and Unicode

Total characters: 393
Distinct characters: 66
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
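
As an illustration of how such properties can be tallied, the sketch below counts Unicode general categories with Python's `unicodedata` module. The long category names mirror the labels used in this report. The `CATEGORY_NAMES` mapping and the `category_counts` helper are hypothetical names introduced for this example, and the report's own implementation may differ.

```python
import unicodedata
from collections import Counter

# Two-letter general category codes mapped to the long names used in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
}


def category_counts(texts):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in texts:
        for ch in text:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# The five sample plant names from this report.
names = ["광양복합화력발전소", "나주열병합발전소", "당진복합화력발전소",
         "북평화력발전소", "대산복합화력발전소"]
print(category_counts(names))

# One sample address, which mixes Hangul, spaces, digits and a hyphen.
print(category_counts(["전라남도 광양시 제철로 2148-567"]))
```

Hangul syllables fall under Other Letter, which is why that category dominates the name column, while addresses add Space Separator, Decimal Number and punctuation categories.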

Unique

Unique: 49
Unique (%): 100.0%

Sample

1st row: 광양복합화력발전소
2nd row: 나주열병합발전소
3rd row: 당진복합화력발전소
4th row: 북평화력발전소
5th row: 대산복합화력발전소

Value | Count | Frequency (%)
광양복합화력발전소 | 1 | 2.0%
신인천복합화력발전소 | 1 | 2.0%
남제주화력발전소 | 1 | 2.0%
영월복합화력발전소 | 1 | 2.0%
삼척그린파워발전소 | 1 | 2.0%
안동복합화력발전소 | 1 | 2.0%
삼천포발전본부 | 1 | 2.0%
영흥발전본부 | 1 | 2.0%
분당발전본부 | 1 | 2.0%
영동발전본부 | 1 | 2.0%
Other values (39) | 39 | 79.6%

Most occurring characters

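Value | Count | Frequency (%)
[not preserved] | 48 | 12.2%
[not preserved] | 48 | 12.2%
[not preserved] | 43 | 10.9%
[not preserved] | 31 | 7.9%
[not preserved] | 31 | 7.9%
[not preserved] | 24 | 6.1%
[not preserved] | 17 | 4.3%
[not preserved] | 13 | 3.3%
[not preserved] | 7 | 1.8%
[not preserved] | 7 | 1.8%
Other values (56) | 124 | 31.6%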

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 390 | 99.2%
Uppercase Letter | 3 | 0.8%

Most frequent character per category

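Other Letter

Value | Count | Frequency (%)
[not preserved] | 48 | 12.3%
[not preserved] | 48 | 12.3%
[not preserved] | 43 | 11.0%
[not preserved] | 31 | 7.9%
[not preserved] | 31 | 7.9%
[not preserved] | 24 | 6.2%
[not preserved] | 17 | 4.4%
[not preserved] | 13 | 3.3%
[not preserved] | 7 | 1.8%
[not preserved] | 7 | 1.8%
Other values (53) | 121 | 31.0%

Uppercase Letter

Value | Count | Frequency (%)
G | 1 | 33.3%
N | 1 | 33.3%
L | 1 | 33.3%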

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 390 | 99.2%
Latin | 3 | 0.8%

Most frequent character per script

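Hangul

Value | Count | Frequency (%)
[not preserved] | 48 | 12.3%
[not preserved] | 48 | 12.3%
[not preserved] | 43 | 11.0%
[not preserved] | 31 | 7.9%
[not preserved] | 31 | 7.9%
[not preserved] | 24 | 6.2%
[not preserved] | 17 | 4.4%
[not preserved] | 13 | 3.3%
[not preserved] | 7 | 1.8%
[not preserved] | 7 | 1.8%
Other values (53) | 121 | 31.0%

Latin

Value | Count | Frequency (%)
G | 1 | 33.3%
N | 1 | 33.3%
L | 1 | 33.3%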

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 390 | 99.2%
ASCII | 3 | 0.8%

Most frequent character per block

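Hangul

Value | Count | Frequency (%)
[not preserved] | 48 | 12.3%
[not preserved] | 48 | 12.3%
[not preserved] | 43 | 11.0%
[not preserved] | 31 | 7.9%
[not preserved] | 31 | 7.9%
[not preserved] | 24 | 6.2%
[not preserved] | 17 | 4.4%
[not preserved] | 13 | 3.3%
[not preserved] | 7 | 1.8%
[not preserved] | 7 | 1.8%
Other values (53) | 121 | 31.0%

ASCII

Value | Count | Frequency (%)
G | 1 | 33.3%
N | 1 | 33.3%
L | 1 | 33.3%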

화력발전소주소
Text

Distinct: 48
Distinct (%): 98.0%
Missing: 0
Missing (%): 0.0%
Memory size: 524.0 B

Length

Max length: 32
Median length: 24
Mean length: 20.183673
Min length: 15

Characters and Unicode

Total characters: 989
Distinct characters: 130
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 47
Unique (%): 95.9%

Sample

1st row: 전라남도 광양시 제철로 2148-567
2nd row: 전라남도 나주시 산포면 신도산단길 65 (신도리 1304)
3rd row: 충남 당진시 송악읍 부곡공단로 241
4th row: 강원도 동해시 공단 2로 15-5(구호동)
5th row: 충남 서산시 대산읍 독곶1로 82

Value | Count | Frequency (%)
경기도 | 13 | 5.6%
강원도 | 6 | 2.6%
전라남도 | 5 | 2.1%
충청남도 | 5 | 2.1%
인천광역시 | 5 | 2.1%
서구 | 4 | 1.7%
남구 | 3 | 1.3%
분당로 | 2 | 0.9%
201 | 2 | 0.9%
분당구 | 2 | 0.9%
Other values (166) | 186 | 79.8%

Most occurring characters

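Value | Count | Frequency (%)
(space) | 184 | 18.6%
[not preserved] | 45 | 4.6%
[not preserved] | 40 | 4.0%
[not preserved] | 36 | 3.6%
1 | 28 | 2.8%
5 | 25 | 2.5%
3 | 23 | 2.3%
2 | 21 | 2.1%
[not preserved] | 21 | 2.1%
[not preserved] | 19 | 1.9%
Other values (120) | 547 | 55.3%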

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 611 | 61.8%
Space Separator | 184 | 18.6%
Decimal Number | 176 | 17.8%
Dash Punctuation | 11 | 1.1%
Open Punctuation | 3 | 0.3%
Close Punctuation | 3 | 0.3%
Other Punctuation | 1 | 0.1%

Most frequent character per category

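Other Letter

Value | Count | Frequency (%)
[not preserved] | 45 | 7.4%
[not preserved] | 40 | 6.5%
[not preserved] | 36 | 5.9%
[not preserved] | 21 | 3.4%
[not preserved] | 19 | 3.1%
[not preserved] | 19 | 3.1%
[not preserved] | 17 | 2.8%
[not preserved] | 17 | 2.8%
[not preserved] | 15 | 2.5%
[not preserved] | 14 | 2.3%
Other values (105) | 368 | 60.2%

Decimal Number

Value | Count | Frequency (%)
1 | 28 | 15.9%
5 | 25 | 14.2%
3 | 23 | 13.1%
2 | 21 | 11.9%
7 | 18 | 10.2%
0 | 16 | 9.1%
4 | 14 | 8.0%
6 | 12 | 6.8%
9 | 12 | 6.8%
8 | 7 | 4.0%

Space Separator

Value | Count | Frequency (%)
(space) | 184 | 100.0%

Dash Punctuation

Value | Count | Frequency (%)
- | 11 | 100.0%

Open Punctuation

Value | Count | Frequency (%)
( | 3 | 100.0%

Close Punctuation

Value | Count | Frequency (%)
) | 3 | 100.0%

Other Punctuation

Value | Count | Frequency (%)
. | 1 | 100.0%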

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 611 | 61.8%
Common | 378 | 38.2%

Most frequent character per script

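Hangul

Value | Count | Frequency (%)
[not preserved] | 45 | 7.4%
[not preserved] | 40 | 6.5%
[not preserved] | 36 | 5.9%
[not preserved] | 21 | 3.4%
[not preserved] | 19 | 3.1%
[not preserved] | 19 | 3.1%
[not preserved] | 17 | 2.8%
[not preserved] | 17 | 2.8%
[not preserved] | 15 | 2.5%
[not preserved] | 14 | 2.3%
Other values (105) | 368 | 60.2%

Common

Value | Count | Frequency (%)
(space) | 184 | 48.7%
1 | 28 | 7.4%
5 | 25 | 6.6%
3 | 23 | 6.1%
2 | 21 | 5.6%
7 | 18 | 4.8%
0 | 16 | 4.2%
4 | 14 | 3.7%
6 | 12 | 3.2%
9 | 12 | 3.2%
Other values (5) | 25 | 6.6%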

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 611 | 61.8%
ASCII | 378 | 38.2%

Most frequent character per block

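ASCII

Value | Count | Frequency (%)
(space) | 184 | 48.7%
1 | 28 | 7.4%
5 | 25 | 6.6%
3 | 23 | 6.1%
2 | 21 | 5.6%
7 | 18 | 4.8%
0 | 16 | 4.2%
4 | 14 | 3.7%
6 | 12 | 3.2%
9 | 12 | 3.2%
Other values (5) | 25 | 6.6%

Hangul

Value | Count | Frequency (%)
[not preserved] | 45 | 7.4%
[not preserved] | 40 | 6.5%
[not preserved] | 36 | 5.9%
[not preserved] | 21 | 3.4%
[not preserved] | 19 | 3.1%
[not preserved] | 19 | 3.1%
[not preserved] | 17 | 2.8%
[not preserved] | 17 | 2.8%
[not preserved] | 15 | 2.5%
[not preserved] | 14 | 2.3%
Other values (105) | 368 | 60.2%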

화력발전소미세먼지지수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 37
Distinct (%): 75.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 16.867143
Minimum: 13.369
Maximum: 23.12
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 573.0 B

Quantile statistics

Minimum: 13.369
5-th percentile: 14.3246
Q1: 16.436
Median: 16.615
Q3: 17.369
95-th percentile: 19.3828
Maximum: 23.12
Range: 9.751
Interquartile range (IQR): 0.933

Descriptive statistics

Standard deviation: 1.7079982
Coefficient of variation (CV): 0.10126186
Kurtosis: 2.7489577
Mean: 16.867143
Median Absolute Deviation (MAD): 0.645
Skewness: 0.82053204
Sum: 826.49
Variance: 2.917258
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=37)

Common values

Value | Count | Frequency (%)
16.523 | 7 | 14.3%
19.183 | 2 | 4.1%
19.516 | 2 | 4.1%
17.049 | 2 | 4.1%
17.26 | 2 | 4.1%
16.688 | 2 | 4.1%
14.348 | 2 | 4.1%
17.317 | 1 | 2.0%
16.806 | 1 | 2.0%
16.773 | 1 | 2.0%
Other values (27) | 27 | 55.1%

Minimum 10 values

Value | Count | Frequency (%)
13.369 | 1 | 2.0%
13.94 | 1 | 2.0%
14.309 | 1 | 2.0%
14.348 | 2 | 4.1%
14.681 | 1 | 2.0%
14.894 | 1 | 2.0%
15.077 | 1 | 2.0%
15.099 | 1 | 2.0%
16.03 | 1 | 2.0%
16.061 | 1 | 2.0%

Maximum 10 values

Value | Count | Frequency (%)
23.12 | 1 | 2.0%
19.516 | 2 | 4.1%
19.183 | 2 | 4.1%
18.927 | 1 | 2.0%
18.846 | 1 | 2.0%
18.477 | 1 | 2.0%
18.466 | 1 | 2.0%
18.408 | 1 | 2.0%
17.65 | 1 | 2.0%
17.512 | 1 | 2.0%

전국미세먼지지수
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 524.0 B
Category counts: 16.867 (49)

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 16.867
2nd row: 16.867
3rd row: 16.867
4th row: 16.867
5th row: 16.867

Common Values

Value | Count | Frequency (%)
16.867 | 49 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
16.867 | 49 | 100.0%

화력발전소미세먼지율
Real number (ℝ)

HIGH CORRELATION 

Distinct: 35
Distinct (%): 71.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.99997959
Minimum: 0.793
Maximum: 1.371
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 573.0 B

Quantile statistics

Minimum: 0.793
5-th percentile: 0.8492
Q1: 0.974
Median: 0.985
Q3: 1.03
95-th percentile: 1.149
Maximum: 1.371
Range: 0.578
Interquartile range (IQR): 0.056

Descriptive statistics

Standard deviation: 0.10123827
Coefficient of variation (CV): 0.10124033
Kurtosis: 2.7644533
Mean: 0.99997959
Median Absolute Deviation (MAD): 0.038
Skewness: 0.82282789
Sum: 48.999
Variance: 0.010249187
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=35)

Common values

Value | Count | Frequency (%)
0.98 | 7 | 14.3%
1.023 | 2 | 4.1%
0.989 | 2 | 4.1%
0.851 | 2 | 4.1%
1.095 | 2 | 4.1%
1.157 | 2 | 4.1%
1.137 | 2 | 4.1%
0.976 | 2 | 4.1%
1.011 | 2 | 4.1%
1.038 | 1 | 2.0%
Other values (25) | 25 | 51.0%

Minimum 10 values

Value | Count | Frequency (%)
0.793 | 1 | 2.0%
0.826 | 1 | 2.0%
0.848 | 1 | 2.0%
0.851 | 2 | 4.1%
0.87 | 1 | 2.0%
0.883 | 1 | 2.0%
0.894 | 1 | 2.0%
0.895 | 1 | 2.0%
0.95 | 1 | 2.0%
0.952 | 1 | 2.0%

Maximum 10 values

Value | Count | Frequency (%)
1.371 | 1 | 2.0%
1.157 | 2 | 4.1%
1.137 | 2 | 4.1%
1.122 | 1 | 2.0%
1.117 | 1 | 2.0%
1.095 | 2 | 4.1%
1.091 | 1 | 2.0%
1.046 | 1 | 2.0%
1.038 | 1 | 2.0%
1.03 | 1 | 2.0%

Interactions

[Figures: nine interaction plots]

Correlations

 | 화력발전소번호 | 화력발전소명 | 화력발전소주소 | 화력발전소미세먼지지수 | 화력발전소미세먼지율
화력발전소번호 | 1.000 | 1.000 | 0.917 | 0.506 | 0.506
화력발전소명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
화력발전소주소 | 0.917 | 1.000 | 1.000 | 1.000 | 1.000
화력발전소미세먼지지수 | 0.506 | 1.000 | 1.000 | 1.000 | 1.000
화력발전소미세먼지율 | 0.506 | 1.000 | 1.000 | 1.000 | 1.000

 | 화력발전소번호 | 화력발전소미세먼지지수 | 화력발전소미세먼지율
화력발전소번호 | 1.000 | -0.069 | -0.069
화력발전소미세먼지지수 | -0.069 | 1.000 | 1.000
화력발전소미세먼지율 | -0.069 | 1.000 | 1.000
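
The 1.000 correlation between 화력발전소미세먼지지수 and 화력발전소미세먼지율 has a simple explanation in the sample rows: each ratio equals the plant's index divided by 전국미세먼지지수 (16.867), rounded to three decimals. This is an observation from the first ten sample rows, not something the report states. The sketch below checks it and computes a Pearson coefficient by hand; `pearson` is an illustrative helper, and the report may use a different correlation measure.

```python
from math import sqrt

NATIONAL_INDEX = 16.867  # the constant 전국미세먼지지수 value

# First ten sample rows: plant index and plant-to-national ratio.
index = [16.615, 15.099, 18.846, 14.348, 18.466,
         13.94, 14.309, 19.516, 17.65, 18.477]
ratio = [0.985, 0.895, 1.117, 0.851, 1.095,
         0.826, 0.848, 1.157, 1.046, 1.095]

# Recompute the ratio column from the index column.
recomputed = [round(value / NATIONAL_INDEX, 3) for value in index]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


print(recomputed == ratio)
print(round(pearson(index, ratio), 3))
```

Since one column is a rescaled copy of the other, keeping both adds no information to a model. This is what the High correlation alert signals.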

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 화력발전소번호 | 화력발전소명 | 화력발전소주소 | 화력발전소미세먼지지수 | 전국미세먼지지수 | 화력발전소미세먼지율
0 | 1 | 광양복합화력발전소 | 전라남도 광양시 제철로 2148-567 | 16.615 | 16.867 | 0.985
1 | 2 | 나주열병합발전소 | 전라남도 나주시 산포면 신도산단길 65 (신도리 1304) | 15.099 | 16.867 | 0.895
2 | 3 | 당진복합화력발전소 | 충남 당진시 송악읍 부곡공단로 241 | 18.846 | 16.867 | 1.117
3 | 4 | 북평화력발전소 | 강원도 동해시 공단 2로 15-5(구호동) | 14.348 | 16.867 | 0.851
4 | 5 | 대산복합화력발전소 | 충남 서산시 대산읍 독곶1로 82 | 18.466 | 16.867 | 1.095
5 | 6 | 동두천복합화력발전소 | 경기도 동두천시 광암동 256 | 13.94 | 16.867 | 0.826
6 | 7 | 부천열병합발전소 | 경기도 부천시 오정구 삼정동 363-3 | 14.309 | 16.867 | 0.848
7 | 8 | 분당복합화력발전소 | 경기도 성남시 분당구 분당동 분당로 336 | 19.516 | 16.867 | 1.157
8 | 9 | 안산복합화력발전소 | 경기도 안산시 단원구 원시동 839 | 17.65 | 16.867 | 1.046
9 | 10 | 안양열병합발전소 | 경기도 안양시 동안구 평안동 부림로 100 | 18.477 | 16.867 | 1.095

Last rows

Index | 화력발전소번호 | 화력발전소명 | 화력발전소주소 | 화력발전소미세먼지지수 | 전국미세먼지지수 | 화력발전소미세먼지율
39 | 40 | 동해화력발전소 | 강원도 동해시 공단9로 145 | 14.348 | 16.867 | 0.851
40 | 41 | 일산열병합발전소 | 경기도 고양시 일산동구 경의로 201 | 15.077 | 16.867 | 0.894
41 | 42 | 보령화력발전소 | 충청남도 보령시 오천면 오천해안로 89-37 | 17.26 | 16.867 | 1.023
42 | 43 | 인천복합화력발전소 | 인천광역시 서구 중봉대로405번길 411 | 17.049 | 16.867 | 1.011
43 | 44 | 서울화력발전소 | 서울특별시 마포구 토정로 56 | 16.773 | 16.867 | 0.994
44 | 45 | 서천화력발전소 | 충청남도 서천시 서면 서인로235번길 85 | 17.317 | 16.867 | 1.027
45 | 46 | 제주화력발전소 | 제주특별자치도 제주시 원당로 133 | 16.806 | 16.867 | 0.996
46 | 47 | 원주그린열병합발전소 | 강원도 원주시 지정면 신평로 2 | 16.392 | 16.867 | 0.972
47 | 48 | 세종천연가스발전소 | 세종특별자치시 금송로 625 | 14.681 | 16.867 | 0.87
48 | 49 | 신보령화력발전소 | 충청남도 보령시 주교면 송도길 201 | 17.26 | 16.867 | 1.023