Overview

Dataset statistics

Number of variables6
Number of observations49
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.6 KiB
Average record size in memory54.7 B

Variable types

Numeric3
Text2
Categorical1

Dataset

Description샘플 데이터
Author지디에스컨설팅그룹
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=2a2d9710-2e00-11ea-9713-eb3e5186fb38

Alerts

전국 미세먼지 수치 has constant value ""Constant
화력발전소 미세먼지 수치 is highly overall correlated with 화력발전소 미세먼지 비율High correlation
화력발전소 미세먼지 비율 is highly overall correlated with 화력발전소 미세먼지 수치High correlation
화력발전소 고유번호 has unique valuesUnique
화력발전소 명 has unique valuesUnique

Reproduction

Analysis started2023-12-10 12:32:46.091066
Analysis finished2023-12-10 12:32:48.816679
Duration2.73 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

화력발전소 고유번호
Real number (ℝ)

UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25
Minimum1
Maximum49
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2023-12-10T21:32:48.927487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.4
Q113
median25
Q337
95-th percentile46.6
Maximum49
Range48
Interquartile range (IQR)24

Descriptive statistics

Standard deviation14.28869
Coefficient of variation (CV)0.57154761
Kurtosis-1.2
Mean25
Median Absolute Deviation (MAD)12
Skewness0
Sum1225
Variance204.16667
MonotonicityStrictly increasing
2023-12-10T21:32:49.173357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
1 1
 
2.0%
38 1
 
2.0%
28 1
 
2.0%
29 1
 
2.0%
30 1
 
2.0%
31 1
 
2.0%
32 1
 
2.0%
33 1
 
2.0%
34 1
 
2.0%
35 1
 
2.0%
Other values (39) 39
79.6%
ValueCountFrequency (%)
1 1
2.0%
2 1
2.0%
3 1
2.0%
4 1
2.0%
5 1
2.0%
6 1
2.0%
7 1
2.0%
8 1
2.0%
9 1
2.0%
10 1
2.0%
ValueCountFrequency (%)
49 1
2.0%
48 1
2.0%
47 1
2.0%
46 1
2.0%
45 1
2.0%
44 1
2.0%
43 1
2.0%
42 1
2.0%
41 1
2.0%
40 1
2.0%

화력발전소 명
Text

UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
2023-12-10T21:32:49.590939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length8.0204082
Min length4

Characters and Unicode

Total characters393
Distinct characters66
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)100.0%

Sample

1st row광양복합화력발전소
2nd row나주열병합발전소
3rd row당진복합화력발전소
4th row북평화력발전소
5th row대산복합화력발전소
ValueCountFrequency (%)
광양복합화력발전소 1
 
2.0%
신인천복합화력발전소 1
 
2.0%
남제주화력발전소 1
 
2.0%
영월복합화력발전소 1
 
2.0%
삼척그린파워발전소 1
 
2.0%
안동복합화력발전소 1
 
2.0%
삼천포발전본부 1
 
2.0%
영흥발전본부 1
 
2.0%
분당발전본부 1
 
2.0%
영동발전본부 1
 
2.0%
Other values (39) 39
79.6%
2023-12-10T21:32:50.249005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
48
 
12.2%
48
 
12.2%
43
 
10.9%
31
 
7.9%
31
 
7.9%
24
 
6.1%
17
 
4.3%
13
 
3.3%
7
 
1.8%
7
 
1.8%
Other values (56) 124
31.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 390
99.2%
Uppercase Letter 3
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
48
 
12.3%
48
 
12.3%
43
 
11.0%
31
 
7.9%
31
 
7.9%
24
 
6.2%
17
 
4.4%
13
 
3.3%
7
 
1.8%
7
 
1.8%
Other values (53) 121
31.0%
Uppercase Letter
ValueCountFrequency (%)
G 1
33.3%
N 1
33.3%
L 1
33.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 390
99.2%
Latin 3
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
48
 
12.3%
48
 
12.3%
43
 
11.0%
31
 
7.9%
31
 
7.9%
24
 
6.2%
17
 
4.4%
13
 
3.3%
7
 
1.8%
7
 
1.8%
Other values (53) 121
31.0%
Latin
ValueCountFrequency (%)
G 1
33.3%
N 1
33.3%
L 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 390
99.2%
ASCII 3
 
0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
48
 
12.3%
48
 
12.3%
43
 
11.0%
31
 
7.9%
31
 
7.9%
24
 
6.2%
17
 
4.4%
13
 
3.3%
7
 
1.8%
7
 
1.8%
Other values (53) 121
31.0%
ASCII
ValueCountFrequency (%)
G 1
33.3%
N 1
33.3%
L 1
33.3%
Distinct48
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
2023-12-10T21:32:50.844451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length24
Mean length20.183673
Min length15

Characters and Unicode

Total characters989
Distinct characters130
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique47 ?
Unique (%)95.9%

Sample

1st row전라남도 광양시 제철로 2148-567
2nd row전라남도 나주시 산포면 신도산단길 65 (신도리 1304)
3rd row충남 당진시 송악읍 부곡공단로 241
4th row강원도 동해시 공단 2로 15-5(구호동)
5th row충남 서산시 대산읍 독곶1로 82
ValueCountFrequency (%)
경기도 13
 
5.6%
강원도 6
 
2.6%
전라남도 5
 
2.1%
충청남도 5
 
2.1%
인천광역시 5
 
2.1%
서구 4
 
1.7%
남구 3
 
1.3%
분당로 2
 
0.9%
201 2
 
0.9%
분당구 2
 
0.9%
Other values (166) 186
79.8%
2023-12-10T21:32:51.681659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
184
 
18.6%
45
 
4.6%
40
 
4.0%
36
 
3.6%
1 28
 
2.8%
5 25
 
2.5%
3 23
 
2.3%
2 21
 
2.1%
21
 
2.1%
19
 
1.9%
Other values (120) 547
55.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 611
61.8%
Space Separator 184
 
18.6%
Decimal Number 176
 
17.8%
Dash Punctuation 11
 
1.1%
Open Punctuation 3
 
0.3%
Close Punctuation 3
 
0.3%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
45
 
7.4%
40
 
6.5%
36
 
5.9%
21
 
3.4%
19
 
3.1%
19
 
3.1%
17
 
2.8%
17
 
2.8%
15
 
2.5%
14
 
2.3%
Other values (105) 368
60.2%
Decimal Number
ValueCountFrequency (%)
1 28
15.9%
5 25
14.2%
3 23
13.1%
2 21
11.9%
7 18
10.2%
0 16
9.1%
4 14
8.0%
6 12
6.8%
9 12
6.8%
8 7
 
4.0%
Space Separator
ValueCountFrequency (%)
184
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 611
61.8%
Common 378
38.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
45
 
7.4%
40
 
6.5%
36
 
5.9%
21
 
3.4%
19
 
3.1%
19
 
3.1%
17
 
2.8%
17
 
2.8%
15
 
2.5%
14
 
2.3%
Other values (105) 368
60.2%
Common
ValueCountFrequency (%)
184
48.7%
1 28
 
7.4%
5 25
 
6.6%
3 23
 
6.1%
2 21
 
5.6%
7 18
 
4.8%
0 16
 
4.2%
4 14
 
3.7%
6 12
 
3.2%
9 12
 
3.2%
Other values (5) 25
 
6.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 611
61.8%
ASCII 378
38.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
184
48.7%
1 28
 
7.4%
5 25
 
6.6%
3 23
 
6.1%
2 21
 
5.6%
7 18
 
4.8%
0 16
 
4.2%
4 14
 
3.7%
6 12
 
3.2%
9 12
 
3.2%
Other values (5) 25
 
6.6%
Hangul
ValueCountFrequency (%)
45
 
7.4%
40
 
6.5%
36
 
5.9%
21
 
3.4%
19
 
3.1%
19
 
3.1%
17
 
2.8%
17
 
2.8%
15
 
2.5%
14
 
2.3%
Other values (105) 368
60.2%

화력발전소 미세먼지 수치
Real number (ℝ)

HIGH CORRELATION 

Distinct37
Distinct (%)75.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27.438796
Minimum16.593
Maximum33.06
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2023-12-10T21:32:51.912648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum16.593
5-th percentile24.876
Q126.524
median26.797
Q328.973
95-th percentile31.1728
Maximum33.06
Range16.467
Interquartile range (IQR)2.449

Descriptive statistics

Standard deviation2.5993062
Coefficient of variation (CV)0.094731059
Kurtosis5.3877517
Mean27.438796
Median Absolute Deviation (MAD)1.033
Skewness-1.0135234
Sum1344.501
Variance6.7563927
MonotonicityNot monotonic
2023-12-10T21:32:52.113270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
26.572 7
 
14.3%
28.621 2
 
4.1%
26.989 2
 
4.1%
30.721 2
 
4.1%
27.211 2
 
4.1%
30.209 2
 
4.1%
24.876 2
 
4.1%
26.675 1
 
2.0%
26.458 1
 
2.0%
27.065 1
 
2.0%
Other values (27) 27
55.1%
ValueCountFrequency (%)
16.593 1
2.0%
24.633 1
2.0%
24.876 2
4.1%
24.895 1
2.0%
24.96 1
2.0%
25.105 1
2.0%
25.764 1
2.0%
25.914 1
2.0%
26.276 1
2.0%
26.284 1
2.0%
ValueCountFrequency (%)
33.06 1
2.0%
32.357 1
2.0%
31.472 1
2.0%
30.724 1
2.0%
30.721 2
4.1%
30.716 1
2.0%
30.209 2
4.1%
29.42 1
2.0%
29.348 1
2.0%
28.981 1
2.0%

전국 미세먼지 수치
Categorical

CONSTANT 

Distinct1
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
27.439
49 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row27.439
2nd row27.439
3rd row27.439
4th row27.439
5th row27.439

Common Values

ValueCountFrequency (%)
27.439 49
100.0%

Length

2023-12-10T21:32:52.317763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:32:52.482594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
27.439 49
100.0%

화력발전소 미세먼지 비율
Real number (ℝ)

HIGH CORRELATION 

Distinct32
Distinct (%)65.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.0000408
Minimum0.605
Maximum1.205
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2023-12-10T21:32:52.635374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.605
5-th percentile0.907
Q10.967
median0.977
Q31.056
95-th percentile1.1362
Maximum1.205
Range0.6
Interquartile range (IQR)0.089

Descriptive statistics

Standard deviation0.094706115
Coefficient of variation (CV)0.09470225
Kurtosis5.3790598
Mean1.0000408
Median Absolute Deviation (MAD)0.038
Skewness-1.0112154
Sum49.002
Variance0.0089692483
MonotonicityNot monotonic
2023-12-10T21:32:52.875789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
0.968 7
 
14.3%
0.907 3
 
6.1%
1.12 3
 
6.1%
0.992 2
 
4.1%
0.972 2
 
4.1%
1.101 2
 
4.1%
1.043 2
 
4.1%
0.984 2
 
4.1%
0.958 2
 
4.1%
1.056 2
 
4.1%
Other values (22) 22
44.9%
ValueCountFrequency (%)
0.605 1
 
2.0%
0.898 1
 
2.0%
0.907 3
6.1%
0.91 1
 
2.0%
0.915 1
 
2.0%
0.939 1
 
2.0%
0.944 1
 
2.0%
0.958 2
4.1%
0.964 1
 
2.0%
0.967 1
 
2.0%
ValueCountFrequency (%)
1.205 1
 
2.0%
1.179 1
 
2.0%
1.147 1
 
2.0%
1.12 3
6.1%
1.119 1
 
2.0%
1.101 2
4.1%
1.072 1
 
2.0%
1.07 1
 
2.0%
1.056 2
4.1%
1.05 1
 
2.0%

Interactions

2023-12-10T21:32:47.682333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:32:46.653610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:32:47.132316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:32:47.971142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:32:46.817288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:32:47.289859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:32:48.253553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:32:46.965514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:32:47.530973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T21:32:53.027863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
화력발전소 고유번호화력발전소 명화력발전소 주소화력발전소 미세먼지 수치화력발전소 미세먼지 비율
화력발전소 고유번호1.0001.0000.9170.4590.459
화력발전소 명1.0001.0001.0001.0001.000
화력발전소 주소0.9171.0001.0001.0001.000
화력발전소 미세먼지 수치0.4591.0001.0001.0001.000
화력발전소 미세먼지 비율0.4591.0001.0001.0001.000
2023-12-10T21:32:53.206318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
화력발전소 고유번호화력발전소 미세먼지 수치화력발전소 미세먼지 비율
화력발전소 고유번호1.0000.0870.088
화력발전소 미세먼지 수치0.0871.0001.000
화력발전소 미세먼지 비율0.0881.0001.000

Missing values

2023-12-10T21:32:48.455796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T21:32:48.751027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

화력발전소 고유번호화력발전소 명화력발전소 주소화력발전소 미세먼지 수치전국 미세먼지 수치화력발전소 미세먼지 비율
01광양복합화력발전소전라남도 광양시 제철로 2148-56726.7427.4390.975
12나주열병합발전소전라남도 나주시 산포면 신도산단길 65 (신도리 1304)25.76427.4390.939
23당진복합화력발전소충남 당진시 송악읍 부곡공단로 24129.34827.4391.07
34북평화력발전소강원도 동해시 공단 2로 15-5(구호동)24.87627.4390.907
45대산복합화력발전소충남 서산시 대산읍 독곶1로 8226.28427.4390.958
56동두천복합화력발전소경기도 동두천시 광암동 25624.89527.4390.907
67부천열병합발전소경기도 부천시 오정구 삼정동 363-328.77427.4391.049
78분당복합화력발전소경기도 성남시 분당구 분당동 분당로 33626.98927.4390.984
89안산복합화력발전소경기도 안산시 단원구 원시동 83931.47227.4391.147
910안양열병합발전소경기도 안양시 동안구 평안동 부림로 10016.59327.4390.605
화력발전소 고유번호화력발전소 명화력발전소 주소화력발전소 미세먼지 수치전국 미세먼지 수치화력발전소 미세먼지 비율
3940동해화력발전소강원도 동해시 공단9로 14524.87627.4390.907
4041일산열병합발전소경기도 고양시 일산동구 경의로 20132.35727.4391.179
4142보령화력발전소충청남도 보령시 오천면 오천해안로 89-3727.21127.4390.992
4243인천복합화력발전소인천광역시 서구 중봉대로405번길 41130.72127.4391.12
4344서울화력발전소서울특별시 마포구 토정로 5627.06527.4390.986
4445서천화력발전소충청남도 서천시 서면 서인로235번길 8526.67527.4390.972
4546제주화력발전소제주특별자치도 제주시 원당로 13326.45827.4390.964
4647원주그린열병합발전소강원도 원주시 지정면 신평로 227.34427.4390.997
4748세종천연가스발전소세종특별자치시 금송로 62525.10527.4390.915
4849신보령화력발전소충청남도 보령시 주교면 송도길 20127.21127.4390.992