Dataset statistics
Number of variables | 12 |
---|---|
Number of observations | 49 |
Missing cells | 57 |
Missing cells (%) | 9.7% |
Duplicate rows | 3 |
Duplicate rows (%) | 6.1% |
Total size in memory | 4.9 KiB |
Average record size in memory | 101.7 B |
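The percentages in this table follow directly from the raw counts. The short check below recomputes them as a sanity test; it is only an illustrative sketch, the variable names are ours, and rounding to one decimal place is assumed to match what the report displays.

```python
# Counts copied from the "Dataset statistics" table above.
n_variables = 12
n_observations = 49
missing_cells = 57
duplicate_rows = 3
avg_record_bytes = 101.7

total_cells = n_variables * n_observations             # every cell in the table
missing_pct = 100 * missing_cells / total_cells         # share of empty cells
duplicate_pct = 100 * duplicate_rows / n_observations   # share of duplicated rows
total_kib = avg_record_bytes * n_observations / 1024    # bytes converted to KiB

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Total size in memory: {total_kib:.1f} KiB")
```

The three printed values should agree with the 9.7%, 6.1%, and 4.9 KiB reported above.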
Variable types
Text | 1 |
---|---|
Categorical | 2 |
Unsupported | 6 |
Numeric | 3 |
Dataset
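Description | Monthly fine dust (PM-10) concentration measurements for Wonju-si, Gangwon-do, in 2020. Example: the various PM-10 measurement results for January at Jungang-dong (중앙동), Bangok-dong (반곡동), Munmak-eup (문막읍), and the city average (도시평균). |
---|---|
Author | Wonju-si, Gangwon-do (강원도 원주시) |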
URL | https://www.data.go.kr/data/15092042/fileData.do |
Dataset has 3 (6.1%) duplicate rows | Duplicates |
도시명 is highly overall correlated with 유효 측정일수 and 3 other fields | High correlation |
측정소명 is highly overall correlated with 유효 측정일수 and 2 other fields | High correlation |
유효 측정일수 is highly overall correlated with 유효 측정시간 and 2 other fields | High correlation |
유효 측정시간 is highly overall correlated with 유효 측정일수 and 2 other fields | High correlation |
월평균 (㎍/㎥) is highly overall correlated with 도시명 | High correlation |
시,도명 has 37 (75.5%) missing values | Missing |
유효자료 획득율(%) has 1 (2.0%) missing values | Missing |
유효 측정일수 has 3 (6.1%) missing values | Missing |
유효 측정시간 has 3 (6.1%) missing values | Missing |
월평균 (㎍/㎥) has 3 (6.1%) missing values | Missing |
24시간치 has 2 (4.1%) missing values | Missing |
Unnamed: 8 has 2 (4.1%) missing values | Missing |
Unnamed: 9 has 2 (4.1%) missing values | Missing |
Unnamed: 10 has 2 (4.1%) missing values | Missing |
Unnamed: 11 has 2 (4.1%) missing values | Missing |
유효자료 획득율(%) is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
24시간치 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
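Several of these alerts look like side effects of the spreadsheet layout rather than genuine data problems. In the sample rows of this report, 시,도명 and 도시명 are filled only on the first row of each month, which would be consistent with merged cells in the source file. The sketch below forward-fills such gaps in plain Python; `forward_fill` is a hypothetical helper, and the merged-cell explanation is an assumption, not something the dataset documents.

```python
# A few (시,도명, 도시명, 측정소명) triples copied from the sample rows of this report.
rows = [
    ("1월", "원주시", "중앙동"),
    (None, None, "반곡동"),
    (None, None, "문막읍"),
    (None, None, "도시평균"),
    ("2월", "원주시", "중앙동"),
    (None, None, "반곡동"),
]

def forward_fill(values):
    """Replace each None with the most recent non-None value (hypothetical helper)."""
    filled = []
    last = None
    for value in values:
        if value is not None:
            last = value
        filled.append(last)
    return filled

months = forward_fill([row[0] for row in rows])
cities = forward_fill([row[1] for row in rows])
cleaned = [(m, c, row[2]) for m, c, row in zip(months, cities, rows)]

for record in cleaned:
    print(record)
```

If the data is loaded with pandas, `DataFrame.ffill()` on those two columns should have the same effect.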
Reproduction
Analysis started | 2023-12-12 03:19:10.185485 |
---|---|
Analysis finished | 2023-12-12 03:19:12.037327 |
Duration | 1.85 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
시,도명
Text
MISSING
 
Distinct | 12 |
---|---|
Distinct (%) | 100.0% |
Missing | 37 |
Missing (%) | 75.5% |
Memory size | 524.0 B |
Value | Count | Frequency (%) |
1월 | 1 | |
2월 | 1 | |
3월 | 1 | |
4월 | 1 | |
5월 | 1 | |
6월 | 1 | |
7월 | 1 | |
8월 | 1 | |
9월 | 1 | |
10월 | 1 | |
Other values (2) | 2 |
Most occurring characters
Value | Count | Frequency (%) |
월 | 12 | 44.4% |
1 | 5 | 18.5% |
2 | 2 | 7.4% |
3 | 1 | 3.7% |
4 | 1 | 3.7% |
5 | 1 | 3.7% |
6 | 1 | 3.7% |
7 | 1 | 3.7% |
8 | 1 | 3.7% |
9 | 1 | 3.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 15 | 55.6% |
Other Letter | 12 | 44.4% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 5 | 33.3% |
2 | 2 | 13.3% |
3 | 1 | 6.7% |
4 | 1 | 6.7% |
5 | 1 | 6.7% |
6 | 1 | 6.7% |
7 | 1 | 6.7% |
8 | 1 | 6.7% |
9 | 1 | 6.7% |
0 | 1 | 6.7% |
Other Letter
Value | Count | Frequency (%) |
월 | 12 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 15 | 55.6% |
Hangul | 12 | 44.4% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 5 | 33.3% |
2 | 2 | 13.3% |
3 | 1 | 6.7% |
4 | 1 | 6.7% |
5 | 1 | 6.7% |
6 | 1 | 6.7% |
7 | 1 | 6.7% |
8 | 1 | 6.7% |
9 | 1 | 6.7% |
0 | 1 | 6.7% |
Hangul
Value | Count | Frequency (%) |
월 | 12 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 15 | 55.6% |
Hangul | 12 | 44.4% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
월 | 12 | 100.0% |
ASCII
Value | Count | Frequency (%) |
1 | 5 | 33.3% |
2 | 2 | 13.3% |
3 | 1 | 6.7% |
4 | 1 | 6.7% |
5 | 1 | 6.7% |
6 | 1 | 6.7% |
7 | 1 | 6.7% |
8 | 1 | 6.7% |
9 | 1 | 6.7% |
0 | 1 | 6.7% |
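The character, category, and script counts above can be reproduced from the twelve distinct month labels. The sketch below assumes those labels are exactly "1월" through "12월" (the sample rows show 11월 and 12월 as the two values not listed in the value table) and uses the standard `unicodedata` module, where `Nd` stands for Decimal Number and `Lo` for Other Letter.

```python
import unicodedata
from collections import Counter

# The 12 distinct month labels of 시,도명: "1월" ... "12월" (assumed from the report).
labels = [f"{month}월" for month in range(1, 13)]
chars = "".join(labels)

char_counts = Counter(chars)
category_counts = Counter(unicodedata.category(c) for c in chars)

print(len(chars))                       # total number of characters
print(char_counts.most_common(3))       # most frequent characters
print(dict(category_counts))            # "Nd" = Decimal Number, "Lo" = Other Letter
```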
도시명
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 4.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 524.0 B |
Value | Count |
---|---|
<NA> | 37 |
원주시 | 12 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.755102 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | 원주시 |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 37 | 75.5% |
원주시 | 12 | 24.5% |
측정소명
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 10.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 524.0 B |
Value | Count |
---|---|
중앙동 | 12 |
반곡동 | 12 |
문막읍 | 12 |
도시평균 | 12 |
<NA> | 1 |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.2653061 |
Min length | 3 |
Unique
Unique | 1 |
---|---|
Unique (%) | 2.0% |
Sample
1st row | <NA> |
---|---|
2nd row | 중앙동 |
3rd row | 반곡동 |
4th row | 문막읍 |
5th row | 도시평균 |
Common Values
Value | Count | Frequency (%) |
중앙동 | 12 | 24.5% |
반곡동 | 12 | 24.5% |
문막읍 | 12 | 24.5% |
도시평균 | 12 | 24.5% |
<NA> | 1 | 2.0% |
유효자료 획득율(%)
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 1 |
---|---|
Missing (%) | 2.0% |
Memory size | 524.0 B |
유효 측정일수
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 14 |
---|---|
Distinct (%) | 30.4% |
Missing | 3 |
Missing (%) | 6.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 43.782609 |
Minimum | 16 |
---|---|
Maximum | 93 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 573.0 B |
Quantile statistics
Minimum | 16 |
---|---|
5-th percentile | 27 |
Q1 | 30 |
median | 31 |
Q3 | 51.25 |
95-th percentile | 93 |
Maximum | 93 |
Range | 77 |
Interquartile range (IQR) | 21.25 |
Descriptive statistics
Standard deviation | 25.040335 |
---|---|
Coefficient of variation (CV) | 0.57192423 |
Kurtosis | -0.19719112 |
Mean | 43.782609 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 1.2663854 |
Sum | 2014 |
Variance | 627.01836 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
31 | 16 | 32.7% |
30 | 9 | 18.4% |
93 | 4 | 8.2% |
29 | 4 | 8.2% |
27 | 3 | 6.1% |
90 | 2 | 4.1% |
28 | 1 | 2.0% |
86 | 1 | 2.0% |
89 | 1 | 2.0% |
16 | 1 | 2.0% |
Other values (4) | 4 | 8.2% |
(Missing) | 3 | 6.1% |
Value | Count | Frequency (%) |
16 | 1 | 2.0% |
27 | 3 | 6.1% |
28 | 1 | 2.0% |
29 | 4 | 8.2% |
30 | 9 | 18.4% |
31 | 16 | 32.7% |
58 | 1 | 2.0% |
59 | 1 | 2.0% |
75 | 1 | 2.0% |
86 | 1 | 2.0% |
Value | Count | Frequency (%) |
93 | 4 | 8.2% |
90 | 2 | 4.1% |
89 | 1 | 2.0% |
88 | 1 | 2.0% |
86 | 1 | 2.0% |
75 | 1 | 2.0% |
59 | 1 | 2.0% |
58 | 1 | 2.0% |
31 | 16 | 32.7% |
30 | 9 | 18.4% |
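The frequency and extreme-value tables above list every distinct value of 유효 측정일수, so the column can be rebuilt and the summary statistics rechecked. The sketch below does that with the standard `statistics` module. It assumes the report uses the sample standard deviation (n - 1) and linearly interpolated quantiles, which is what `method="inclusive"` gives; the three missing values are left out.

```python
import statistics

# Value -> count for 유효 측정일수, pieced together from the tables above.
counts = {16: 1, 27: 3, 28: 1, 29: 4, 30: 9, 31: 16, 58: 1, 59: 1,
          75: 1, 86: 1, 88: 1, 89: 1, 90: 2, 93: 4}
data = sorted(value for value, count in counts.items() for _ in range(count))

mean = statistics.mean(data)
std = statistics.stdev(data)                       # sample standard deviation
q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")
mad = statistics.median(abs(x - median) for x in data)

print("n:", len(data), "sum:", sum(data))
print("mean:", round(mean, 6), "std:", round(std, 6))
print("Q1:", q1, "median:", median, "Q3:", q3, "IQR:", q3 - q1)
print("CV:", round(std / mean, 6), "MAD:", mad)
```

The large gap between Q3 and the median comes from the small group of values between 58 and 93, which sits well apart from the 27 to 31 cluster.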
유효 측정시간
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 34 |
---|---|
Distinct (%) | 73.9% |
Missing | 3 |
Missing (%) | 6.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1050.7391 |
Minimum | 383 |
---|---|
Maximum | 2216 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 573.0 B |
Quantile statistics
Minimum | 383 |
---|---|
5-th percentile | 685.5 |
Q1 | 715 |
median | 736.5 |
Q3 | 1249 |
95-th percentile | 2212.75 |
Maximum | 2216 |
Range | 1833 |
Interquartile range (IQR) | 534 |
Descriptive statistics
Standard deviation | 599.22129 |
---|---|
Coefficient of variation (CV) | 0.5702855 |
Kurtosis | -0.22627225 |
Mean | 1050.7391 |
Median Absolute Deviation (MAD) | 24.5 |
Skewness | 1.2595886 |
Sum | 48334 |
Variance | 359066.15 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
740 | 5 | 10.2% |
717 | 3 | 6.1% |
730 | 2 | 4.1% |
715 | 2 | 4.1% |
709 | 2 | 4.1% |
741 | 2 | 4.1% |
736 | 2 | 4.1% |
2216 | 2 | 4.1% |
2063 | 1 | 2.0% |
739 | 1 | 2.0% |
Other values (24) | 24 | 49.0% |
(Missing) | 3 | 6.1% |
Value | Count | Frequency (%) |
383 | 1 | 2.0% |
681 | 1 | 2.0% |
685 | 1 | 2.0% |
687 | 1 | 2.0% |
691 | 1 | 2.0% |
695 | 1 | 2.0% |
697 | 1 | 2.0% |
698 | 1 | 2.0% |
701 | 1 | 2.0% |
709 | 2 | 4.1% |
Value | Count | Frequency (%) |
2216 | 2 | 4.1% |
2213 | 1 | 2.0% |
2212 | 1 | 2.0% |
2166 | 1 | 2.0% |
2149 | 1 | 2.0% |
2141 | 1 | 2.0% |
2140 | 1 | 2.0% |
2063 | 1 | 2.0% |
1798 | 1 | 2.0% |
1435 | 1 | 2.0% |
월평균 (㎍/㎥)
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 23 |
---|---|
Distinct (%) | 50.0% |
Missing | 3 |
Missing (%) | 6.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 31.782609 |
Minimum | 15 |
---|---|
Maximum | 48 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 573.0 B |
Quantile statistics
Minimum | 15 |
---|---|
5-th percentile | 17 |
Q1 | 25.25 |
median | 33 |
Q3 | 38 |
95-th percentile | 43 |
Maximum | 48 |
Range | 33 |
Interquartile range (IQR) | 12.75 |
Descriptive statistics
Standard deviation | 8.991139 |
---|---|
Coefficient of variation (CV) | 0.28289493 |
Kurtosis | -0.93381235 |
Mean | 31.782609 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -0.43498647 |
Sum | 1462 |
Variance | 80.84058 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
38 | 7 | 14.3% |
17 | 4 | 8.2% |
43 | 4 | 8.2% |
39 | 3 | 6.1% |
18 | 3 | 6.1% |
33 | 3 | 6.1% |
29 | 2 | 4.1% |
36 | 2 | 4.1% |
32 | 2 | 4.1% |
30 | 2 | 4.1% |
Other values (13) | 14 | 28.6% |
(Missing) | 3 | 6.1% |
Value | Count | Frequency (%) |
15 | 1 | 2.0% |
17 | 4 | 8.2% |
18 | 3 | 6.1% |
20 | 1 | 2.0% |
23 | 1 | 2.0% |
24 | 1 | 2.0% |
25 | 1 | 2.0% |
26 | 1 | 2.0% |
27 | 1 | 2.0% |
29 | 2 | 4.1% |
Value | Count | Frequency (%) |
48 | 1 | 2.0% |
43 | 4 | 8.2% |
42 | 1 | 2.0% |
41 | 1 | 2.0% |
40 | 1 | 2.0% |
39 | 3 | 6.1% |
38 | 7 | 14.3% |
36 | 2 | 4.1% |
35 | 2 | 4.1% |
33 | 3 | 6.1% |
24시간치
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 2 |
---|---|
Missing (%) | 4.1% |
Memory size | 524.0 B |
Unnamed: 8
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 2 |
---|---|
Missing (%) | 4.1% |
Memory size | 524.0 B |
Unnamed: 9
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 2 |
---|---|
Missing (%) | 4.1% |
Memory size | 524.0 B |
Unnamed: 10
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 2 |
---|---|
Missing (%) | 4.1% |
Memory size | 524.0 B |
Unnamed: 11
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 2 |
---|---|
Missing (%) | 4.1% |
Memory size | 524.0 B |
Variable | 시,도명 | 측정소명 | 유효 측정일수 | 유효 측정시간 | 월평균 (㎍/㎥) |
---|---|---|---|---|---|
시,도명 | 1.000 | NaN | NaN | NaN | 1.000 |
측정소명 | NaN | 1.000 | 0.616 | 0.870 | 0.000 |
유효 측정일수 | NaN | 0.616 | 1.000 | 1.000 | 0.772 |
유효 측정시간 | NaN | 0.870 | 1.000 | 1.000 | 0.000 |
월평균 (㎍/㎥) | 1.000 | 0.000 | 0.772 | 0.000 | 1.000 |
Variable | 도시명 | 측정소명 |
---|---|---|
도시명 | 1.000 | 1.000 |
측정소명 | 1.000 | 1.000 |
Variable | 유효 측정일수 | 유효 측정시간 | 월평균 (㎍/㎥) | 도시명 | 측정소명 |
---|---|---|---|---|---|
유효 측정일수 | 1.000 | 0.971 | 0.018 | 1.000 | 0.537 |
유효 측정시간 | 0.971 | 1.000 | 0.016 | 1.000 | 0.537 |
월평균 (㎍/㎥) | 0.018 | 0.016 | 1.000 | 1.000 | 0.000 |
도시명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
측정소명 | 0.537 | 0.537 | 0.000 | 1.000 | 1.000 |
Row | 시,도명 | 도시명 | 측정소명 | 유효자료 획득율(%) | 유효 측정일수 | 유효 측정시간 | 월평균 (㎍/㎥) | 24시간치 | Unnamed: 8 | Unnamed: 9 | Unnamed: 10 | Unnamed: 11 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | NaN | <NA> | <NA> | <NA> | 최저 (㎍/㎥) | 최고 (㎍/㎥) | 최고일시 (년월일시) | 기준초과 (회) | 초과율 (%) |
1 | 1월 | 원주시 | 중앙동 | 98.92 | 31 | 736 | 36 | 1 | 65 | 20200111 | 0 | 0 |
2 | <NA> | <NA> | 반곡동 | 99.46 | 31 | 740 | 43 | 7 | 79 | 20200124 | 0 | 0 |
3 | <NA> | <NA> | 문막읍 | 99.46 | 31 | 740 | 38 | 9 | 70 | 20200124 | 0 | 0 |
4 | <NA> | <NA> | 도시평균 | 99.28 | 93 | 2216 | 39 | 1 | 79 | 20200124 | 0 | 0 |
5 | 2월 | 원주시 | 중앙동 | 98.42 | 29 | 685 | 30 | 4 | 70 | 20200202 | 0 | 0 |
6 | <NA> | <NA> | 반곡동 | 98.71 | 28 | 687 | 41 | 11 | 85 | 20200202 | 0 | 0 |
7 | <NA> | <NA> | 문막읍 | 99.28 | 29 | 691 | 38 | 9 | 74 | 20200202 | 0 | 0 |
8 | <NA> | <NA> | 도시평균 | 98.8 | 86 | 2063 | 36 | 4 | 85 | 20200202 | 0 | 0 |
9 | 3월 | 원주시 | 중앙동 | 98.12 | 31 | 730 | 32 | 12 | 54 | 20200325 | 0 | 0 |
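In the station rows above, 유효자료 획득율(%) appears to be the valid measurement hours divided by the total hours in the calendar month. That relationship is inferred from the displayed numbers and is not stated by the dataset, so the sketch below treats it as an assumption; `acquisition_rate` is a hypothetical helper.

```python
import calendar

# (month, 유효 측정시간, reported 유효자료 획득율(%)) for single stations,
# copied from the first rows shown above.
station_rows = [
    (1, 736, 98.92),  # 1월 중앙동
    (1, 740, 99.46),  # 1월 반곡동
    (2, 685, 98.42),  # 2월 중앙동
    (2, 687, 98.71),  # 2월 반곡동
    (3, 730, 98.12),  # 3월 중앙동
]

def acquisition_rate(year, month, valid_hours):
    """Valid hours as a percentage of all hours in the month (assumed formula)."""
    days_in_month = calendar.monthrange(year, month)[1]
    return 100 * valid_hours / (days_in_month * 24)

for month, hours, reported in station_rows:
    computed = acquisition_rate(2020, month, hours)
    print(f"month {month}: reported {reported:.2f}, computed {computed:.2f}")
```

Note that the denominator uses the calendar days of the month (29 for February 2020), not the 유효 측정일수 column, which is 28 for one of the February rows.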
Row | 시,도명 | 도시명 | 측정소명 | 유효자료 획득율(%) | 유효 측정일수 | 유효 측정시간 | 월평균 (㎍/㎥) | 24시간치 | Unnamed: 8 | Unnamed: 9 | Unnamed: 10 | Unnamed: 11 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
39 | <NA> | <NA> | 문막읍 | 99.46 | 31 | 740 | 29 | 10 | 65 | 20201027 | 0 | 0 |
40 | <NA> | <NA> | 도시평균 | 99.28 | 93 | 2216 | 31 | 10 | 76 | 20201027 | 0 | 0 |
41 | 11월 | 원주시 | 중앙동 | 98.47 | 30 | 709 | 40 | 13 | 93 | 20201116 | 0 | 0 |
42 | <NA> | <NA> | 반곡동 | 99.31 | 30 | 715 | 38 | 12 | 85 | 20201116 | 0 | 0 |
43 | <NA> | <NA> | 문막읍 | 99.58 | 30 | 717 | 35 | 15 | 78 | 20201116 | 0 | 0 |
44 | <NA> | <NA> | 도시평균 | 99.12 | 90 | 2141 | 38 | 12 | 93 | 20201116 | 0 | 0 |
45 | 12월 | 원주시 | 중앙동 | 98.79 | 31 | 735 | 43 | 14 | 88 | 20201211 | 0 | 0 |
46 | <NA> | <NA> | 반곡동 | 99.46 | 31 | 740 | 43 | 18 | 85 | 20201211 | 0 | 0 |
47 | <NA> | <NA> | 문막읍 | 99.06 | 31 | 737 | 39 | 18 | 73 | 20201211 | 0 | 0 |
48 | <NA> | <NA> | 도시평균 | 99.1 | 93 | 2212 | 42 | 14 | 88 | 20201211 | 0 | 0 |
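The 도시평균 rows above appear to aggregate the three stations: their 유효 측정일수 and 유효 측정시간 equal the sums of the station values for the same month. This would explain why 유효 측정일수 reaches 93 and why the two columns correlate so strongly. The check below uses only the rows displayed here, so it is an observation on this sample rather than a documented rule; `station_totals` is a hypothetical helper.

```python
# (유효 측정일수, 유효 측정시간) per station, copied from the last rows above.
november = {"중앙동": (30, 709), "반곡동": (30, 715), "문막읍": (30, 717)}
december = {"중앙동": (31, 735), "반곡동": (31, 740), "문막읍": (31, 737)}

# The corresponding 도시평균 rows from the same table.
city_rows = {"11월": (90, 2141), "12월": (93, 2212)}

def station_totals(stations):
    """Sum valid days and valid hours over all stations of one month."""
    days = sum(d for d, _ in stations.values())
    hours = sum(h for _, h in stations.values())
    return days, hours

print("11월:", station_totals(november), "reported:", city_rows["11월"])
print("12월:", station_totals(december), "reported:", city_rows["12월"])
```

For analysis it may be worth separating the 도시평균 rows from the station rows, since mixing them doubles the counts and skews the distributions of both columns.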
Most frequently occurring
Row | 시,도명 | 도시명 | 측정소명 | 유효 측정일수 | 유효 측정시간 | 월평균 (㎍/㎥) | # duplicates |
---|---|---|---|---|---|---|---|
0 | <NA> | <NA> | 문막읍 | 30 | 717 | 35 | 2 |
1 | <NA> | <NA> | 반곡동 | 31 | 740 | 43 | 2 |
2 | <NA> | <NA> | 반곡동 | <NA> | <NA> | <NA> | 2 |