Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 332 |
Missing cells | 232 |
Missing cells (%) | 8.7% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 21.2 KiB |
Average record size in memory | 65.4 B |
Variable types
Categorical | 4 |
---|---|
Text | 4 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울교통공사 |
URL | https://data.seoul.go.kr/dataList/OA-2732/F/1/datasetView.do |
시 설 명 (역사명) has 232 (69.9%) missing values | Missing |
Reproduction
Analysis started | 2024-04-29 22:00:01.813393 |
---|---|
Analysis finished | 2024-04-29 22:00:02.283742 |
Duration | 0.47 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
호선
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 1.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.7 KiB |
2 | |
---|---|
3 | |
4 | |
1 | |
<NA> | 2 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.0180723 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
2 | 120 | |
3 | 108 | |
4 | 69 | |
1 | 33 | 9.9% |
<NA> | 2 | 0.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 120 | |
3 | 108 | |
4 | 69 | |
1 | 33 | 9.9% |
na | 2 | 0.6% |
시 설 명 (역사명)
Text
MISSING
 
Distinct | 91 |
---|---|
Distinct (%) | 91.0% |
Missing | 232 |
Missing (%) | 69.9% |
Memory size | 2.7 KiB |
Value | Count | Frequency (%) |
대 | 5 | 3.3% |
신 | 5 | 3.3% |
사 | 3 | 2.0% |
청 | 3 | 2.0% |
수 | 3 | 2.0% |
당 | 3 | 2.0% |
현 | 2 | 1.3% |
삼 | 2 | 1.3% |
금 | 2 | 1.3% |
서 | 2 | 1.3% |
Other values (103) | 121 |
Most occurring characters
Value | Count | Frequency (%) |
137 | ||
대 | 17 | 3.9% |
신 | 13 | 2.9% |
동 | 11 | 2.5% |
구 | 11 | 2.5% |
로 | 10 | 2.3% |
문 | 9 | 2.0% |
가 | 7 | 1.6% |
입 | 7 | 1.6% |
청 | 6 | 1.4% |
Other values (115) | 213 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 295 | |
Space Separator | 137 | |
Decimal Number | 6 | 1.4% |
Control | 3 | 0.7% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 17 | 5.8% |
신 | 13 | 4.4% |
동 | 11 | 3.7% |
구 | 11 | 3.7% |
로 | 10 | 3.4% |
문 | 9 | 3.1% |
가 | 7 | 2.4% |
입 | 7 | 2.4% |
청 | 6 | 2.0% |
사 | 5 | 1.7% |
Other values (110) | 199 |
Decimal Number
Value | Count | Frequency (%) |
3 | 4 | |
5 | 1 | 16.7% |
4 | 1 | 16.7% |
Space Separator
Value | Count | Frequency (%) |
137 |
Control
Value | Count | Frequency (%) |
3 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 295 | |
Common | 146 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 17 | 5.8% |
신 | 13 | 4.4% |
동 | 11 | 3.7% |
구 | 11 | 3.7% |
로 | 10 | 3.4% |
문 | 9 | 3.1% |
가 | 7 | 2.4% |
입 | 7 | 2.4% |
청 | 6 | 2.0% |
사 | 5 | 1.7% |
Other values (110) | 199 |
Common
Value | Count | Frequency (%) |
137 | ||
3 | 4 | 2.7% |
3 | 2.1% | |
5 | 1 | 0.7% |
4 | 1 | 0.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 295 | |
ASCII | 146 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
137 | ||
3 | 4 | 2.7% |
3 | 2.1% | |
5 | 1 | 0.7% |
4 | 1 | 0.7% |
Hangul
Value | Count | Frequency (%) |
대 | 17 | 5.8% |
신 | 13 | 4.4% |
동 | 11 | 3.7% |
구 | 11 | 3.7% |
로 | 10 | 3.4% |
문 | 9 | 3.1% |
가 | 7 | 2.4% |
입 | 7 | 2.4% |
청 | 6 | 2.0% |
사 | 5 | 1.7% |
Other values (110) | 199 |
측정 지점
Categorical
Distinct | 8 |
---|---|
Distinct (%) | 2.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.7 KiB |
평 균 | |
---|---|
승강장 | |
대합실 | |
대합실-1 | |
대합실-2 | |
Other values (3) |
Length
Max length | 6 |
---|---|
Median length | 3 |
Mean length | 3.5662651 |
Min length | 3 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 0.6% |
Sample
1st row | <NA> |
---|---|
2nd row | 공기질 기준 |
3rd row | 평 균 |
4th row | 승강장 |
5th row | 대합실-1 |
Common Values
Value | Count | Frequency (%) |
평 균 | 100 | |
승강장 | 100 | |
대합실 | 82 | |
대합실-1 | 18 | 5.4% |
대합실-2 | 18 | 5.4% |
환승통로 | 12 | 3.6% |
<NA> | 1 | 0.3% |
공기질 기준 | 1 | 0.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
평 | 100 | |
균 | 100 | |
승강장 | 100 | |
대합실 | 82 | |
대합실-1 | 18 | 4.2% |
대합실-2 | 18 | 4.2% |
환승통로 | 12 | 2.8% |
na | 1 | 0.2% |
공기질 | 1 | 0.2% |
기준 | 1 | 0.2% |
유지기준
Text
Distinct | 195 |
---|---|
Distinct (%) | 58.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.7 KiB |
Value | Count | Frequency (%) |
97.5 | 6 | 1.8% |
98.4 | 5 | 1.5% |
90.9 | 5 | 1.5% |
93.6 | 5 | 1.5% |
92.8 | 4 | 1.2% |
87.2 | 4 | 1.2% |
95.8 | 4 | 1.2% |
97.4 | 4 | 1.2% |
95.7 | 4 | 1.2% |
95.1 | 4 | 1.2% |
Other values (185) | 287 |
Most occurring characters
Value | Count | Frequency (%) |
. | 297 | |
9 | 235 | |
8 | 176 | |
1 | 109 | 8.4% |
7 | 86 | 6.6% |
5 | 75 | 5.8% |
0 | 72 | 5.5% |
4 | 70 | 5.4% |
3 | 60 | 4.6% |
2 | 60 | 4.6% |
Other values (6) | 63 | 4.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1001 | |
Other Punctuation | 298 | 22.9% |
Other Symbol | 2 | 0.2% |
Uppercase Letter | 2 | 0.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
9 | 235 | |
8 | 176 | |
1 | 109 | |
7 | 86 | 8.6% |
5 | 75 | 7.5% |
0 | 72 | 7.2% |
4 | 70 | 7.0% |
3 | 60 | 6.0% |
2 | 60 | 6.0% |
6 | 58 | 5.8% |
Other Punctuation
Value | Count | Frequency (%) |
. | 297 | |
/ | 1 | 0.3% |
Other Symbol
Value | Count | Frequency (%) |
㎍ | 1 | |
㎥ | 1 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 1 | |
M | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1301 | |
Latin | 2 | 0.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
. | 297 | |
9 | 235 | |
8 | 176 | |
1 | 109 | 8.4% |
7 | 86 | 6.6% |
5 | 75 | 5.8% |
0 | 72 | 5.5% |
4 | 70 | 5.4% |
3 | 60 | 4.6% |
2 | 60 | 4.6% |
Other values (4) | 61 | 4.7% |
Latin
Value | Count | Frequency (%) |
P | 1 | |
M | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1301 | |
CJK Compat | 2 | 0.2% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 297 | |
9 | 235 | |
8 | 176 | |
1 | 109 | 8.4% |
7 | 86 | 6.6% |
5 | 75 | 5.8% |
0 | 72 | 5.5% |
4 | 70 | 5.4% |
3 | 60 | 4.6% |
2 | 60 | 4.6% |
Other values (4) | 61 | 4.7% |
CJK Compat
Value | Count | Frequency (%) |
㎍ | 1 | |
㎥ | 1 |
유지기준.1
Text
Distinct | 177 |
---|---|
Distinct (%) | 53.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.7 KiB |
Value | Count | Frequency (%) |
482 | 6 | 1.8% |
466 | 4 | 1.2% |
528 | 4 | 1.2% |
415 | 4 | 1.2% |
503 | 4 | 1.2% |
502 | 4 | 1.2% |
508 | 4 | 1.2% |
530 | 4 | 1.2% |
454 | 4 | 1.2% |
520 | 4 | 1.2% |
Other values (167) | 290 |
Most occurring characters
Value | Count | Frequency (%) |
4 | 229 | |
5 | 198 | |
6 | 101 | |
0 | 79 | 7.9% |
7 | 76 | 7.6% |
8 | 70 | 7.0% |
3 | 68 | 6.8% |
2 | 66 | 6.6% |
1 | 60 | 6.0% |
9 | 48 | 4.8% |
Other values (5) | 6 | 0.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 995 | |
Lowercase Letter | 3 | 0.3% |
Uppercase Letter | 2 | 0.2% |
Other Punctuation | 1 | 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
4 | 229 | |
5 | 198 | |
6 | 101 | |
0 | 79 | 7.9% |
7 | 76 | 7.6% |
8 | 70 | 7.0% |
3 | 68 | 6.8% |
2 | 66 | 6.6% |
1 | 60 | 6.0% |
9 | 48 | 4.8% |
Lowercase Letter
Value | Count | Frequency (%) |
p | 2 | |
m | 1 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 1 | |
O | 1 |
Other Punctuation
Value | Count | Frequency (%) |
, | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 996 | |
Latin | 5 | 0.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
4 | 229 | |
5 | 198 | |
6 | 101 | |
0 | 79 | 7.9% |
7 | 76 | 7.6% |
8 | 70 | 7.0% |
3 | 68 | 6.8% |
2 | 66 | 6.6% |
1 | 60 | 6.0% |
9 | 48 | 4.8% |
Latin
Value | Count | Frequency (%) |
p | 2 | |
m | 1 | |
C | 1 | |
O | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1001 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
4 | 229 | |
5 | 198 | |
6 | 101 | |
0 | 79 | 7.9% |
7 | 76 | 7.6% |
8 | 70 | 7.0% |
3 | 68 | 6.8% |
2 | 66 | 6.6% |
1 | 60 | 6.0% |
9 | 48 | 4.8% |
Other values (5) | 6 | 0.6% |
유지기준.2
Text
Distinct | 182 |
---|---|
Distinct (%) | 54.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.7 KiB |
Value | Count | Frequency (%) |
12.8 | 6 | 1.8% |
16.8 | 6 | 1.8% |
16.3 | 6 | 1.8% |
13.4 | 5 | 1.5% |
16 | 5 | 1.5% |
7.9 | 5 | 1.5% |
17 | 5 | 1.5% |
13.9 | 5 | 1.5% |
14.7 | 5 | 1.5% |
15.9 | 4 | 1.2% |
Other values (172) | 280 |
Most occurring characters
Value | Count | Frequency (%) |
. | 293 | |
1 | 282 | |
2 | 108 | 8.9% |
3 | 82 | 6.8% |
7 | 77 | 6.4% |
6 | 72 | 5.9% |
4 | 69 | 5.7% |
5 | 68 | 5.6% |
9 | 62 | 5.1% |
8 | 60 | 5.0% |
Other values (7) | 39 | 3.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 912 | |
Other Punctuation | 294 | 24.3% |
Uppercase Letter | 4 | 0.3% |
Other Symbol | 2 | 0.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 282 | |
2 | 108 | 11.8% |
3 | 82 | 9.0% |
7 | 77 | 8.4% |
6 | 72 | 7.9% |
4 | 69 | 7.6% |
5 | 68 | 7.5% |
9 | 62 | 6.8% |
8 | 60 | 6.6% |
0 | 32 | 3.5% |
Uppercase Letter
Value | Count | Frequency (%) |
H | 2 | |
C | 1 | |
O | 1 |
Other Punctuation
Value | Count | Frequency (%) |
. | 293 | |
/ | 1 | 0.3% |
Other Symbol
Value | Count | Frequency (%) |
㎍ | 1 | |
㎥ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1208 | |
Latin | 4 | 0.3% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
. | 293 | |
1 | 282 | |
2 | 108 | 8.9% |
3 | 82 | 6.8% |
7 | 77 | 6.4% |
6 | 72 | 6.0% |
4 | 69 | 5.7% |
5 | 68 | 5.6% |
9 | 62 | 5.1% |
8 | 60 | 5.0% |
Other values (4) | 35 | 2.9% |
Latin
Value | Count | Frequency (%) |
H | 2 | |
C | 1 | |
O | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1210 | |
CJK Compat | 2 | 0.2% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 293 | |
1 | 282 | |
2 | 108 | 8.9% |
3 | 82 | 6.8% |
7 | 77 | 6.4% |
6 | 72 | 6.0% |
4 | 69 | 5.7% |
5 | 68 | 5.6% |
9 | 62 | 5.1% |
8 | 60 | 5.0% |
Other values (5) | 37 | 3.1% |
CJK Compat
Value | Count | Frequency (%) |
㎍ | 1 | |
㎥ | 1 |
유지기준.3
Categorical
Distinct | 19 |
---|---|
Distinct (%) | 5.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.7 KiB |
1 | |
---|---|
0.5 | |
0.7 | |
0.8 | |
0.6 | |
Other values (14) |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 2.686747 |
Min length | 1 |
Unique
Unique | 5 ? |
---|---|
Unique (%) | 1.5% |
Sample
1st row | CO |
---|---|
2nd row | 9ppm |
3rd row | 0.6 |
4th row | 0.6 |
5th row | 0.6 |
Common Values
Value | Count | Frequency (%) |
1 | 50 | |
0.5 | 47 | |
0.7 | 40 | |
0.8 | 37 | |
0.6 | 37 | |
0.4 | 32 | |
0.9 | 32 | |
0.3 | 17 | 5.1% |
1.1 | 15 | 4.5% |
0.2 | 6 | 1.8% |
Other values (9) | 19 | 5.7% |
Length
Value | Count | Frequency (%) |
1 | 50 | |
0.5 | 47 | |
0.7 | 40 | |
0.8 | 37 | |
0.6 | 37 | |
0.4 | 32 | |
0.9 | 32 | |
0.3 | 17 | 5.1% |
1.1 | 15 | 4.5% |
0.2 | 6 | 1.8% |
Other values (9) | 19 | 5.7% |
유지기준.4
Categorical
Distinct | 16 |
---|---|
Distinct (%) | 4.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.7 KiB |
0.0004 | |
---|---|
0.0008 | |
0.0006 | |
0.0012 | 14 |
0 | 13 |
Other values (11) |
Length
Max length | 8 |
---|---|
Median length | 6 |
Mean length | 5.7771084 |
Min length | 1 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 0.9% |
Sample
1st row | 석면 |
---|---|
2nd row | 0.01개/cc |
3rd row | 0.0005 |
4th row | 0.0008 |
5th row | 0.0004 |
Common Values
Value | Count | Frequency (%) |
0.0004 | 141 | |
0.0008 | 96 | |
0.0006 | 17 | 5.1% |
0.0012 | 14 | 4.2% |
0 | 13 | 3.9% |
0.0005 | 12 | 3.6% |
0.0009 | 11 | 3.3% |
0.0013 | 8 | 2.4% |
0.001 | 7 | 2.1% |
0.0011 | 4 | 1.2% |
Other values (6) | 9 | 2.7% |
Length
Value | Count | Frequency (%) |
0.0004 | 141 | |
0.0008 | 96 | |
0.0006 | 17 | 5.1% |
0.0012 | 14 | 4.2% |
0 | 13 | 3.9% |
0.0005 | 12 | 3.6% |
0.0009 | 11 | 3.3% |
0.0013 | 8 | 2.4% |
0.001 | 7 | 2.1% |
0.0011 | 4 | 1.2% |
Other values (6) | 9 | 2.7% |
호선 | 시 설 명 (역사명) | 측정 지점 | 유지기준.3 | 유지기준.4 | |
---|---|---|---|---|---|
호선 | 1.000 | 0.000 | 0.000 | 0.352 | 0.039 |
시 설 명\n(역사명) | 0.000 | 1.000 | NaN | 0.911 | 0.695 |
측정\n지점 | 0.000 | NaN | 1.000 | 0.702 | 0.760 |
유지기준.3 | 0.352 | 0.911 | 0.702 | 1.000 | 0.748 |
유지기준.4 | 0.039 | 0.695 | 0.760 | 0.748 | 1.000 |
유지기준.4 | 측정 지점 | 유지기준.3 | 호선 | |
---|---|---|---|---|
유지기준.4 | 1.000 | 0.468 | 0.341 | 0.019 |
측정\n지점 | 0.468 | 1.000 | 0.400 | 0.000 |
유지기준.3 | 0.341 | 0.400 | 1.000 | 0.198 |
호선 | 0.019 | 0.000 | 0.198 | 1.000 |
호선 | 측정 지점 | 유지기준.3 | 유지기준.4 | |
---|---|---|---|---|
호선 | 1.000 | 0.000 | 0.198 | 0.019 |
측정\n지점 | 0.000 | 1.000 | 0.400 | 0.468 |
유지기준.3 | 0.198 | 0.400 | 1.000 | 0.341 |
유지기준.4 | 0.019 | 0.468 | 0.341 | 1.000 |
호선 | 시 설 명 (역사명) | 측정 지점 | 유지기준 | 유지기준.1 | 유지기준.2 | 유지기준.3 | 유지기준.4 | |
---|---|---|---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | PM10 | CO2 | HCHO | CO | 석면 |
1 | <NA> | <NA> | 공기질 기준 | 140㎍/㎥ | 1,000ppm | 100㎍/㎥ | 9ppm | 0.01개/cc |
2 | 1 | 서울역 | 평 균 | 97 | 638 | 13.7 | 0.6 | 0.0005 |
3 | 1 | <NA> | 승강장 | 121.7 | 656 | 13.4 | 0.6 | 0.0008 |
4 | 1 | <NA> | 대합실-1 | 81.4 | 616 | 14 | 0.6 | 0.0004 |
5 | 1 | <NA> | 대합실-2 | 87.9 | 643 | 13.7 | 0.6 | 0.0004 |
6 | 1 | 시 청 | 평 균 | 98.5 | 587 | 14.7 | 0.6 | 0.0004 |
7 | 1 | <NA> | 승강장 | 101 | 600 | 13 | 0.7 | 0 |
8 | 1 | <NA> | 대합실-1 | 95.5 | 613 | 16.9 | 0.6 | 0.0008 |
9 | 1 | <NA> | 대합실-2 | 99.1 | 549 | 14.3 | 0.6 | 0.0004 |
호선 | 시 설 명 (역사명) | 측정 지점 | 유지기준 | 유지기준.1 | 유지기준.2 | 유지기준.3 | 유지기준.4 | |
---|---|---|---|---|---|---|---|---|
322 | 4 | 총신대 입구 | 평 균 | 96.8 | 475 | 18.3 | 0.6 | 0.0004 |
323 | 4 | <NA> | 승강장 | 101.3 | 485 | 14.7 | 0.7 | 0.0008 |
324 | 4 | <NA> | 대합실 | 92.2 | 464 | 21.8 | 0.5 | 0.0001 |
325 | 4 | 사 당 | 평 균 | 95.6 | 543 | 20.9 | 0.6 | 0.0004 |
326 | 4 | <NA> | 승강장 | 93.4 | 597 | 22.3 | 0.8 | 0.0004 |
327 | 4 | <NA> | 대합실 | 97.5 | 478 | 21.2 | 0.4 | 0 |
328 | 4 | <NA> | 환승통로 | 96 | 553 | 19.3 | 0.5 | 0.0008 |
329 | 4 | 남태령 | 평 균 | 92.7 | 479 | 7.9 | 0.6 | 0.0008 |
330 | 4 | <NA> | 승강장 | 91.9 | 485 | 6.6 | 0.7 | 0.0008 |
331 | 4 | <NA> | 대합실 | 93.5 | 472 | 9.1 | 0.4 | 0.0008 |