Overview

Dataset statistics

Number of variables: 8
Number of observations: 108
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 4
Duplicate rows (%): 3.7%
Total size in memory: 7.1 KiB
Average record size in memory: 67.2 B

Variable types

Categorical: 6
Text: 1
Numeric: 1

Dataset

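Description: Fire extinguisher equipment data for Line 1, operated by 인천교통공사 (Incheon Transit Corporation). Fields include 철도운영기관명 (railway operator name), 선명 (line name), 역명 (station name), 지상지하구분 (above-ground/underground), 역층 (station floor), 상세위치 (detailed location), 소화기종류 (extinguisher type), and 보유대수 (number of units held).
Author: 국가철도공단 (Korea National Railway)
URL: https://www.data.go.kr/data/15041478/fileData.do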

Alerts

철도운영기관명 has constant value "인천교통공사" [Constant]
선명 has constant value "인천1호선" [Constant]
Dataset has 4 (3.7%) duplicate rows [Duplicates]
역명 is highly overall correlated with 지상지하구분 [High correlation]
지상지하구분 is highly overall correlated with 역명 [High correlation]
지상지하구분 is highly imbalanced (81.7%) [Imbalance]
소화기종류 is highly imbalanced (90.4%) [Imbalance]
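
The two imbalance percentages can be checked by hand. The sketch below assumes the score is one minus the normalized Shannon entropy of the value counts; the report itself does not state the formula, but this definition reproduces both alert figures from the counts listed under Common Values. The function name `imbalance_score` is illustrative only.

```python
import math
from collections import Counter

def imbalance_score(values):
    """Return 1 - H / log2(k): H is the Shannon entropy (in bits) of the
    value frequencies, k the number of distinct values (assumed definition)."""
    counts = Counter(values)
    k = len(counts)
    if k < 2:
        return 0.0  # assumed convention: a single class has nothing to balance
    n = sum(counts.values())
    entropy = -sum((c / n) * math.log2(c / n) for c in counts.values())
    return 1.0 - entropy / math.log2(k)

# Counts taken from the Common Values tables of this report.
ground = ["지하"] * 105 + ["지상"] * 3
kinds = ["분말소화기"] * 106 + ["할론소화기", "이산화탄소소화기"]

print(f"{imbalance_score(ground):.1%}")  # 81.7%
print(f"{imbalance_score(kinds):.1%}")   # 90.4%
```

A score of 0% means all classes are equally frequent; values near 100% mean one class dominates.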

Reproduction

Analysis started: 2023-12-12 23:10:29.118492
Analysis finished: 2023-12-12 23:10:29.782313
Duration: 0.66 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
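
A report of this kind can be regenerated with a few lines of Python. This is a minimal sketch, not the exact command used for this run: the CSV path, the output file name, the report title, and the `cp949` encoding (common for files from data.go.kr) are all assumptions, and the original `config.json` settings are not reproduced here.

```python
def build_report(csv_path, out_path="report.html", encoding="cp949"):
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imports are local so the helper can be defined without the packages installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)  # encoding is an assumption
    profile = ProfileReport(df, title="Incheon Line 1 fire extinguishers")
    profile.to_file(out_path)
    return profile
```

Usage would be `build_report("extinguishers.csv")`, where the file name is a placeholder for the CSV downloaded from the URL listed under Dataset.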

Variables

철도운영기관명
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 996.0 B

Value | Count
인천교통공사 | 108

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 인천교통공사
2nd row: 인천교통공사
3rd row: 인천교통공사
4th row: 인천교통공사
5th row: 인천교통공사

Common Values

Value | Count | Frequency (%)
인천교통공사 | 108 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

선명
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 996.0 B

Value | Count
인천1호선 | 108

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 인천1호선
2nd row: 인천1호선
3rd row: 인천1호선
4th row: 인천1호선
5th row: 인천1호선

Common Values

Value | Count | Frequency (%)
인천1호선 | 108 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

역명
Categorical

HIGH CORRELATION

Distinct: 30
Distinct (%): 27.8%
Missing: 0
Missing (%): 0.0%
Memory size: 996.0 B

Value | Count
인천대입구 | 8
국제업무지구 | 8
부평구청 | 7
부평 | 5
간석오거리 | 4
Other values (25) | 76

Length

Max length: 8
Median length: 6
Mean length: 3.9166667
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 간석오거리
2nd row: 간석오거리
3rd row: 간석오거리
4th row: 간석오거리
5th row: 갈산

Common Values

Value | Count | Frequency (%)
인천대입구 | 8 | 7.4%
국제업무지구 | 8 | 7.4%
부평구청 | 7 | 6.5%
부평 | 5 | 4.6%
간석오거리 | 4 | 3.7%
센트럴파크 | 4 | 3.7%
작전 | 4 | 3.7%
인천시청 | 4 | 3.7%
부평삼거리 | 4 | 3.7%
갈산 | 4 | 3.7%
Other values (20) | 56 | 51.9%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
인천대입구 | 8 | 7.4%
국제업무지구 | 8 | 7.4%
부평구청 | 7 | 6.5%
부평 | 5 | 4.6%
부평삼거리 | 4 | 3.7%
동수 | 4 | 3.7%
신연수 | 4 | 3.7%
갈산 | 4 | 3.7%
인천시청 | 4 | 3.7%
작전 | 4 | 3.7%
Other values (20) | 56 | 51.9%

지상지하구분
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 2
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 996.0 B

Value | Count
지하 (underground) | 105
지상 (above ground) | 3

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 지하
2nd row: 지하
3rd row: 지하
4th row: 지하
5th row: 지하

Common Values

Value | Count | Frequency (%)
지하 | 105 | 97.2%
지상 | 3 | 2.8%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

역층
Categorical

Distinct: 4
Distinct (%): 3.7%
Missing: 0
Missing (%): 0.0%
Memory size: 996.0 B

Value | Count
1 | 50
2 | 42
3 | 9
4 | 7

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2
2nd row: 1
3rd row: 3
4th row: 4
5th row: 2

Common Values

Value | Count | Frequency (%)
1 | 50 | 46.3%
2 | 42 | 38.9%
3 | 9 | 8.3%
4 | 7 | 6.5%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

상세위치
Text

Distinct: 95
Distinct (%): 88.0%
Missing: 0
Missing (%): 0.0%
Memory size: 996.0 B

Length

Max length: 89
Median length: 38
Mean length: 21.296296
Min length: 8

Characters and Unicode

Total characters: 2300
Distinct characters: 168
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
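
As a rough illustration of how such per-category counts can be obtained, the sketch below uses Python's standard `unicodedata` module on two of the sample values from this report. The helper name `category_counts` and the `CATEGORY_NAMES` mapping are illustrative; the report's own implementation may differ.

```python
import unicodedata
from collections import Counter

# Two-letter general category codes mapped to the names used in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter", "Zs": "Space Separator", "Nd": "Decimal Number",
    "Ps": "Open Punctuation", "Pe": "Close Punctuation", "Lu": "Uppercase Letter",
    "Po": "Other Punctuation", "Pd": "Dash Punctuation", "Ll": "Lowercase Letter",
    "Sm": "Math Symbol",
}

def category_counts(texts):
    """Count Unicode general categories over every character in `texts`."""
    return Counter(unicodedata.category(ch) for text in texts for ch in text)

sample = ["(B1) 1~9번 출구 앞/", "(B2)계양역 방향 승강장"]
counts = category_counts(sample)
for code, count in counts.most_common():
    print(CATEGORY_NAMES.get(code, code), count)
```

Hangul syllables fall under Other Letter, and the tilde in "1~9" is classified as a Math Symbol, which matches the categories listed for this variable.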

Unique

Unique: 89
Unique (%): 82.4%

Sample

1st row: (B2) 역무실/ 상/하선E/V앞/ 대합실/ 공조실/ 임대상가 등
2nd row: (B1) 1~9번 출구 앞/
3rd row: (B3) 대합실/ 상선 E/V앞/ 복도/ 창고 등
4th row: (B4) 계양(상선) 방면/ 인천시청 방면(하선)/ 공조실 등
5th row: (B2)계양역 방향 승강장

Words

Value | Count | Frequency (%)
승강장 | 48 | 9.4%
b1 | 42 | 8.3%
b2 | 40 | 7.9%
대합실 | 30 | 5.9%
방면(하행 | 13 | 2.6%
방면(상행 | 13 | 2.6%
(token not shown) | 13 | 2.6%
방향 | 13 | 2.6%
근처 | 11 | 2.2%
b3 | 8 | 1.6%
Other values (153) | 278 | 54.6%

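Most occurring characters

Value | Count | Frequency (%)
(space) | 409 | 17.8%
( | 163 | 7.1%
) | 163 | 7.1%
B | 109 | 4.7%
1 | 80 | 3.5%
(Hangul character not shown) | 64 | 2.8%
/ | 63 | 2.7%
(Hangul character not shown) | 62 | 2.7%
2 | 54 | 2.3%
(Hangul character not shown) | 54 | 2.3%
Other values (158) | 1079 | 46.9%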

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1129 | 49.1%
Space Separator | 409 | 17.8%
Decimal Number | 192 | 8.3%
Open Punctuation | 163 | 7.1%
Close Punctuation | 163 | 7.1%
Uppercase Letter | 157 | 6.8%
Other Punctuation | 65 | 2.8%
Dash Punctuation | 13 | 0.6%
Lowercase Letter | 7 | 0.3%
Math Symbol | 2 | 0.1%

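Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(character not shown) | 64 | 5.7%
(character not shown) | 62 | 5.5%
(character not shown) | 54 | 4.8%
(character not shown) | 54 | 4.8%
(character not shown) | 52 | 4.6%
(character not shown) | 46 | 4.1%
(character not shown) | 40 | 3.5%
(character not shown) | 39 | 3.5%
(character not shown) | 34 | 3.0%
(character not shown) | 32 | 2.8%
Other values (126) | 652 | 57.8%

Decimal Number
Value | Count | Frequency (%)
1 | 80 | 41.7%
2 | 54 | 28.1%
4 | 25 | 13.0%
3 | 15 | 7.8%
5 | 6 | 3.1%
8 | 5 | 2.6%
6 | 3 | 1.6%
7 | 2 | 1.0%
0 | 1 | 0.5%
9 | 1 | 0.5%

Uppercase Letter
Value | Count | Frequency (%)
B | 109 | 69.4%
E | 12 | 7.6%
A | 11 | 7.0%
F | 8 | 5.1%
R | 6 | 3.8%
P | 3 | 1.9%
D | 3 | 1.9%
V | 3 | 1.9%
I | 2 | 1.3%

Lowercase Letter
Value | Count | Frequency (%)
s | 2 | 28.6%
r | 1 | 14.3%
i | 1 | 14.3%
n | 1 | 14.3%
c | 1 | 14.3%
e | 1 | 14.3%

Other Punctuation
Value | Count | Frequency (%)
/ | 63 | 96.9%
' | 2 | 3.1%

Space Separator
Value | Count | Frequency (%)
(space) | 409 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 163 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 163 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 13 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 2 | 100.0%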

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1129 | 49.1%
Common | 1007 | 43.8%
Latin | 164 | 7.1%

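Most frequent character per script

Hangul
Value | Count | Frequency (%)
(character not shown) | 64 | 5.7%
(character not shown) | 62 | 5.5%
(character not shown) | 54 | 4.8%
(character not shown) | 54 | 4.8%
(character not shown) | 52 | 4.6%
(character not shown) | 46 | 4.1%
(character not shown) | 40 | 3.5%
(character not shown) | 39 | 3.5%
(character not shown) | 34 | 3.0%
(character not shown) | 32 | 2.8%
Other values (126) | 652 | 57.8%

Common
Value | Count | Frequency (%)
(space) | 409 | 40.6%
( | 163 | 16.2%
) | 163 | 16.2%
1 | 80 | 7.9%
/ | 63 | 6.3%
2 | 54 | 5.4%
4 | 25 | 2.5%
3 | 15 | 1.5%
- | 13 | 1.3%
5 | 6 | 0.6%
Other values (7) | 16 | 1.6%

Latin
Value | Count | Frequency (%)
B | 109 | 66.5%
E | 12 | 7.3%
A | 11 | 6.7%
F | 8 | 4.9%
R | 6 | 3.7%
P | 3 | 1.8%
D | 3 | 1.8%
V | 3 | 1.8%
s | 2 | 1.2%
I | 2 | 1.2%
Other values (5) | 5 | 3.0%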

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1171 | 50.9%
Hangul | 1129 | 49.1%

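Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 409 | 34.9%
( | 163 | 13.9%
) | 163 | 13.9%
B | 109 | 9.3%
1 | 80 | 6.8%
/ | 63 | 5.4%
2 | 54 | 4.6%
4 | 25 | 2.1%
3 | 15 | 1.3%
- | 13 | 1.1%
Other values (22) | 77 | 6.6%

Hangul
Value | Count | Frequency (%)
(character not shown) | 64 | 5.7%
(character not shown) | 62 | 5.5%
(character not shown) | 54 | 4.8%
(character not shown) | 54 | 4.8%
(character not shown) | 52 | 4.6%
(character not shown) | 46 | 4.1%
(character not shown) | 40 | 3.5%
(character not shown) | 39 | 3.5%
(character not shown) | 34 | 3.0%
(character not shown) | 32 | 2.8%
Other values (126) | 652 | 57.8%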

소화기종류
Categorical

IMBALANCE

Distinct: 3
Distinct (%): 2.8%
Missing: 0
Missing (%): 0.0%
Memory size: 996.0 B

Value | Count
분말소화기 (dry powder extinguisher) | 106
할론소화기 (halon extinguisher) | 1
이산화탄소소화기 (carbon dioxide extinguisher) | 1

Length

Max length: 8
Median length: 5
Mean length: 5.0277778
Min length: 5

Unique

Unique: 2
Unique (%): 1.9%

Sample

1st row: 분말소화기
2nd row: 분말소화기
3rd row: 분말소화기
4th row: 분말소화기
5th row: 분말소화기

Common Values

Value | Count | Frequency (%)
분말소화기 | 106 | 98.1%
할론소화기 | 1 | 0.9%
이산화탄소소화기 | 1 | 0.9%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

보유대수
Real number (ℝ)

Distinct: 28
Distinct (%): 25.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10.444444
Minimum: 1
Maximum: 87
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 4
Median: 8
Q3: 12
95-th percentile: 29.65
Maximum: 87
Range: 86
Interquartile range (IQR): 8

Descriptive statistics

Standard deviation: 11.231908
Coefficient of variation (CV): 1.0753955
Kurtosis: 20.295308
Mean: 10.444444
Median Absolute Deviation (MAD): 4
Skewness: 3.7022874
Sum: 1128
Variance: 126.15576
Monotonicity: Not monotonic
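
Several of the figures for 보유대수 are derived from others, which gives a quick consistency check. The sketch below recomputes them from the values reported in this section; it does not use the raw data, and the variable names are illustrative.

```python
# Values copied from the statistics reported for 보유대수.
n_rows, total = 108, 1128
std = 11.231908
q1, q3 = 4, 12
minimum, maximum = 1, 87

mean = total / n_rows            # Mean = Sum / number of observations
iqr = q3 - q1                    # Interquartile range = Q3 - Q1
value_range = maximum - minimum  # Range = Maximum - Minimum
variance = std ** 2              # Variance = standard deviation squared
cv = std / mean                  # Coefficient of variation = std / mean

print(round(mean, 6), iqr, value_range, round(variance, 4), round(cv, 6))
```

A CV above 1 together with a skewness of about 3.7 is consistent with a long right tail: most locations hold a dozen units or fewer, while a single location holds 87.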

[Figure: Histogram with fixed size bins (bins=28)]

Common values

Value | Count | Frequency (%)
4 | 11 | 10.2%
11 | 10 | 9.3%
8 | 10 | 9.3%
1 | 9 | 8.3%
6 | 9 | 8.3%
5 | 7 | 6.5%
7 | 7 | 6.5%
2 | 7 | 6.5%
16 | 4 | 3.7%
12 | 3 | 2.8%
Other values (18) | 31 | 28.7%

Minimum 10 values

Value | Count | Frequency (%)
1 | 9 | 8.3%
2 | 7 | 6.5%
3 | 3 | 2.8%
4 | 11 | 10.2%
5 | 7 | 6.5%
6 | 9 | 8.3%
7 | 7 | 6.5%
8 | 10 | 9.3%
9 | 3 | 2.8%
10 | 3 | 2.8%

Maximum 10 values

Value | Count | Frequency (%)
87 | 1 | 0.9%
42 | 1 | 0.9%
41 | 1 | 0.9%
37 | 1 | 0.9%
35 | 1 | 0.9%
30 | 1 | 0.9%
29 | 1 | 0.9%
26 | 1 | 0.9%
24 | 1 | 0.9%
22 | 2 | 1.9%

Interactions

[Figure: Interactions plot]

Correlations

[Figure: Correlation heatmap]

Variable | 역명 | 지상지하구분 | 역층 | 상세위치 | 소화기종류 | 보유대수
역명 | 1.000 | 0.970 | 0.000 | 0.965 | 0.519 | 0.764
지상지하구분 | 0.970 | 1.000 | 0.000 | 1.000 | 0.000 | 0.000
역층 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000
상세위치 | 0.965 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000
소화기종류 | 0.519 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000
보유대수 | 0.764 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000

[Figure: Correlation heatmap]

Variable | 지상지하구분 | 역층 | 역명 | 소화기종류
지상지하구분 | 1.000 | 0.000 | 0.750 | 0.000
역층 | 0.000 | 1.000 | 0.000 | 0.000
역명 | 0.750 | 0.000 | 1.000 | 0.239
소화기종류 | 0.000 | 0.000 | 0.239 | 1.000

[Figure: Correlation heatmap]

Variable | 보유대수 | 역명 | 지상지하구분 | 역층 | 소화기종류
보유대수 | 1.000 | 0.369 | 0.000 | 0.000 | 0.000
역명 | 0.369 | 1.000 | 0.750 | 0.000 | 0.239
지상지하구분 | 0.000 | 0.750 | 1.000 | 0.000 | 0.000
역층 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000
소화기종류 | 0.000 | 0.239 | 0.000 | 0.000 | 1.000
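
The report does not say which association measure fills each matrix. As one common choice for a pair of categorical columns, Cramér's V scales the chi-squared statistic to the range 0 to 1. The sketch below is the plain, uncorrected form, written from the textbook definition; it is an illustration and is not claimed to reproduce the numbers above. The function name `cramers_v` is hypothetical.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equally long sequences of categories."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    chi2 = 0.0
    for x, row_total in row_totals.items():
        for y, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association (assumed convention)
    return math.sqrt(chi2 / (n * k))

# Toy data: perfectly associated columns versus independent columns.
print(cramers_v(["a", "a", "b", "b"], ["u", "u", "v", "v"]))  # 1.0
print(cramers_v(["a", "a", "b", "b"], ["u", "v", "u", "v"]))  # 0.0
```

A high value between 역명 and 지상지하구분 is expected regardless of the measure, since each station is either above ground or underground, so the station name largely determines that column.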

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

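First rows

Index | 철도운영기관명 | 선명 | 역명 | 지상지하구분 | 역층 | 상세위치 | 소화기종류 | 보유대수
0 | 인천교통공사 | 인천1호선 | 간석오거리 | 지하 | 2 | (B2) 역무실/ 상/하선E/V앞/ 대합실/ 공조실/ 임대상가 등 | 분말소화기 | 12
1 | 인천교통공사 | 인천1호선 | 간석오거리 | 지하 | 1 | (B1) 1~9번 출구 앞/ | 분말소화기 | 14
2 | 인천교통공사 | 인천1호선 | 간석오거리 | 지하 | 3 | (B3) 대합실/ 상선 E/V앞/ 복도/ 창고 등 | 분말소화기 | 12
3 | 인천교통공사 | 인천1호선 | 간석오거리 | 지하 | 4 | (B4) 계양(상선) 방면/ 인천시청 방면(하선)/ 공조실 등 | 분말소화기 | 24
4 | 인천교통공사 | 인천1호선 | 갈산 | 지하 | 2 | (B2)계양역 방향 승강장 | 분말소화기 | 4
5 | 인천교통공사 | 인천1호선 | 갈산 | 지하 | 1 | (B1)출입구 연결통로 | 분말소화기 | 4
6 | 인천교통공사 | 인천1호선 | 갈산 | 지하 | 2 | (B2)송도달빛축제공원역 방향 승강장 | 분말소화기 | 4
7 | 인천교통공사 | 인천1호선 | 갈산 | 지하 | 1 | (B1) 대합실 | 분말소화기 | 3
8 | 인천교통공사 | 인천1호선 | 경인교대입구 | 지하 | 2 | (B2) 대합실 표내는 곳 근처 | 분말소화기 | 9
9 | 인천교통공사 | 인천1호선 | 경인교대입구 | 지하 | 3 | (B3) 계산 방면(상행) 승강장 4개 (B3) 작전 방면(하행) 승강장 4개 | 분말소화기 | 8

Last rows

Index | 철도운영기관명 | 선명 | 역명 | 지상지하구분 | 역층 | 상세위치 | 소화기종류 | 보유대수
98 | 인천교통공사 | 인천1호선 | 작전 | 지하 | 1 | (B1) 대합실 표 내는 곳 앞 | 분말소화기 | 2
99 | 인천교통공사 | 인천1호선 | 지식정보단지 | 지하 | 2 | (B2)계양방면(상행)승강장11개/송도달빛방면(하행)승강장11개 | 분말소화기 | 22
100 | 인천교통공사 | 인천1호선 | 지식정보단지 | 지하 | 1 | (B1)대합실 및 연결통로 | 분말소화기 | 37
101 | 인천교통공사 | 인천1호선 | 지식정보단지 | 지하 | 1 | (B1)역무실1개/기계실5개/전기실2개 | 이산화탄소소화기 | 8
102 | 인천교통공사 | 인천1호선 | 캠퍼스타운 | 지하 | 2 | (B2) 하선 승강장 | 분말소화기 | 11
103 | 인천교통공사 | 인천1호선 | 캠퍼스타운 | 지하 | 1 | (B1) 대합실 | 분말소화기 | 15
104 | 인천교통공사 | 인천1호선 | 캠퍼스타운 | 지하 | 2 | (B2) 상선 승강장 | 분말소화기 | 11
105 | 인천교통공사 | 인천1호선 | 테크노파크 | 지하 | 1 | (B1) 대합실 | 분말소화기 | 87
106 | 인천교통공사 | 인천1호선 | 테크노파크 | 지하 | 2 | (B2) 캠퍼스타운 방면(상행) 승강장 | 분말소화기 | 16
107 | 인천교통공사 | 인천1호선 | 테크노파크 | 지하 | 2 | (B2) 지식정보단지 방면(하행) 승강장 | 분말소화기 | 16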

Duplicate rows

Most frequently occurring

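Index | 철도운영기관명 | 선명 | 역명 | 지상지하구분 | 역층 | 상세위치 | 소화기종류 | 보유대수 | # duplicates
0 | 인천교통공사 | 인천1호선 | 국제업무지구 | 지하 | 1 | (B1) 표 내는 곳 내부(PAID AREA) | 분말소화기 | 7 | 2
1 | 인천교통공사 | 인천1호선 | 국제업무지구 | 지하 | 1 | (B1) 표 내는 곳 외부(FREE AREA) | 분말소화기 | 6 | 2
2 | 인천교통공사 | 인천1호선 | 국제업무지구 | 지하 | 2 | (B2) 센트럴파크역 방향 승강장 | 분말소화기 | 11 | 2
3 | 인천교통공사 | 인천1호선 | 국제업무지구 | 지하 | 2 | (B2) 송도달빛축제공원역 방향 승강장 | 분말소화기 | 11 | 2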
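
A table like this can be rebuilt by counting identical rows. The sketch below uses only the standard library and a shortened version of the four duplicated rows plus one unique row; the helper name `duplicate_summary` is illustrative. It returns both the duplicated groups and the number of surplus copies. In this dataset both figures are 4, so the "Duplicate rows 4" statistic does not reveal which of the two the report counts.

```python
from collections import Counter

def duplicate_summary(rows):
    """Return ({row: occurrences} for rows seen more than once, surplus copies)."""
    counts = Counter(rows)
    groups = {row: c for row, c in counts.items() if c > 1}
    surplus = sum(c - 1 for c in groups.values())
    return groups, surplus

# Rows as (역명, 역층, 상세위치, 보유대수); the constant columns are omitted for brevity.
rows = [
    ("국제업무지구", 1, "(B1) 표 내는 곳 내부(PAID AREA)", 7),
    ("국제업무지구", 1, "(B1) 표 내는 곳 외부(FREE AREA)", 6),
    ("국제업무지구", 2, "(B2) 센트럴파크역 방향 승강장", 11),
    ("국제업무지구", 2, "(B2) 송도달빛축제공원역 방향 승강장", 11),
] * 2 + [("테크노파크", 1, "(B1) 대합실", 87)]

groups, surplus = duplicate_summary(rows)
print(len(groups), surplus)  # 4 4
print(f"{4 / 108:.1%}")      # 3.7%, the share reported for the full 108 rows
```

Before dropping such rows, it is worth checking with the data owner whether they are true repeats or separate installations that happen to share a description.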