Overview

Dataset statistics

Number of variables8
Number of observations95
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.3 KiB
Average record size in memory67.4 B

Variable types

Categorical7
Numeric1

Dataset

Description부산교통공사에서 운영하는 1호선의 소화기설비에 대한 데이터로 철도운영기관명,선명,역명,지상지하구분,역층,상세위치,소화기종류,보유대수 등이 있습니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15041468/fileData.do

Alerts

철도운영기관명 has constant value ""Constant
선명 has constant value ""Constant
보유대수 is highly overall correlated with 상세위치 and 1 other fieldsHigh correlation
역명 is highly overall correlated with 지상지하구분High correlation
지상지하구분 is highly overall correlated with 역명High correlation
상세위치 is highly overall correlated with 보유대수 and 1 other fieldsHigh correlation
소화기종류 is highly overall correlated with 보유대수 and 1 other fieldsHigh correlation
소화기종류 is highly imbalanced (81.5%)Imbalance

Reproduction

Analysis started2023-12-11 23:55:18.563975
Analysis finished2023-12-11 23:55:19.133074
Duration0.57 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size892.0 B
부산교통공사
95 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산교통공사
2nd row부산교통공사
3rd row부산교통공사
4th row부산교통공사
5th row부산교통공사

Common Values

ValueCountFrequency (%)
부산교통공사 95
100.0%

Length

2023-12-12T08:55:19.191735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:55:19.270984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산교통공사 95
100.0%

선명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size892.0 B
1호선
95 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1호선
2nd row1호선
3rd row1호선
4th row1호선
5th row1호선

Common Values

ValueCountFrequency (%)
1호선 95
100.0%

Length

2023-12-12T08:55:19.364054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:55:19.443192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1호선 95
100.0%

역명
Categorical

HIGH CORRELATION 

Distinct40
Distinct (%)42.1%
Missing0
Missing (%)0.0%
Memory size892.0 B
서면
 
6
대티
 
5
다대포해수욕장
 
4
신장림
 
4
장림
 
3
Other values (35)
73 

Length

Max length7
Median length2
Mean length2.4947368
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row노포
2nd row노포
3rd row범어사
4th row범어사
5th row남산

Common Values

ValueCountFrequency (%)
서면 6
 
6.3%
대티 5
 
5.3%
다대포해수욕장 4
 
4.2%
신장림 4
 
4.2%
장림 3
 
3.2%
서대신 3
 
3.2%
낫개 3
 
3.2%
다대포항 3
 
3.2%
부산진 2
 
2.1%
장전 2
 
2.1%
Other values (30) 60
63.2%

Length

2023-12-12T08:55:19.540745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서면 6
 
6.3%
대티 5
 
5.3%
다대포해수욕장 4
 
4.2%
신장림 4
 
4.2%
장림 3
 
3.2%
서대신 3
 
3.2%
낫개 3
 
3.2%
다대포항 3
 
3.2%
토성 2
 
2.1%
범어사 2
 
2.1%
Other values (30) 60
63.2%

지상지하구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size892.0 B
지하
80 
지상
15 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지상
2nd row지상
3rd row지하
4th row지하
5th row지하

Common Values

ValueCountFrequency (%)
지하 80
84.2%
지상 15
 
15.8%

Length

2023-12-12T08:55:19.657184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:55:19.749390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지하 80
84.2%
지상 15
 
15.8%

역층
Categorical

Distinct5
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Memory size892.0 B
1
47 
2
41 
3
 
4
4
 
2
5
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)1.1%

Sample

1st row1
2nd row2
3rd row2
4th row3
5th row1

Common Values

ValueCountFrequency (%)
1 47
49.5%
2 41
43.2%
3 4
 
4.2%
4 2
 
2.1%
5 1
 
1.1%

Length

2023-12-12T08:55:19.859536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:55:19.972212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 47
49.5%
2 41
43.2%
3 4
 
4.2%
4 2
 
2.1%
5 1
 
1.1%

상세위치
Categorical

HIGH CORRELATION 

Distinct24
Distinct (%)25.3%
Missing0
Missing (%)0.0%
Memory size892.0 B
대합실
36 
승강장
32 
하선 승강장
 
2
지하1층
 
2
지하2층
 
2
Other values (19)
21 

Length

Max length16
Median length3
Mean length4.3473684
Min length3

Unique

Unique17 ?
Unique (%)17.9%

Sample

1st row승강장
2nd row대합실
3rd row대합실
4th row승강장
5th row대합실

Common Values

ValueCountFrequency (%)
대합실 36
37.9%
승강장 32
33.7%
하선 승강장 2
 
2.1%
지하1층 2
 
2.1%
지하2층 2
 
2.1%
지하1층 임대매장 2
 
2.1%
상선 승강장 2
 
2.1%
대합실/ 기능실/ 고객센터 등 1
 
1.1%
환승통로/ 승강장 1
 
1.1%
대합실/ 승강장 1
 
1.1%
Other values (14) 14
 
14.7%

Length

2023-12-12T08:55:20.114353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
대합실 44
36.1%
승강장 41
33.6%
지하1층 4
 
3.3%
고객센터 4
 
3.3%
4
 
3.3%
임대매장 3
 
2.5%
기능실 3
 
2.5%
수유실 2
 
1.6%
하선 2
 
1.6%
지하2층 2
 
1.6%
Other values (12) 13
 
10.7%

소화기종류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size892.0 B
분말소화기
90 
이산화탄소소화기
 
3
N2 축압식
 
1
분말소화기
 
1

Length

Max length8
Median length5
Mean length5.1157895
Min length5

Unique

Unique2 ?
Unique (%)2.1%

Sample

1st row분말소화기
2nd row분말소화기
3rd row분말소화기
4th row분말소화기
5th row분말소화기

Common Values

ValueCountFrequency (%)
분말소화기 90
94.7%
이산화탄소소화기 3
 
3.2%
N2 축압식 1
 
1.1%
분말소화기 1
 
1.1%

Length

2023-12-12T08:55:20.247256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:55:20.338004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
분말소화기 91
94.8%
이산화탄소소화기 3
 
3.1%
n2 1
 
1.0%
축압식 1
 
1.0%

보유대수
Real number (ℝ)

HIGH CORRELATION 

Distinct28
Distinct (%)29.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.315789
Minimum1
Maximum53
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size987.0 B
2023-12-12T08:55:20.433670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q15
median11
Q314
95-th percentile27.3
Maximum53
Range52
Interquartile range (IQR)9

Descriptive statistics

Standard deviation8.5430862
Coefficient of variation (CV)0.75497041
Kurtosis5.9724667
Mean11.315789
Median Absolute Deviation (MAD)5
Skewness1.9379746
Sum1075
Variance72.984323
MonotonicityNot monotonic
2023-12-12T08:55:20.563453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=28)
ValueCountFrequency (%)
12 14
14.7%
5 11
 
11.6%
14 6
 
6.3%
13 5
 
5.3%
8 5
 
5.3%
10 5
 
5.3%
2 5
 
5.3%
6 5
 
5.3%
1 4
 
4.2%
3 4
 
4.2%
Other values (18) 31
32.6%
ValueCountFrequency (%)
1 4
 
4.2%
2 5
5.3%
3 4
 
4.2%
4 3
 
3.2%
5 11
11.6%
6 5
5.3%
7 3
 
3.2%
8 5
5.3%
9 2
 
2.1%
10 5
5.3%
ValueCountFrequency (%)
53 1
1.1%
39 1
1.1%
31 1
1.1%
28 2
2.1%
27 1
1.1%
26 1
1.1%
25 1
1.1%
24 1
1.1%
23 2
2.1%
19 1
1.1%

Interactions

2023-12-12T08:55:18.858251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:55:20.672253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역명지상지하구분역층상세위치소화기종류보유대수
역명1.0000.9980.0000.0000.0000.261
지상지하구분0.9981.0000.0000.0000.0000.168
역층0.0000.0001.0000.0000.0000.210
상세위치0.0000.0000.0001.0000.9240.953
소화기종류0.0000.0000.0000.9241.0000.866
보유대수0.2610.1680.2100.9530.8661.000
2023-12-12T08:55:20.783355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지상지하구분역명역층상세위치소화기종류
지상지하구분1.0000.7430.0000.0000.000
역명0.7431.0000.0000.0000.000
역층0.0000.0001.0000.0000.000
상세위치0.0000.0000.0001.0000.610
소화기종류0.0000.0000.0000.6101.000
2023-12-12T08:55:20.879039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
보유대수역명지상지하구분역층상세위치소화기종류
보유대수1.0000.0000.1190.1280.6440.533
역명0.0001.0000.7430.0000.0000.000
지상지하구분0.1190.7431.0000.0000.0000.000
역층0.1280.0000.0001.0000.0000.000
상세위치0.6440.0000.0000.0001.0000.610
소화기종류0.5330.0000.0000.0000.6101.000

Missing values

2023-12-12T08:55:18.963011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:55:19.079099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관명선명역명지상지하구분역층상세위치소화기종류보유대수
0부산교통공사1호선노포지상1승강장분말소화기12
1부산교통공사1호선노포지상2대합실분말소화기3
2부산교통공사1호선범어사지하2대합실분말소화기5
3부산교통공사1호선범어사지하3승강장분말소화기12
4부산교통공사1호선남산지하1대합실분말소화기5
5부산교통공사1호선남산지하2승강장분말소화기14
6부산교통공사1호선두실지하1대합실분말소화기5
7부산교통공사1호선두실지하2승강장분말소화기12
8부산교통공사1호선구서지상2대합실분말소화기6
9부산교통공사1호선구서지상3승강장분말소화기12
철도운영기관명선명역명지상지하구분역층상세위치소화기종류보유대수
85부산교통공사1호선다대포해수욕장지하1고객센터분말소화기4
86부산교통공사1호선다대포해수욕장지하1대합실분말소화기14
87부산교통공사1호선다대포해수욕장지하2하선 승강장분말소화기8
88부산교통공사1호선다대포해수욕장지하2상선 승강장분말소화기8
89부산교통공사1호선다대포항지하1대합실분말소화기3
90부산교통공사1호선다대포항지하1대합실분말소화기14
91부산교통공사1호선다대포항지하2승강장분말소화기16
92부산교통공사1호선낫개지하1대합실/ 수유실 등분말소화기28
93부산교통공사1호선낫개지하13번/ 6번 출입구 앞분말소화기2
94부산교통공사1호선낫개지하3승강장분말소화기19