Overview

Dataset statistics

Number of variables8
Number of observations64
Missing cells5
Missing cells (%)1.0%
Duplicate rows1
Duplicate rows (%)1.6%
Total size in memory4.3 KiB
Average record size in memory68.1 B

Variable types

Categorical6
Numeric2

Dataset

Description부산교통공사에서 운영하는 철도운영기관명, 선명, 역명, 지상지하구분, 역층, 상세위치, 소화기종류, 보유대수의 데이터가 있습니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15041470/fileData.do

Alerts

철도운영기관명 has constant value ""Constant
선명 has constant value ""Constant
Dataset has 1 (1.6%) duplicate rowsDuplicates
역명 is highly overall correlated with 지상지하구분High correlation
지상지하구분 is highly overall correlated with 역명High correlation
소화기종류 is highly imbalanced (58.6%)Imbalance
보유대수 has 5 (7.8%) missing valuesMissing

Reproduction

Analysis started2023-12-12 15:38:14.751468
Analysis finished2023-12-12 15:38:15.993941
Duration1.24 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size644.0 B
부산교통공사
64 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산교통공사
2nd row부산교통공사
3rd row부산교통공사
4th row부산교통공사
5th row부산교통공사

Common Values

ValueCountFrequency (%)
부산교통공사 64
100.0%

Length

2023-12-13T00:38:16.089435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:38:16.215900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산교통공사 64
100.0%

선명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size644.0 B
3호선
64 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3호선
2nd row3호선
3rd row3호선
4th row3호선
5th row3호선

Common Values

ValueCountFrequency (%)
3호선 64
100.0%

Length

2023-12-13T00:38:16.317348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:38:16.417793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3호선 64
100.0%

역명
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)23.4%
Missing0
Missing (%)0.0%
Memory size644.0 B
만덕
17 
미남
강서구청
연산
망미
Other values (10)
29 

Length

Max length5
Median length2
Mean length2.46875
Min length2

Unique

Unique1 ?
Unique (%)1.6%

Sample

1st row연산
2nd row연산
3rd row연산
4th row연산
5th row덕천

Common Values

ValueCountFrequency (%)
만덕 17
26.6%
미남 5
 
7.8%
강서구청 5
 
7.8%
연산 4
 
6.2%
망미 4
 
6.2%
사직 4
 
6.2%
체육공원 4
 
6.2%
덕천 3
 
4.7%
배산 3
 
4.7%
종합운동장 3
 
4.7%
Other values (5) 12
18.8%

Length

2023-12-13T00:38:16.551633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
만덕 17
26.6%
미남 5
 
7.8%
강서구청 5
 
7.8%
연산 4
 
6.2%
망미 4
 
6.2%
사직 4
 
6.2%
체육공원 4
 
6.2%
덕천 3
 
4.7%
배산 3
 
4.7%
종합운동장 3
 
4.7%
Other values (5) 12
18.8%

지상지하구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Memory size644.0 B
지하
50 
지상
14 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지하
2nd row지하
3rd row지하
4th row지하
5th row지하

Common Values

ValueCountFrequency (%)
지하 50
78.1%
지상 14
 
21.9%

Length

2023-12-13T00:38:16.720530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:38:16.839555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지하 50
78.1%
지상 14
 
21.9%

역층
Real number (ℝ)

Distinct9
Distinct (%)14.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.6875
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size708.0 B
2023-12-13T00:38:16.951425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q33.25
95-th percentile7.85
Maximum9
Range8
Interquartile range (IQR)2.25

Descriptive statistics

Standard deviation2.144576
Coefficient of variation (CV)0.79798178
Kurtosis1.5614969
Mean2.6875
Median Absolute Deviation (MAD)1
Skewness1.4678309
Sum172
Variance4.5992063
MonotonicityNot monotonic
2023-12-13T00:38:17.100226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
1 27
42.2%
2 11
17.2%
3 10
 
15.6%
4 5
 
7.8%
5 4
 
6.2%
6 2
 
3.1%
8 2
 
3.1%
9 2
 
3.1%
7 1
 
1.6%
ValueCountFrequency (%)
1 27
42.2%
2 11
17.2%
3 10
 
15.6%
4 5
 
7.8%
5 4
 
6.2%
6 2
 
3.1%
7 1
 
1.6%
8 2
 
3.1%
9 2
 
3.1%
ValueCountFrequency (%)
9 2
 
3.1%
8 2
 
3.1%
7 1
 
1.6%
6 2
 
3.1%
5 4
 
6.2%
4 5
 
7.8%
3 10
 
15.6%
2 11
17.2%
1 27
42.2%

상세위치
Categorical

Distinct16
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Memory size644.0 B
대합실
28 
승강장
13 
비상계단
대합실/광장연결통로 등
(B1) 대합실
 
2
Other values (11)
11 

Length

Max length12
Median length3
Mean length4.25
Min length3

Unique

Unique11 ?
Unique (%)17.2%

Sample

1st row대합실 및 기능실
2nd row환승통로
3rd row기능실
4th row승강장
5th row대합실

Common Values

ValueCountFrequency (%)
대합실 28
43.8%
승강장 13
20.3%
비상계단 7
 
10.9%
대합실/광장연결통로 등 3
 
4.7%
(B1) 대합실 2
 
3.1%
대합실 및 기능실 1
 
1.6%
환승통로 1
 
1.6%
기능실 1
 
1.6%
연결통로 1
 
1.6%
(B1) 승강장 1
 
1.6%
Other values (6) 6
 
9.4%

Length

2023-12-13T00:38:17.289332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
대합실 31
40.3%
승강장 17
22.1%
비상계단 7
 
9.1%
대합실/광장연결통로 3
 
3.9%
3
 
3.9%
b1 3
 
3.9%
기능실 2
 
2.6%
4호선 1
 
1.3%
지상 1
 
1.3%
1
 
1.3%
Other values (8) 8
 
10.4%

소화기종류
Categorical

IMBALANCE 

Distinct3
Distinct (%)4.7%
Missing0
Missing (%)0.0%
Memory size644.0 B
분말소화기
55 
이산화탄소소화기
할론소화기
 
1

Length

Max length8
Median length5
Mean length5.375
Min length5

Unique

Unique1 ?
Unique (%)1.6%

Sample

1st row분말소화기
2nd row분말소화기
3rd row분말소화기
4th row분말소화기
5th row분말소화기

Common Values

ValueCountFrequency (%)
분말소화기 55
85.9%
이산화탄소소화기 8
 
12.5%
할론소화기 1
 
1.6%

Length

2023-12-13T00:38:17.456405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:38:17.612142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
분말소화기 55
85.9%
이산화탄소소화기 8
 
12.5%
할론소화기 1
 
1.6%

보유대수
Real number (ℝ)

MISSING 

Distinct29
Distinct (%)49.2%
Missing5
Missing (%)7.8%
Infinite0
Infinite (%)0.0%
Mean14.508475
Minimum1
Maximum79
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size708.0 B
2023-12-13T00:38:17.751061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median8
Q320
95-th percentile47.1
Maximum79
Range78
Interquartile range (IQR)18

Descriptive statistics

Standard deviation16.372151
Coefficient of variation (CV)1.1284544
Kurtosis3.7641191
Mean14.508475
Median Absolute Deviation (MAD)7
Skewness1.8293654
Sum856
Variance268.04734
MonotonicityNot monotonic
2023-12-13T00:38:17.892286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
1 14
21.9%
8 4
 
6.2%
5 4
 
6.2%
9 3
 
4.7%
7 3
 
4.7%
2 3
 
4.7%
18 2
 
3.1%
26 2
 
3.1%
17 2
 
3.1%
32 2
 
3.1%
Other values (19) 20
31.2%
(Missing) 5
 
7.8%
ValueCountFrequency (%)
1 14
21.9%
2 3
 
4.7%
4 1
 
1.6%
5 4
 
6.2%
6 1
 
1.6%
7 3
 
4.7%
8 4
 
6.2%
9 3
 
4.7%
10 1
 
1.6%
11 1
 
1.6%
ValueCountFrequency (%)
79 1
1.6%
60 1
1.6%
48 1
1.6%
47 1
1.6%
41 1
1.6%
40 1
1.6%
38 1
1.6%
32 2
3.1%
26 2
3.1%
24 1
1.6%

Interactions

2023-12-13T00:38:15.440795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:38:15.173344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:38:15.590700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:38:15.308231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:38:18.016596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역명지상지하구분역층상세위치소화기종류보유대수
역명1.0001.0000.0000.6190.1600.755
지상지하구분1.0001.0000.0000.0000.0740.000
역층0.0000.0001.0000.0000.0000.000
상세위치0.6190.0000.0001.0000.4390.000
소화기종류0.1600.0740.0000.4391.0000.000
보유대수0.7550.0000.0000.0000.0001.000
2023-12-13T00:38:18.526847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소화기종류역명지상지하구분상세위치
소화기종류1.0000.0260.1200.230
역명0.0261.0000.8890.245
지상지하구분0.1200.8891.0000.000
상세위치0.2300.2450.0001.000
2023-12-13T00:38:18.647078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역층보유대수역명지상지하구분상세위치소화기종류
역층1.000-0.1040.0000.0000.0000.000
보유대수-0.1041.0000.4190.0000.0000.000
역명0.0000.4191.0000.8890.2450.026
지상지하구분0.0000.0000.8891.0000.0000.120
상세위치0.0000.0000.2450.0001.0000.230
소화기종류0.0000.0000.0260.1200.2301.000

Missing values

2023-12-13T00:38:15.783866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:38:15.937431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관명선명역명지상지하구분역층상세위치소화기종류보유대수
0부산교통공사3호선연산지하1대합실 및 기능실분말소화기16
1부산교통공사3호선연산지하2환승통로분말소화기17
2부산교통공사3호선연산지하3기능실분말소화기6
3부산교통공사3호선연산지하4승강장분말소화기16
4부산교통공사3호선덕천지하1대합실분말소화기10
5부산교통공사3호선덕천지하2대합실분말소화기9
6부산교통공사3호선덕천지하3승강장분말소화기12
7부산교통공사3호선망미지하1대합실분말소화기47
8부산교통공사3호선망미지하5대합실이산화탄소소화기1
9부산교통공사3호선망미지하5연결통로분말소화기8
철도운영기관명선명역명지상지하구분역층상세위치소화기종류보유대수
54부산교통공사3호선강서구청지상3대합실분말소화기<NA>
55부산교통공사3호선강서구청지상4대합실분말소화기<NA>
56부산교통공사3호선강서구청지상5승강장분말소화기<NA>
57부산교통공사3호선체육공원지상1주차장 층분말소화기8
58부산교통공사3호선체육공원지상2대합실분말소화기9
59부산교통공사3호선체육공원지상3대합실분말소화기5
60부산교통공사3호선체육공원지상4승강장분말소화기8
61부산교통공사3호선대저지상3승강장분말소화기7
62부산교통공사3호선대저지상2대합실분말소화기4
63부산교통공사3호선대저지상1지상 1층분말소화기5

Duplicate rows

Most frequently occurring

철도운영기관명선명역명지상지하구분역층상세위치소화기종류보유대수# duplicates
0부산교통공사3호선만덕지하1대합실분말소화기16