Overview

Dataset statistics

Number of variables15
Number of observations209
Missing cells0
Missing cells (%)0.0%
Duplicate rows92
Duplicate rows (%)44.0%
Total size in memory27.3 KiB
Average record size in memory133.6 B

Variable types

Numeric1
Categorical14

Dataset

Description광주교통공사 철도사고 및 운행장애에 대한 데이터로 열차사고, 철도교통사상사고 등 매 달 별 사고 정보를 제공합니다.
Author광주교통공사
URLhttps://www.data.go.kr/data/15061858/fileData.do

Alerts

1월 has constant value ""Constant
2월 has constant value ""Constant
3월 has constant value ""Constant
4월 has constant value ""Constant
5월 has constant value ""Constant
6월 has constant value ""Constant
7월 has constant value ""Constant
10월 has constant value ""Constant
11월 has constant value ""Constant
12월 has constant value ""Constant
Dataset has 92 (44.0%) duplicate rowsDuplicates
세분류 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 세분류High correlation
8월 is highly imbalanced (92.2%)Imbalance
9월 is highly imbalanced (95.6%)Imbalance

Reproduction

Analysis started2024-04-06 08:06:32.312603
Analysis finished2024-04-06 08:06:36.434925
Duration4.12 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Real number (ℝ)

Distinct6
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2020.2727
Minimum2018
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2024-04-06T17:06:36.534741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2018
5-th percentile2018
Q12019
median2020
Q32022
95-th percentile2023
Maximum2023
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.6044627
Coefficient of variation (CV)0.00079418126
Kurtosis-1.1667086
Mean2020.2727
Median Absolute Deviation (MAD)1
Skewness0.083043269
Sum422237
Variance2.5743007
MonotonicityIncreasing
2024-04-06T17:06:36.758543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2018 38
18.2%
2019 38
18.2%
2020 38
18.2%
2021 38
18.2%
2022 38
18.2%
2023 19
9.1%
ValueCountFrequency (%)
2018 38
18.2%
2019 38
18.2%
2020 38
18.2%
2021 38
18.2%
2022 38
18.2%
2023 19
9.1%
ValueCountFrequency (%)
2023 19
9.1%
2022 38
18.2%
2021 38
18.2%
2020 38
18.2%
2019 38
18.2%
2018 38
18.2%

구분
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
철도교통사고
77 
철도안전사고
66 
운행장애피해현황
66 

Length

Max length8
Median length6
Mean length6.6315789
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row철도교통사고
2nd row철도교통사고
3rd row철도교통사고
4th row철도교통사고
5th row철도교통사고

Common Values

ValueCountFrequency (%)
철도교통사고 77
36.8%
철도안전사고 66
31.6%
운행장애피해현황 66
31.6%

Length

2024-04-06T17:06:37.028764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:06:37.265344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
철도교통사고 77
36.8%
철도안전사고 66
31.6%
운행장애피해현황 66
31.6%

세분류
Categorical

HIGH CORRELATION 

Distinct19
Distinct (%)9.1%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
열차사고(충돌)
 
11
열차사고(탈선)
 
11
열차사고(화재)
 
11
열차사고(기타)
 
11
철도교통사상사고(여객)
 
11
Other values (14)
154 

Length

Max length12
Median length9
Mean length8.7894737
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row열차사고(충돌)
2nd row열차사고(탈선)
3rd row열차사고(화재)
4th row열차사고(기타)
5th row철도교통사상사고(여객)

Common Values

ValueCountFrequency (%)
열차사고(충돌) 11
 
5.3%
열차사고(탈선) 11
 
5.3%
열차사고(화재) 11
 
5.3%
열차사고(기타) 11
 
5.3%
철도교통사상사고(여객) 11
 
5.3%
철도교통사상사고(공중) 11
 
5.3%
철도교통사상사고(직원) 11
 
5.3%
철도화재사고 11
 
5.3%
철도안전사상사고(여객) 11
 
5.3%
철도안전사상사고(공중) 11
 
5.3%
Other values (9) 99
47.4%

Length

2024-04-06T17:06:37.549062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
열차사고(충돌 11
 
5.3%
철도안전사상사고(직원 11
 
5.3%
인명피해(경상 11
 
5.3%
인명피해(중상 11
 
5.3%
인명피해(사망 11
 
5.3%
지연운행 11
 
5.3%
위험사건 11
 
5.3%
기타철도안전사고 11
 
5.3%
철도시설파손사고 11
 
5.3%
철도안전사상사고(공중 11
 
5.3%
Other values (9) 99
47.4%

1월
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
0
209 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 209
100.0%

Length

2024-04-06T17:06:37.775659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:06:37.946075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 209
100.0%

2월
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
0
209 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 209
100.0%

Length

2024-04-06T17:06:38.177085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:06:38.377322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 209
100.0%

3월
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
0
209 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 209
100.0%

Length

2024-04-06T17:06:38.559169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:06:38.728784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 209
100.0%

4월
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
0
209 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 209
100.0%

Length

2024-04-06T17:06:38.951165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:06:39.146943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 209
100.0%

5월
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
0
209 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 209
100.0%

Length

2024-04-06T17:06:39.335837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:06:39.508251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 209
100.0%

6월
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
0
209 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 209
100.0%

Length

2024-04-06T17:06:39.698179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:06:39.887018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 209
100.0%

7월
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
0
209 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 209
100.0%

Length

2024-04-06T17:06:40.085928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:06:40.302626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 209
100.0%

8월
Categorical

IMBALANCE 

Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
0
207 
1
 
2

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 207
99.0%
1 2
 
1.0%

Length

2024-04-06T17:06:40.569835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:06:40.750961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 207
99.0%
1 2
 
1.0%

9월
Categorical

IMBALANCE 

Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
0
208 
1
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)0.5%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 208
99.5%
1 1
 
0.5%

Length

2024-04-06T17:06:40.944963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:06:41.150176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 208
99.5%
1 1
 
0.5%

10월
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
0
209 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 209
100.0%

Length

2024-04-06T17:06:41.513079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:06:41.882122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 209
100.0%

11월
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
0
209 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 209
100.0%

Length

2024-04-06T17:06:42.116494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:06:42.303369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 209
100.0%

12월
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
0
209 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 209
100.0%

Length

2024-04-06T17:06:42.485723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:06:42.683148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 209
100.0%

Interactions

2024-04-06T17:06:35.245805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-06T17:06:42.830451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도구분세분류8월9월
연도1.0000.0000.0000.0000.000
구분0.0001.0001.0000.0650.018
세분류0.0001.0001.0000.3490.024
8월0.0000.0650.3491.0000.000
9월0.0000.0180.0240.0001.000
2024-04-06T17:06:43.030858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
9월세분류구분8월
9월1.0000.0000.0280.000
세분류0.0001.0000.9600.296
구분0.0280.9601.0000.107
8월0.0000.2960.1071.000
2024-04-06T17:06:43.218875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도구분세분류8월9월
연도1.0000.0000.0000.0000.000
구분0.0001.0000.9600.1070.028
세분류0.0000.9601.0000.2960.000
8월0.0000.1070.2961.0000.000
9월0.0000.0280.0000.0001.000

Missing values

2024-04-06T17:06:35.889682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T17:06:36.300818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도구분세분류1월2월3월4월5월6월7월8월9월10월11월12월
02018철도교통사고열차사고(충돌)000000000000
12018철도교통사고열차사고(탈선)000000000000
22018철도교통사고열차사고(화재)000000000000
32018철도교통사고열차사고(기타)000000000000
42018철도교통사고철도교통사상사고(여객)000000000000
52018철도교통사고철도교통사상사고(공중)000000000000
62018철도교통사고철도교통사상사고(직원)000000000000
72018철도안전사고철도화재사고000000000000
82018철도안전사고철도안전사상사고(여객)000000000000
92018철도안전사고철도안전사상사고(공중)000000000000
연도구분세분류1월2월3월4월5월6월7월8월9월10월11월12월
1992023철도안전사고철도안전사상사고(공중)000000000000
2002023철도안전사고철도안전사상사고(직원)000000000000
2012023철도안전사고철도시설파손사고000000000000
2022023철도안전사고기타철도안전사고000000000000
2032023운행장애피해현황위험사건000000000000
2042023운행장애피해현황지연운행000000000000
2052023운행장애피해현황인명피해(사망)000000000000
2062023운행장애피해현황인명피해(중상)000000000000
2072023운행장애피해현황인명피해(경상)000000000000
2082023운행장애피해현황재산피해(백만원)000000000000

Duplicate rows

Most frequently occurring

연도구분세분류1월2월3월4월5월6월7월8월9월10월11월12월# duplicates
02018운행장애피해현황위험사건0000000000002
12018운행장애피해현황인명피해(경상)0000000000002
22018운행장애피해현황인명피해(사망)0000000000002
32018운행장애피해현황인명피해(중상)0000000000002
42018운행장애피해현황재산피해(백만원)0000000000002
52018운행장애피해현황지연운행0000000000002
62018철도교통사고열차사고(기타)0000000000002
72018철도교통사고열차사고(충돌)0000000000002
82018철도교통사고열차사고(탈선)0000000000002
92018철도교통사고열차사고(화재)0000000000002