Overview

Dataset statistics

Number of variables21
Number of observations25
Missing cells265
Missing cells (%)50.5%
Duplicate rows5
Duplicate rows (%)20.0%
Total size in memory4.2 KiB
Average record size in memory173.1 B

Variable types

Text1
Unsupported20

Dataset

DescriptionKTX 경부고속철도노선(서울-부산) 및 호남고속철도노선(서울-목포) 2023년 12월까지 이용인원 및 운임 통계 데이터 입니다.
Author한국철도공사
URLhttps://www.data.go.kr/data/15119333/fileData.do

Alerts

Dataset has 5 (20.0%) duplicate rowsDuplicates
노선별 KTX 운행횟수 has 9 (36.0%) missing valuesMissing
Unnamed: 1 has 13 (52.0%) missing valuesMissing
Unnamed: 2 has 13 (52.0%) missing valuesMissing
Unnamed: 3 has 13 (52.0%) missing valuesMissing
Unnamed: 4 has 13 (52.0%) missing valuesMissing
Unnamed: 5 has 13 (52.0%) missing valuesMissing
Unnamed: 6 has 13 (52.0%) missing valuesMissing
Unnamed: 7 has 13 (52.0%) missing valuesMissing
Unnamed: 8 has 13 (52.0%) missing valuesMissing
Unnamed: 9 has 13 (52.0%) missing valuesMissing
Unnamed: 10 has 13 (52.0%) missing valuesMissing
Unnamed: 11 has 13 (52.0%) missing valuesMissing
Unnamed: 12 has 13 (52.0%) missing valuesMissing
Unnamed: 13 has 13 (52.0%) missing valuesMissing
Unnamed: 14 has 13 (52.0%) missing valuesMissing
Unnamed: 15 has 13 (52.0%) missing valuesMissing
Unnamed: 16 has 13 (52.0%) missing valuesMissing
Unnamed: 17 has 13 (52.0%) missing valuesMissing
Unnamed: 18 has 13 (52.0%) missing valuesMissing
Unnamed: 19 has 13 (52.0%) missing valuesMissing
Unnamed: 20 has 9 (36.0%) missing valuesMissing
Unnamed: 1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 16 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 17 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 18 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 19 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 20 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-13 12:57:16.027451
Analysis finished2024-04-13 12:57:16.987457
Duration0.96 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct10
Distinct (%)62.5%
Missing9
Missing (%)36.0%
Memory size328.0 B
2024-04-13T21:57:17.390225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10.5
Mean length6.8125
Min length2

Characters and Unicode

Total characters109
Distinct characters40
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6 ?
Unique (%)37.5%

Sample

1st row주중(화요일)기준
2nd row구분
3rd row경부선
4th row호남선
5th row연도말 주말(토요일)기준
ValueCountFrequency (%)
구분 4
18.2%
경부선 2
 
9.1%
호남선 2
 
9.1%
경부선(서울-부산 2
 
9.1%
ktx 2
 
9.1%
주중(화요일)기준 1
 
4.5%
연도말 1
 
4.5%
주말(토요일)기준 1
 
4.5%
노선별 1
 
4.5%
운임 1
 
4.5%
Other values (5) 5
22.7%
2024-04-13T21:57:18.365058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9
 
8.3%
) 6
 
5.5%
6
 
5.5%
( 6
 
5.5%
6
 
5.5%
5
 
4.6%
5
 
4.6%
4
 
3.7%
4
 
3.7%
4
 
3.7%
Other values (30) 54
49.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 80
73.4%
Close Punctuation 6
 
5.5%
Open Punctuation 6
 
5.5%
Space Separator 6
 
5.5%
Uppercase Letter 6
 
5.5%
Dash Punctuation 4
 
3.7%
Other Punctuation 1
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
11.2%
6
 
7.5%
5
 
6.2%
5
 
6.2%
4
 
5.0%
4
 
5.0%
4
 
5.0%
4
 
5.0%
3
 
3.8%
3
 
3.8%
Other values (22) 33
41.2%
Uppercase Letter
ValueCountFrequency (%)
X 2
33.3%
T 2
33.3%
K 2
33.3%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Space Separator
ValueCountFrequency (%)
6
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 80
73.4%
Common 23
 
21.1%
Latin 6
 
5.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
 
11.2%
6
 
7.5%
5
 
6.2%
5
 
6.2%
4
 
5.0%
4
 
5.0%
4
 
5.0%
4
 
5.0%
3
 
3.8%
3
 
3.8%
Other values (22) 33
41.2%
Common
ValueCountFrequency (%)
) 6
26.1%
( 6
26.1%
6
26.1%
- 4
17.4%
/ 1
 
4.3%
Latin
ValueCountFrequency (%)
X 2
33.3%
T 2
33.3%
K 2
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 80
73.4%
ASCII 29
 
26.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9
 
11.2%
6
 
7.5%
5
 
6.2%
5
 
6.2%
4
 
5.0%
4
 
5.0%
4
 
5.0%
4
 
5.0%
3
 
3.8%
3
 
3.8%
Other values (22) 33
41.2%
ASCII
ValueCountFrequency (%)
) 6
20.7%
( 6
20.7%
6
20.7%
- 4
13.8%
X 2
 
6.9%
T 2
 
6.9%
K 2
 
6.9%
/ 1
 
3.4%

Unnamed: 1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing13
Missing (%)52.0%
Memory size328.0 B

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing13
Missing (%)52.0%
Memory size328.0 B

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing13
Missing (%)52.0%
Memory size328.0 B

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing13
Missing (%)52.0%
Memory size328.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing13
Missing (%)52.0%
Memory size328.0 B

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing13
Missing (%)52.0%
Memory size328.0 B

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing13
Missing (%)52.0%
Memory size328.0 B

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing13
Missing (%)52.0%
Memory size328.0 B

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing13
Missing (%)52.0%
Memory size328.0 B

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing13
Missing (%)52.0%
Memory size328.0 B

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing13
Missing (%)52.0%
Memory size328.0 B

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing13
Missing (%)52.0%
Memory size328.0 B

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing13
Missing (%)52.0%
Memory size328.0 B

Unnamed: 14
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing13
Missing (%)52.0%
Memory size328.0 B

Unnamed: 15
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing13
Missing (%)52.0%
Memory size328.0 B

Unnamed: 16
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing13
Missing (%)52.0%
Memory size328.0 B

Unnamed: 17
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing13
Missing (%)52.0%
Memory size328.0 B

Unnamed: 18
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing13
Missing (%)52.0%
Memory size328.0 B

Unnamed: 19
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing13
Missing (%)52.0%
Memory size328.0 B

Unnamed: 20
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing9
Missing (%)36.0%
Memory size328.0 B

Sample

노선별 KTX 운행횟수Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 20
0<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
1주중(화요일)기준NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN(단위 : 회)
2구분20042005200620072008200920102011201220132014201520162017201820192020202120222023
3경부선9610010410010410612010013012611911999105105105105105105111
4호남선3636363636363838424242645656565656545452
5<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
6<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
7연도말 주말(토요일)기준NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN(단위 : 회)
8구분20042005200620072008200920102011201220132014201520162017201820192020202120222023
9경부선104122126134143143154120152150143139122122122126126126125131
노선별 KTX 운행횟수Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 20
15구분2004년2005년2006년2007년2008년2009년2010년2011년2012년2013년2014년2015년2016년2017년2018년2019년2020년2021년2022년2023년
16경부선(서울-부산)4500044800481004980051200512005550057300573005730057300598005980059800598005980059800598005980059800
17호남선(용산-목포)4110038000407004330043300433004330044700447004470044700528005280052800528005280052800528005280052800
18<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
19<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
20구간별 KTX 이용객NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
21<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN(단위 : 천명)
22구분2004년4월~2005년2006년2007년2008년2009년2010년2011년2012년2013년2014년2015년2016년2017년2018년2019년2020년2021년2022년2023년
23경부선(서울-부산)41296344656862936137589364337429741974187529722771495466572460322854273048796686
24호남선(서울/용산-목포)303558626633688642632627548540508632720661693744471533733832

Duplicate rows

Most frequently occurring

노선별 KTX 운행횟수# duplicates
4<NA>9
2구분4
0경부선2
1경부선(서울-부산)2
3호남선2