Overview

Dataset statistics

Number of variables6
Number of observations142
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.9 KiB
Average record size in memory49.9 B

Variable types

Categorical3
Numeric1
Text2

Dataset

Description대구2호선에 포함된 도시광역철도역들의 철도운영기관명,선명,역명,출구번호,출구별 주요시설명, 주소 등의 데이터 입니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15068956/fileData.do

Alerts

철도운영기관명 has constant value ""Constant
선명 has constant value ""Constant

Reproduction

Analysis started2023-12-12 11:43:14.802400
Analysis finished2023-12-12 11:43:15.621547
Duration0.82 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
대구교통공사
142 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대구교통공사
2nd row대구교통공사
3rd row대구교통공사
4th row대구교통공사
5th row대구교통공사

Common Values

ValueCountFrequency (%)
대구교통공사 142
100.0%

Length

2023-12-12T20:43:15.733505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:43:15.937690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대구교통공사 142
100.0%

선명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2호선
142 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2호선
2nd row2호선
3rd row2호선
4th row2호선
5th row2호선

Common Values

ValueCountFrequency (%)
2호선 142
100.0%

Length

2023-12-12T20:43:16.132283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:43:16.317007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2호선 142
100.0%

역명
Categorical

Distinct29
Distinct (%)20.4%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
임당
 
8
성서산업단지
 
8
반월당
 
7
계명대
 
7
신매
 
7
Other values (24)
105 

Length

Max length14
Median length2
Mean length3.5070423
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row문양
2nd row문양
3rd row다사
4th row다사
5th row다사

Common Values

ValueCountFrequency (%)
임당 8
 
5.6%
성서산업단지 8
 
5.6%
반월당 7
 
4.9%
계명대 7
 
4.9%
신매 7
 
4.9%
강창 6
 
4.2%
이곡 6
 
4.2%
용산(서부법원·검찰청입구) 6
 
4.2%
두류 6
 
4.2%
영남대 5
 
3.5%
Other values (19) 76
53.5%

Length

2023-12-12T20:43:16.535321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
임당 8
 
5.6%
성서산업단지 8
 
5.6%
반월당 7
 
4.9%
계명대 7
 
4.9%
신매 7
 
4.9%
강창 6
 
4.2%
이곡 6
 
4.2%
용산(서부법원·검찰청입구 6
 
4.2%
두류 6
 
4.2%
청라언덕 5
 
3.5%
Other values (19) 76
53.5%

출구번호
Real number (ℝ)

Distinct15
Distinct (%)10.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.7535211
Minimum1
Maximum22
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2023-12-12T20:43:16.744719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q34
95-th percentile9.9
Maximum22
Range21
Interquartile range (IQR)2

Descriptive statistics

Standard deviation3.2793933
Coefficient of variation (CV)0.87368451
Kurtosis11.954881
Mean3.7535211
Median Absolute Deviation (MAD)1
Skewness2.9620933
Sum533
Variance10.75442
MonotonicityNot monotonic
2023-12-12T20:43:16.933752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
1 29
20.4%
2 27
19.0%
4 27
19.0%
3 26
18.3%
5 11
 
7.7%
6 7
 
4.9%
7 4
 
2.8%
8 3
 
2.1%
12 2
 
1.4%
10 1
 
0.7%
Other values (5) 5
 
3.5%
ValueCountFrequency (%)
1 29
20.4%
2 27
19.0%
3 26
18.3%
4 27
19.0%
5 11
 
7.7%
6 7
 
4.9%
7 4
 
2.8%
8 3
 
2.1%
10 1
 
0.7%
11 1
 
0.7%
ValueCountFrequency (%)
22 1
 
0.7%
21 1
 
0.7%
14 1
 
0.7%
13 1
 
0.7%
12 2
 
1.4%
11 1
 
0.7%
10 1
 
0.7%
8 3
2.1%
7 4
2.8%
6 7
4.9%
Distinct137
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-12T20:43:17.376631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length11
Mean length6.4295775
Min length2

Characters and Unicode

Total characters913
Distinct characters170
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique132 ?
Unique (%)93.0%

Sample

1st row다사읍 문양리
2nd row다사읍 부곡리
3rd row다사초등학교
4th row다사보건지소
5th row다사읍사무소
ValueCountFrequency (%)
계명대학교 3
 
2.0%
대구은행 3
 
2.0%
남산2동주민센터 2
 
1.3%
북부동주민센터 2
 
1.3%
성서농협 2
 
1.3%
다사읍 2
 
1.3%
연호동 2
 
1.3%
범안로 2
 
1.3%
남부시외버스터미널 2
 
1.3%
임당동 1
 
0.7%
Other values (130) 130
86.1%
2023-12-12T20:43:18.055809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
41
 
4.5%
32
 
3.5%
31
 
3.4%
30
 
3.3%
29
 
3.2%
25
 
2.7%
25
 
2.7%
23
 
2.5%
23
 
2.5%
23
 
2.5%
Other values (160) 631
69.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 870
95.3%
Decimal Number 29
 
3.2%
Space Separator 9
 
1.0%
Close Punctuation 2
 
0.2%
Open Punctuation 2
 
0.2%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
41
 
4.7%
32
 
3.7%
31
 
3.6%
30
 
3.4%
29
 
3.3%
25
 
2.9%
25
 
2.9%
23
 
2.6%
23
 
2.6%
23
 
2.6%
Other values (151) 588
67.6%
Decimal Number
ValueCountFrequency (%)
1 12
41.4%
3 6
20.7%
2 6
20.7%
4 3
 
10.3%
9 2
 
6.9%
Space Separator
ValueCountFrequency (%)
9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 870
95.3%
Common 43
 
4.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
41
 
4.7%
32
 
3.7%
31
 
3.6%
30
 
3.4%
29
 
3.3%
25
 
2.9%
25
 
2.9%
23
 
2.6%
23
 
2.6%
23
 
2.6%
Other values (151) 588
67.6%
Common
ValueCountFrequency (%)
1 12
27.9%
9
20.9%
3 6
14.0%
2 6
14.0%
4 3
 
7.0%
) 2
 
4.7%
9 2
 
4.7%
( 2
 
4.7%
/ 1
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 870
95.3%
ASCII 43
 
4.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
41
 
4.7%
32
 
3.7%
31
 
3.6%
30
 
3.4%
29
 
3.3%
25
 
2.9%
25
 
2.9%
23
 
2.6%
23
 
2.6%
23
 
2.6%
Other values (151) 588
67.6%
ASCII
ValueCountFrequency (%)
1 12
27.9%
9
20.9%
3 6
14.0%
2 6
14.0%
4 3
 
7.0%
) 2
 
4.7%
9 2
 
4.7%
( 2
 
4.7%
/ 1
 
2.3%

주소
Text

Distinct125
Distinct (%)88.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-12T20:43:18.595765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17.5
Mean length13.84507
Min length9

Characters and Unicode

Total characters1966
Distinct characters106
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique112 ?
Unique (%)78.9%

Sample

1st row대구 달성군 다사읍 문양리
2nd row대구 달성군 다사읍 부곡리
3rd row대구 달성군 다사읍 다사로86
4th row대구 달성군 다사읍 매곡로7
5th row대구 달성군 다사읍 매곡로7
ValueCountFrequency (%)
대구 124
25.2%
달서구 46
 
9.3%
수성구 43
 
8.7%
경북 18
 
3.7%
경산시 18
 
3.7%
중구 17
 
3.5%
달구벌대로 14
 
2.8%
달성군 10
 
2.0%
다사읍 10
 
2.0%
서구 8
 
1.6%
Other values (159) 184
37.4%
2023-12-12T20:43:19.433613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
350
17.8%
274
 
13.9%
168
 
8.5%
92
 
4.7%
88
 
4.5%
1 64
 
3.3%
62
 
3.2%
60
 
3.1%
53
 
2.7%
2 46
 
2.3%
Other values (96) 709
36.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1267
64.4%
Space Separator 350
 
17.8%
Decimal Number 340
 
17.3%
Dash Punctuation 9
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
274
21.6%
168
13.3%
92
 
7.3%
88
 
6.9%
62
 
4.9%
60
 
4.7%
53
 
4.2%
45
 
3.6%
36
 
2.8%
35
 
2.8%
Other values (84) 354
27.9%
Decimal Number
ValueCountFrequency (%)
1 64
18.8%
2 46
13.5%
0 35
10.3%
7 34
10.0%
3 32
9.4%
5 30
8.8%
9 29
8.5%
4 29
8.5%
6 21
 
6.2%
8 20
 
5.9%
Space Separator
ValueCountFrequency (%)
350
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1267
64.4%
Common 699
35.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
274
21.6%
168
13.3%
92
 
7.3%
88
 
6.9%
62
 
4.9%
60
 
4.7%
53
 
4.2%
45
 
3.6%
36
 
2.8%
35
 
2.8%
Other values (84) 354
27.9%
Common
ValueCountFrequency (%)
350
50.1%
1 64
 
9.2%
2 46
 
6.6%
0 35
 
5.0%
7 34
 
4.9%
3 32
 
4.6%
5 30
 
4.3%
9 29
 
4.1%
4 29
 
4.1%
6 21
 
3.0%
Other values (2) 29
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1267
64.4%
ASCII 699
35.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
350
50.1%
1 64
 
9.2%
2 46
 
6.6%
0 35
 
5.0%
7 34
 
4.9%
3 32
 
4.6%
5 30
 
4.3%
9 29
 
4.1%
4 29
 
4.1%
6 21
 
3.0%
Other values (2) 29
 
4.1%
Hangul
ValueCountFrequency (%)
274
21.6%
168
13.3%
92
 
7.3%
88
 
6.9%
62
 
4.9%
60
 
4.7%
53
 
4.2%
45
 
3.6%
36
 
2.8%
35
 
2.8%
Other values (84) 354
27.9%

Interactions

2023-12-12T20:43:15.150325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:43:19.606616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역명출구번호
역명1.0000.000
출구번호0.0001.000
2023-12-12T20:43:19.759932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출구번호역명
출구번호1.0000.000
역명0.0001.000

Missing values

2023-12-12T20:43:15.341187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:43:15.556948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관명선명역명출구번호출구별 주요시설명주소
0대구교통공사2호선문양1다사읍 문양리대구 달성군 다사읍 문양리
1대구교통공사2호선문양1다사읍 부곡리대구 달성군 다사읍 부곡리
2대구교통공사2호선다사1다사초등학교대구 달성군 다사읍 다사로86
3대구교통공사2호선다사2다사보건지소대구 달성군 다사읍 매곡로7
4대구교통공사2호선다사3다사읍사무소대구 달성군 다사읍 매곡로7
5대구교통공사2호선다사4다사고등학교대구 달성군 다사읍 달구벌대로839
6대구교통공사2호선대실1대구은행다사지점대구 달성군 다사읍 달구벌대로879
7대구교통공사2호선대실2죽곡초등학교대구 달성군 다사읍 죽곡1길 18
8대구교통공사2호선대실3강창교대구 달성군 다사읍 죽곡리
9대구교통공사2호선대실4한솔병원대구 달성군 다사읍 달구벌대로895
철도운영기관명선명역명출구번호출구별 주요시설명주소
132대구교통공사2호선임당4경산경찰서경북 경산시 원효로 68
133대구교통공사2호선임당5북부동주민센터경북 경산시 중방로 125
134대구교통공사2호선임당6경북지방경찰청기동대경북 경산시 계양동
135대구교통공사2호선임당7임당초등학교경북 경산시 임당로12길 7
136대구교통공사2호선임당8임당동경북 경산시 임당동
137대구교통공사2호선영남대1경북지방경찰철기동대경북 경산시 계양동
138대구교통공사2호선영남대2북부동주민센터경북 경산시 중방로125
139대구교통공사2호선영남대3북부동경북 경산시 북부동
140대구교통공사2호선영남대4영남대학교경북 경산시 대학로 280
141대구교통공사2호선영남대5압량우체국경북 경산시 압량면 부적길35-4