Overview

Dataset statistics

Number of variables: 3
Number of observations: 22
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 704.0 B
Average record size in memory: 32.0 B

Variable types

Numeric: 2
Text: 1

Dataset

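Description: Data on the exits at each station of the urban railway managed by 대전교통공사 (Daejeon Transportation Corporation), listing the station number, station name, and number of exits.
Author: 대전교통공사 (Daejeon Transportation Corporation)
URL: https://www.data.go.kr/data/15043917/fileData.do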

Alerts

역번호 (station number) has unique values: Unique
역사 (station) has unique values: Unique

Reproduction

Analysis started: 2023-12-12 09:11:43.397704
Analysis finished: 2023-12-12 09:11:44.102462
Duration: 0.7 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

역번호
Real number (ℝ)

UNIQUE 

Distinct: 22
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 111.5
Minimum: 101
Maximum: 122
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 330.0 B

Quantile statistics

Minimum: 101
5th percentile: 102.05
Q1: 106.25
Median: 111.5
Q3: 116.75
95th percentile: 120.95
Maximum: 122
Range: 21
Interquartile range (IQR): 10.5
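The quantile figures above can be checked by hand. The sketch below assumes the 22 station numbers are the consecutive integers 101 to 122, which is inferred from the reported minimum, maximum, count, and uniqueness rather than stated outright. It uses the standard library's `statistics.quantiles` with `method="inclusive"`, which interpolates linearly between neighbouring values; this is an illustration, not necessarily how the profiling tool computes them.

```python
import statistics

# Assumption: 22 unique integer station numbers spanning 101..122,
# so the values must be exactly 101, 102, ..., 122.
station_numbers = list(range(101, 123))

# "inclusive" treats the data as the full population and interpolates
# linearly between the minimum and the maximum.
q1, median, q3 = statistics.quantiles(station_numbers, n=4, method="inclusive")
twentieths = statistics.quantiles(station_numbers, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

value_range = max(station_numbers) - min(station_numbers)
iqr = q3 - q1

print("Q1, median, Q3:", q1, median, q3)
print("5th, 95th percentile:", p5, p95)
print("range, IQR:", value_range, iqr)
```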

Descriptive statistics

Standard deviation: 6.4935866
Coefficient of variation (CV): 0.058238445
Kurtosis: -1.2
Mean: 111.5
Median Absolute Deviation (MAD): 5.5
Skewness: 0
Sum: 2453
Variance: 42.166667
Monotonicity: Strictly increasing
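As a cross-check of the descriptive statistics, the following sketch recomputes them with Python's standard library. It again assumes the station numbers are the integers 101 to 122 (inferred from the summary, not stated in the report), and it uses the sample variance with an n - 1 denominator, which is the convention that reproduces the reported 42.166667.

```python
import statistics

# Assumption: the 22 station numbers are the consecutive integers 101..122.
station_numbers = list(range(101, 123))

total = sum(station_numbers)
mean = statistics.mean(station_numbers)
variance = statistics.variance(station_numbers)  # sample variance (n - 1)
std_dev = statistics.stdev(station_numbers)      # square root of the above
cv = std_dev / mean                              # coefficient of variation

# Median absolute deviation: median of the distances from the median.
med = statistics.median(station_numbers)
mad = statistics.median(abs(x - med) for x in station_numbers)

# Strictly increasing: every value is larger than the one before it.
strictly_increasing = all(
    a < b for a, b in zip(station_numbers, station_numbers[1:])
)

print(total, mean, round(variance, 6), round(std_dev, 7), round(cv, 9), mad)
print("strictly increasing:", strictly_increasing)
```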
Histogram with fixed size bins (bins=22)
Value              Count  Frequency (%)
101                1      4.5%
113                1      4.5%
122                1      4.5%
121                1      4.5%
120                1      4.5%
119                1      4.5%
118                1      4.5%
117                1      4.5%
116                1      4.5%
115                1      4.5%
Other values (12)  12     54.5%

Value  Count  Frequency (%)
101    1      4.5%
102    1      4.5%
103    1      4.5%
104    1      4.5%
105    1      4.5%
106    1      4.5%
107    1      4.5%
108    1      4.5%
109    1      4.5%
110    1      4.5%

Value  Count  Frequency (%)
122    1      4.5%
121    1      4.5%
120    1      4.5%
119    1      4.5%
118    1      4.5%
117    1      4.5%
116    1      4.5%
115    1      4.5%
114    1      4.5%
113    1      4.5%

역사
Text

UNIQUE 

Distinct: 22
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 308.0 B

Length

Max length: 6
Median length: 2
Mean length: 2.6818182
Min length: 2

Characters and Unicode

Total characters: 59
Distinct characters: 49
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
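To illustrate what those character properties look like, here is a minimal sketch using Python's standard `unicodedata` module. It covers only the 20 station names that appear in the sample rows of this report (two of the 22 rows are not shown), so its totals will not match the full-column figures. The general category `Lo` is the one displayed as "Other Letter", and the character-name prefix is used here as a simple stand-in for a script check.

```python
import unicodedata

# The 20 station names visible in this report's sample rows
# (the report holds 22 rows; two are not shown in the sample).
names = [
    "판암", "신흥", "대동", "대전", "중앙로", "중구청", "서대전네거리",
    "오룡", "용문", "탄방", "갈마", "월평", "갑천", "유성온천", "구암",
    "현충원", "월드컵경기장", "노은", "지족", "반석",
]

chars = "".join(names)

# General category of every character; "Lo" means "Letter, other".
categories = {unicodedata.category(ch) for ch in chars}

# Precomposed Hangul syllables all carry a "HANGUL SYLLABLE ..." name.
all_hangul = all(
    unicodedata.name(ch).startswith("HANGUL SYLLABLE") for ch in chars
)

lengths = [len(name) for name in names]
print("categories:", categories)
print("all Hangul syllables:", all_hangul)
print("min/max length:", min(lengths), max(lengths))
```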

Unique

Unique: 22
Unique (%): 100.0%

Sample

1st row: 판암
2nd row: 신흥
3rd row: 대동
4th row: 대전
5th row: 중앙로
Value              Count  Frequency (%)
판암               1      4.5%
신흥               1      4.5%
지족               1      4.5%
노은               1      4.5%
월드컵경기장       1      4.5%
현충원             1      4.5%
구암               1      4.5%
유성온천           1      4.5%
갑천               1      4.5%
월평               1      4.5%
Other values (12)  12     54.5%

Most occurring characters

Value              Count  Frequency (%)
(lost)             3      5.1%
(lost)             3      5.1%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             1      1.7%
(lost)             1      1.7%
Other values (39)  39     66.1%

Most occurring categories

Value         Count  Frequency (%)
Other Letter  59     100.0%

Most frequent character per category

Other Letter
Value              Count  Frequency (%)
(lost)             3      5.1%
(lost)             3      5.1%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             1      1.7%
(lost)             1      1.7%
Other values (39)  39     66.1%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  59     100.0%

Most frequent character per script

Hangul
Value              Count  Frequency (%)
(lost)             3      5.1%
(lost)             3      5.1%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             1      1.7%
(lost)             1      1.7%
Other values (39)  39     66.1%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  59     100.0%

Most frequent character per block

Hangul
Value              Count  Frequency (%)
(lost)             3      5.1%
(lost)             3      5.1%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             2      3.4%
(lost)             1      1.7%
(lost)             1      1.7%
Other values (39)  39     66.1%

출입구 개수
Real number (ℝ)

Distinct: 8
Distinct (%): 36.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.4090909
Minimum: 2
Maximum: 9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 330.0 B

Quantile statistics

Minimum: 2
5th percentile: 3
Q1: 4
Median: 4
Q3: 8
95th percentile: 8
Maximum: 9
Range: 7
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 2.152719
Coefficient of variation (CV): 0.39798167
Kurtosis: -1.4705504
Mean: 5.4090909
Median Absolute Deviation (MAD): 1
Skewness: 0.32584277
Sum: 119
Variance: 4.6341991
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=8)
Value  Count  Frequency (%)
4      9      40.9%
8      6      27.3%
3      2      9.1%
9      1      4.5%
5      1      4.5%
7      1      4.5%
2      1      4.5%
6      1      4.5%

Value  Count  Frequency (%)
2      1      4.5%
3      2      9.1%
4      9      40.9%
5      1      4.5%
6      1      4.5%
7      1      4.5%
8      6      27.3%
9      1      4.5%

Value  Count  Frequency (%)
9      1      4.5%
8      6      27.3%
7      1      4.5%
6      1      4.5%
5      1      4.5%
4      9      40.9%
3      2      9.1%
2      1      4.5%
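The value counts for 출입구 개수 (number of exits) are enough to rebuild the whole column up to ordering, so the summary figures for this variable can be recomputed from them. The sketch below is one way to do that with the standard library; the `exit_counts` dictionary is transcribed from the frequency table, and the "inclusive" quantile method and n - 1 variance are assumptions chosen because they reproduce the reported numbers.

```python
import statistics

# value -> count, transcribed from the frequency table for 출입구 개수
exit_counts = {4: 9, 8: 6, 3: 2, 9: 1, 5: 1, 7: 1, 2: 1, 6: 1}

# Expand the counts back into a sorted list of 22 observations.
values = sorted(v for v, count in exit_counts.items() for _ in range(count))

n = len(values)
total = sum(values)
mean = total / n
variance = statistics.variance(values)  # sample variance (n - 1)
std_dev = statistics.stdev(values)
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
mad = statistics.median(abs(v - median) for v in values)

# Share of observations per value, and share of distinct values.
frequency_pct = {v: round(100 * c / n, 1) for v, c in exit_counts.items()}
distinct_pct = round(100 * len(exit_counts) / n, 1)

print(n, total, round(mean, 7), round(variance, 7), round(std_dev, 6))
print(q1, median, q3, mad)
print(frequency_pct, distinct_pct)
```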

Interactions

[Figures: interaction plots for the numeric variables 역번호 and 출입구 개수]

Correlations

[Figure: correlation heatmap]

             역번호  역사   출입구 개수
역번호       1.000   1.000  0.269
역사         1.000   1.000  1.000
출입구 개수  0.269   1.000  1.000

[Figure: correlation heatmap]

             역번호  출입구 개수
역번호       1.000   -0.303
출입구 개수  -0.303  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Row  역번호  역사          출입구 개수
0    101     판암          4
1    102     신흥          4
2    103     대동          8
3    104     대전          4
4    105     중앙로        9
5    106     중구청        4
6    107     서대전네거리  8
7    108     오룡          8
8    109     용문          8
9    110     탄방          5

Row  역번호  역사          출입구 개수
12   113     갈마          4
13   114     월평          4
14   115     갑천          3
15   116     유성온천      8
16   117     구암          3
17   118     현충원        4
18   119     월드컵경기장  7
19   120     노은          4
20   121     지족          2
21   122     반석          6