Overview

Dataset statistics

Number of variables5
Number of observations91
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.9 KiB
Average record size in memory43.5 B

Variable types

Categorical2
Text2
Numeric1

Dataset

Description대구교통공사 1, 2, 3호선 각 역사별 지진 옥외 대피소 현황에 대한 데이터로 시설명 및 출구번호, 거리(미터) 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15119032/fileData.do

Alerts

거리(미터) has 2 (2.2%) zerosZeros

Reproduction

Analysis started2023-12-12 07:33:30.735174
Analysis finished2023-12-12 07:33:31.433343
Duration0.7 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

호선
Categorical

Distinct3
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size860.0 B
1
32 
3
30 
2
29 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 32
35.2%
3 30
33.0%
2 29
31.9%

Length

2023-12-12T16:33:31.510839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:33:31.622248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 32
35.2%
3 30
33.0%
2 29
31.9%

역명
Text

Distinct88
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Memory size860.0 B
2023-12-12T16:33:31.922500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length3
Mean length3.3076923
Min length2

Characters and Unicode

Total characters301
Distinct characters111
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique85 ?
Unique (%)93.4%

Sample

1st row설화명곡
2nd row화원
3rd row대 곡
4th row진 천
5th row월 배
ValueCountFrequency (%)
5
 
3.5%
5
 
3.5%
4
 
2.8%
4
 
2.8%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
2
 
1.4%
2
 
1.4%
Other values (95) 109
76.2%
2023-12-12T16:33:32.441328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
52
 
17.3%
15
 
5.0%
8
 
2.7%
7
 
2.3%
7
 
2.3%
7
 
2.3%
6
 
2.0%
6
 
2.0%
5
 
1.7%
5
 
1.7%
Other values (101) 183
60.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 249
82.7%
Space Separator 52
 
17.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15
 
6.0%
8
 
3.2%
7
 
2.8%
7
 
2.8%
7
 
2.8%
6
 
2.4%
6
 
2.4%
5
 
2.0%
5
 
2.0%
5
 
2.0%
Other values (100) 178
71.5%
Space Separator
ValueCountFrequency (%)
52
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 249
82.7%
Common 52
 
17.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15
 
6.0%
8
 
3.2%
7
 
2.8%
7
 
2.8%
7
 
2.8%
6
 
2.4%
6
 
2.4%
5
 
2.0%
5
 
2.0%
5
 
2.0%
Other values (100) 178
71.5%
Common
ValueCountFrequency (%)
52
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 249
82.7%
ASCII 52
 
17.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
52
100.0%
Hangul
ValueCountFrequency (%)
15
 
6.0%
8
 
3.2%
7
 
2.8%
7
 
2.8%
7
 
2.8%
6
 
2.4%
6
 
2.4%
5
 
2.0%
5
 
2.0%
5
 
2.0%
Other values (100) 178
71.5%
Distinct84
Distinct (%)92.3%
Missing0
Missing (%)0.0%
Memory size860.0 B
2023-12-12T16:33:32.693680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length3
Mean length4.1758242
Min length3

Characters and Unicode

Total characters380
Distinct characters121
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique77 ?
Unique (%)84.6%

Sample

1st row화원고
2nd row화원초
3rd row대진고
4th row진천역환승주차장
5th row월배초
ValueCountFrequency (%)
동일초 2
 
2.0%
명덕초 2
 
2.0%
공영주차장 2
 
2.0%
서대구초 2
 
2.0%
남산초 2
 
2.0%
인도 2
 
2.0%
대구초 2
 
2.0%
경명여고 2
 
2.0%
대구체육고 2
 
2.0%
내서초 1
 
1.0%
Other values (82) 82
81.2%
2023-12-12T16:33:33.118594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
45
 
11.8%
19
 
5.0%
19
 
5.0%
17
 
4.5%
16
 
4.2%
15
 
3.9%
10
 
2.6%
8
 
2.1%
8
 
2.1%
8
 
2.1%
Other values (111) 215
56.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 365
96.1%
Space Separator 10
 
2.6%
Decimal Number 3
 
0.8%
Other Punctuation 2
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
45
 
12.3%
19
 
5.2%
19
 
5.2%
17
 
4.7%
16
 
4.4%
15
 
4.1%
8
 
2.2%
8
 
2.2%
8
 
2.2%
7
 
1.9%
Other values (107) 203
55.6%
Decimal Number
ValueCountFrequency (%)
3 2
66.7%
2 1
33.3%
Space Separator
ValueCountFrequency (%)
10
100.0%
Other Punctuation
ValueCountFrequency (%)
· 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 365
96.1%
Common 15
 
3.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
45
 
12.3%
19
 
5.2%
19
 
5.2%
17
 
4.7%
16
 
4.4%
15
 
4.1%
8
 
2.2%
8
 
2.2%
8
 
2.2%
7
 
1.9%
Other values (107) 203
55.6%
Common
ValueCountFrequency (%)
10
66.7%
3 2
 
13.3%
· 2
 
13.3%
2 1
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 365
96.1%
ASCII 13
 
3.4%
None 2
 
0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
45
 
12.3%
19
 
5.2%
19
 
5.2%
17
 
4.7%
16
 
4.4%
15
 
4.1%
8
 
2.2%
8
 
2.2%
8
 
2.2%
7
 
1.9%
Other values (107) 203
55.6%
ASCII
ValueCountFrequency (%)
10
76.9%
3 2
 
15.4%
2 1
 
7.7%
None
ValueCountFrequency (%)
· 2
100.0%

출구번호
Categorical

Distinct14
Distinct (%)15.4%
Missing0
Missing (%)0.0%
Memory size860.0 B
1
29 
3
21 
2
19 
4
7
Other values (9)
10 

Length

Max length5
Median length1
Mean length1.1428571
Min length1

Unique

Unique8 ?
Unique (%)8.8%

Sample

1st row3
2nd row2
3rd row3
4th row2
5th row1

Common Values

ValueCountFrequency (%)
1 29
31.9%
3 21
23.1%
2 19
20.9%
4 9
 
9.9%
7 3
 
3.3%
5 2
 
2.2%
8 1
 
1.1%
1·2 1
 
1.1%
3·4·5 1
 
1.1%
1·7 1
 
1.1%
Other values (4) 4
 
4.4%

Length

2023-12-12T16:33:33.263625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1 29
31.9%
3 21
23.1%
2 19
20.9%
4 9
 
9.9%
7 3
 
3.3%
5 2
 
2.2%
8 1
 
1.1%
1·2 1
 
1.1%
3·4·5 1
 
1.1%
1·7 1
 
1.1%
Other values (4) 4
 
4.4%

거리(미터)
Real number (ℝ)

ZEROS 

Distinct55
Distinct (%)60.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean348.18681
Minimum0
Maximum1600
Zeros2
Zeros (%)2.2%
Negative0
Negative (%)0.0%
Memory size951.0 B
2023-12-12T16:33:33.443923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile50
Q1174
median280
Q3500
95-th percentile793
Maximum1600
Range1600
Interquartile range (IQR)326

Descriptive statistics

Standard deviation265.56702
Coefficient of variation (CV)0.76271418
Kurtosis4.1755926
Mean348.18681
Median Absolute Deviation (MAD)180
Skewness1.495386
Sum31685
Variance70525.842
MonotonicityNot monotonic
2023-12-12T16:33:33.948000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100 7
 
7.7%
50 6
 
6.6%
300 5
 
5.5%
200 5
 
5.5%
600 4
 
4.4%
500 4
 
4.4%
700 3
 
3.3%
10 2
 
2.2%
240 2
 
2.2%
400 2
 
2.2%
Other values (45) 51
56.0%
ValueCountFrequency (%)
0 2
 
2.2%
10 2
 
2.2%
50 6
6.6%
90 1
 
1.1%
95 1
 
1.1%
100 7
7.7%
125 1
 
1.1%
148 1
 
1.1%
170 2
 
2.2%
178 1
 
1.1%
ValueCountFrequency (%)
1600 1
 
1.1%
877 1
 
1.1%
870 1
 
1.1%
840 1
 
1.1%
798 1
 
1.1%
788 1
 
1.1%
774 1
 
1.1%
769 1
 
1.1%
700 3
3.3%
620 1
 
1.1%

Interactions

2023-12-12T16:33:31.120846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:33:34.052093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
호선역명시설명출구번호거리(미터)
호선1.0000.0000.7530.3950.164
역명0.0001.0001.0000.8630.976
시설명0.7531.0001.0000.5500.761
출구번호0.3950.8630.5501.0000.000
거리(미터)0.1640.9760.7610.0001.000
2023-12-12T16:33:34.179358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출구번호호선
출구번호1.0000.222
호선0.2221.000
2023-12-12T16:33:34.277463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
거리(미터)호선출구번호
거리(미터)1.0000.1050.000
호선0.1051.0000.222
출구번호0.0000.2221.000

Missing values

2023-12-12T16:33:31.256407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:33:31.381306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

호선역명시설명출구번호거리(미터)
01설화명곡화원고3100
11화원화원초250
21대 곡대진고3700
31진 천진천역환승주차장2200
41월 배월배초1210
51상 인영남중고8200
61월 촌대서초4620
71송 현송현초1870
81서부정류장관문시장 공영주차장3350
91대 명대명초3280
호선역명시설명출구번호거리(미터)
813건들바위영선초2600
823대봉교대봉초2700
833수성시장동일초4500
843수성구민운동장수성구구민운동장1600
853어린이회관대구과고1100
863황 금들안길초1600
873수성못두산초1280
883지 산지산초1260
893범 물지산공원2170
903용 지복명초3148