Overview

Dataset statistics

Number of variables4
Number of observations260
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.0 KiB
Average record size in memory35.5 B

Variable types

Numeric3
Text1

Dataset

Description파일 다운로드
Author서울교통공사
URLhttps://data.seoul.go.kr/dataList/OA-2785/S/1/datasetView.do

Alerts

연번 is highly overall correlated with 호선High correlation
호선 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2024-04-29 22:02:15.612584
Analysis finished2024-04-29 22:02:17.968558
Duration2.36 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct260
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean130.5
Minimum1
Maximum260
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.4 KiB
2024-04-30T07:02:18.046133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile13.95
Q165.75
median130.5
Q3195.25
95-th percentile247.05
Maximum260
Range259
Interquartile range (IQR)129.5

Descriptive statistics

Standard deviation75.199734
Coefficient of variation (CV)0.57624317
Kurtosis-1.2
Mean130.5
Median Absolute Deviation (MAD)65
Skewness0
Sum33930
Variance5655
MonotonicityStrictly increasing
2024-04-30T07:02:18.171087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
165 1
 
0.4%
167 1
 
0.4%
168 1
 
0.4%
169 1
 
0.4%
170 1
 
0.4%
171 1
 
0.4%
172 1
 
0.4%
173 1
 
0.4%
174 1
 
0.4%
Other values (250) 250
96.2%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
260 1
0.4%
259 1
0.4%
258 1
0.4%
257 1
0.4%
256 1
0.4%
255 1
0.4%
254 1
0.4%
253 1
0.4%
252 1
0.4%
251 1
0.4%

호선
Real number (ℝ)

HIGH CORRELATION 

Distinct8
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.6846154
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.4 KiB
2024-04-30T07:02:18.263608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q13
median5
Q37
95-th percentile8
Maximum8
Range7
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.0345837
Coefficient of variation (CV)0.43431179
Kurtosis-1.2002478
Mean4.6846154
Median Absolute Deviation (MAD)2
Skewness-0.12262714
Sum1218
Variance4.1395307
MonotonicityIncreasing
2024-04-30T07:02:18.372918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
5 51
19.6%
7 50
19.2%
2 46
17.7%
6 33
12.7%
3 31
11.9%
4 23
8.8%
8 16
 
6.2%
1 10
 
3.8%
ValueCountFrequency (%)
1 10
 
3.8%
2 46
17.7%
3 31
11.9%
4 23
8.8%
5 51
19.6%
6 33
12.7%
7 50
19.2%
8 16
 
6.2%
ValueCountFrequency (%)
8 16
 
6.2%
7 50
19.2%
6 33
12.7%
5 51
19.6%
4 23
8.8%
3 31
11.9%
2 46
17.7%
1 10
 
3.8%

역명
Text

Distinct241
Distinct (%)92.7%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2024-04-30T07:02:18.643688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length2
Mean length2.9538462
Min length2

Characters and Unicode

Total characters768
Distinct characters212
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique222 ?
Unique (%)85.4%

Sample

1st row서울역
2nd row시청
3rd row종각
4th row종로3가
5th row종로5가
ValueCountFrequency (%)
서울역 2
 
0.8%
동묘앞 2
 
0.8%
삼각지 2
 
0.8%
대림 2
 
0.8%
잠실 2
 
0.8%
합정 2
 
0.8%
사당 2
 
0.8%
충정로 2
 
0.8%
교대 2
 
0.8%
노원 2
 
0.8%
Other values (231) 240
92.3%
2024-04-30T07:02:19.030645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
32
 
4.2%
28
 
3.6%
24
 
3.1%
22
 
2.9%
20
 
2.6%
14
 
1.8%
14
 
1.8%
14
 
1.8%
12
 
1.6%
12
 
1.6%
Other values (202) 576
75.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 759
98.8%
Decimal Number 5
 
0.7%
Close Punctuation 2
 
0.3%
Open Punctuation 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
4.2%
28
 
3.7%
24
 
3.2%
22
 
2.9%
20
 
2.6%
14
 
1.8%
14
 
1.8%
14
 
1.8%
12
 
1.6%
12
 
1.6%
Other values (197) 567
74.7%
Decimal Number
ValueCountFrequency (%)
3 3
60.0%
5 1
 
20.0%
4 1
 
20.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 759
98.8%
Common 9
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
4.2%
28
 
3.7%
24
 
3.2%
22
 
2.9%
20
 
2.6%
14
 
1.8%
14
 
1.8%
14
 
1.8%
12
 
1.6%
12
 
1.6%
Other values (197) 567
74.7%
Common
ValueCountFrequency (%)
3 3
33.3%
) 2
22.2%
( 2
22.2%
5 1
 
11.1%
4 1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 759
98.8%
ASCII 9
 
1.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
32
 
4.2%
28
 
3.7%
24
 
3.2%
22
 
2.9%
20
 
2.6%
14
 
1.8%
14
 
1.8%
14
 
1.8%
12
 
1.6%
12
 
1.6%
Other values (197) 567
74.7%
ASCII
ValueCountFrequency (%)
3 3
33.3%
) 2
22.2%
( 2
22.2%
5 1
 
11.1%
4 1
 
11.1%

대수
Real number (ℝ)

Distinct159
Distinct (%)61.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean129.70385
Minimum4
Maximum1125
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.4 KiB
2024-04-30T07:02:19.163484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4
5-th percentile12
Q144
median82
Q3168.5
95-th percentile391.25
Maximum1125
Range1121
Interquartile range (IQR)124.5

Descriptive statistics

Standard deviation141.58254
Coefficient of variation (CV)1.0915832
Kurtosis12.590637
Mean129.70385
Median Absolute Deviation (MAD)51
Skewness2.8966651
Sum33723
Variance20045.615
MonotonicityNot monotonic
2024-04-30T07:02:19.282064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10 6
 
2.3%
70 6
 
2.3%
30 5
 
1.9%
49 5
 
1.9%
50 5
 
1.9%
24 4
 
1.5%
20 4
 
1.5%
75 3
 
1.2%
36 3
 
1.2%
7 3
 
1.2%
Other values (149) 216
83.1%
ValueCountFrequency (%)
4 1
 
0.4%
5 1
 
0.4%
6 1
 
0.4%
7 3
1.2%
10 6
2.3%
12 3
1.2%
13 1
 
0.4%
14 1
 
0.4%
15 1
 
0.4%
17 2
 
0.8%
ValueCountFrequency (%)
1125 1
0.4%
820 1
0.4%
700 1
0.4%
652 1
0.4%
630 1
0.4%
535 1
0.4%
460 2
0.8%
440 2
0.8%
435 1
0.4%
434 1
0.4%

Interactions

2024-04-30T07:02:17.586397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:02:17.006804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:02:17.327057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:02:17.664374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:02:17.164469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:02:17.402634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:02:17.747835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:02:17.254905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:02:17.488976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-30T07:02:19.357471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번호선대수
연번1.0000.9220.000
호선0.9221.0000.056
대수0.0000.0561.000
2024-04-30T07:02:19.434080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번호선대수
연번1.0000.9870.123
호선0.9871.0000.127
대수0.1230.1271.000

Missing values

2024-04-30T07:02:17.862123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T07:02:17.934140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번호선역명대수
011서울역10
121시청25
231종각30
341종로3가60
451종로5가5
561동대문30
671신설동66
781제기동164
891청량리54
9101동묘앞6
연번호선역명대수
2502518가락시장221
2512528문정70
2522538장지147
2532548복정180
2542558산성43
2552568남한산성입구52
2562578단대오거리103
2572588신흥55
2582598수진36
2592608모란61