Overview

Dataset statistics

Number of variables: 2
Number of observations: 673
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 11.3 KiB
Average record size in memory: 17.2 B

Variable types

Text: 1
Numeric: 1

Dataset

Description: File download
Author: Seoul Metropolitan Government (서울특별시)
URL: https://data.seoul.go.kr/dataList/OA-15262/F/1/datasetView.do

Alerts

노선명 (route name) has unique values (Unique)
ROUTEID has unique values (Unique)
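
The "unique values" alert means that every value in the column occurs exactly once, which matches the distinct count of 673 for 673 observations. A minimal sketch of that check, where `has_unique_values` is a hypothetical helper and not the profiler's own code:

```python
def has_unique_values(values):
    """Return True when no value occurs more than once."""
    values = list(values)
    return len(set(values)) == len(values)


# First five 노선명 values from the report's sample.
route_names = ["0017", "01", "0411", "100", "101"]
print(has_unique_values(route_names))
```

Because both columns are unique, either one could serve as a key for the dataset.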

Reproduction

Analysis started: 2023-12-11 07:07:59.066970
Analysis finished: 2023-12-11 07:07:59.414543
Duration: 0.35 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

노선명 (route name)
Text

UNIQUE 

Distinct: 673
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 5.4 KiB

Length

Max length: 12
Median length: 4
Mean length: 3.9628529
Min length: 2

Characters and Unicode

Total characters: 2667
Distinct characters: 75
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
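
As a rough illustration of how such character properties can be read, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The helper name `category_counts` is an assumption for illustration, and the profiler may compute its figures differently. The two-letter codes map to the category names used in this report: `Nd` is Decimal Number, `Lo` is Other Letter, `Lu` is Uppercase Letter, `Pd` is Dash Punctuation, `Ps` is Open Punctuation and `Pe` is Close Punctuation.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character in `values`."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)


# Route names taken from the sample and value tables of this report.
names = ["0017", "강서04", "청와대A01(자율주행)"]
counts = category_counts(names)
print(counts)
```

The standard library exposes categories and character names but not scripts or blocks, so those two breakdowns would need an additional data source.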

Unique

Unique: 673
Unique (%): 100.0%

Sample

1st row: 0017
2nd row: 01
3rd row: 0411
4th row: 100
5th row: 101

Value                 Count   Frequency (%)
0017                  1       0.1%
강서04                1       0.1%
관악06                1       0.1%
강북09                1       0.1%
강북10                1       0.1%
강북11                1       0.1%
강북12                1       0.1%
강서01                1       0.1%
강서02                1       0.1%
강서03                1       0.1%
Other values (663)    663     98.5%

Most occurring characters

Value                 Count   Frequency (%)
1                     446     16.7%
0                     374     14.0%
2                     241     9.0%
6                     214     8.0%
3                     183     6.9%
7                     166     6.2%
5                     160     6.0%
4                     153     5.7%
8                     59      2.2%
[not rendered]        46      1.7%
Other values (65)     625     23.4%

Most occurring categories

Value                 Count   Frequency (%)
Decimal Number        2038    76.4%
Other Letter          558     20.9%
Uppercase Letter      54      2.0%
Dash Punctuation      15      0.6%
Open Punctuation      1       < 0.1%
Close Punctuation     1       < 0.1%

Most frequent character per category

Other Letter
Value                 Count   Frequency (%)
[not rendered]        46      8.2%
[not rendered]        41      7.3%
[not rendered]        32      5.7%
[not rendered]        32      5.7%
[not rendered]        31      5.6%
[not rendered]        30      5.4%
[not rendered]        26      4.7%
[not rendered]        24      4.3%
[not rendered]        21      3.8%
[not rendered]        21      3.8%
Other values (45)     254     45.5%

Decimal Number
Value                 Count   Frequency (%)
1                     446     21.9%
0                     374     18.4%
2                     241     11.8%
6                     214     10.5%
3                     183     9.0%
7                     166     8.1%
5                     160     7.9%
4                     153     7.5%
8                     59      2.9%
9                     42      2.1%

Uppercase Letter
Value                 Count   Frequency (%)
N                     17      31.5%
A                     7       13.0%
R                     6       11.1%
O                     6       11.1%
T                     6       11.1%
B                     6       11.1%
U                     6       11.1%

Dash Punctuation
Value                 Count   Frequency (%)
-                     15      100.0%

Open Punctuation
Value                 Count   Frequency (%)
(                     1       100.0%

Close Punctuation
Value                 Count   Frequency (%)
)                     1       100.0%

Most occurring scripts

Value                 Count   Frequency (%)
Common                2055    77.1%
Hangul                558     20.9%
Latin                 54      2.0%

Most frequent character per script

Hangul
Value                 Count   Frequency (%)
[not rendered]        46      8.2%
[not rendered]        41      7.3%
[not rendered]        32      5.7%
[not rendered]        32      5.7%
[not rendered]        31      5.6%
[not rendered]        30      5.4%
[not rendered]        26      4.7%
[not rendered]        24      4.3%
[not rendered]        21      3.8%
[not rendered]        21      3.8%
Other values (45)     254     45.5%

Common
Value                 Count   Frequency (%)
1                     446     21.7%
0                     374     18.2%
2                     241     11.7%
6                     214     10.4%
3                     183     8.9%
7                     166     8.1%
5                     160     7.8%
4                     153     7.4%
8                     59      2.9%
9                     42      2.0%
Other values (3)      17      0.8%

Latin
Value                 Count   Frequency (%)
N                     17      31.5%
A                     7       13.0%
R                     6       11.1%
O                     6       11.1%
T                     6       11.1%
B                     6       11.1%
U                     6       11.1%

Most occurring blocks

Value                 Count   Frequency (%)
ASCII                 2109    79.1%
Hangul                558     20.9%

Most frequent character per block

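ASCII
Value                 Count   Frequency (%)
1                     446     21.1%
0                     374     17.7%
2                     241     11.4%
6                     214     10.1%
3                     183     8.7%
7                     166     7.9%
5                     160     7.6%
4                     153     7.3%
8                     59      2.8%
9                     42      2.0%
Other values (10)     71      3.4%

Hangul
Value                 Count   Frequency (%)
[not rendered]        46      8.2%
[not rendered]        41      7.3%
[not rendered]        32      5.7%
[not rendered]        32      5.7%
[not rendered]        31      5.6%
[not rendered]        30      5.4%
[not rendered]        26      4.7%
[not rendered]        24      4.3%
[not rendered]        21      3.8%
[not rendered]        21      3.8%
Other values (45)     254     45.5%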

ROUTEID
Real number (ℝ)

UNIQUE 

Distinct: 673
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.0638321 × 10^8
Minimum: 1.0000002 × 10^8
Maximum: 1.249 × 10^8
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 6.0 KiB

Quantile statistics

Minimum: 1.0000002 × 10^8
5-th percentile: 1.0010004 × 10^8
Q1: 1.0010025 × 10^8
Median: 1.0010058 × 10^8
Q3: 1.1290001 × 10^8
95-th percentile: 1.2190001 × 10^8
Maximum: 1.249 × 10^8
Range: 24899986
Interquartile range (IQR): 12799765
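
The range is the maximum minus the minimum, and the IQR is Q3 minus Q1. The sketch below shows both using only the standard library. The helper `range_and_iqr` is hypothetical, and the choice of `method="inclusive"` (linear interpolation between data points) is an assumption about how the quartiles were computed, not something the report states. The extreme values come from the ROUTEID extreme-value tables in this report.

```python
import statistics


def range_and_iqr(values):
    """Return (range, interquartile range) of a numeric sequence."""
    # Assumption: quartiles by linear interpolation over the observed data.
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return max(values) - min(values), q3 - q1


# Smallest and largest ROUTEID listed in the report.
minimum, maximum = 100000017, 124900003
print(maximum - minimum)  # matches the reported range of 24899986

# Small illustrative input, not taken from the dataset.
print(range_and_iqr([1, 2, 3, 4, 5]))
```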

Descriptive statistics

Standard deviation: 8099742.6
Coefficient of variation (CV): 0.076137415
Kurtosis: -0.7839389
Mean: 1.0638321 × 10^8
Median Absolute Deviation (MAD): 559
Skewness: 0.86146069
Sum: 7.15959 × 10^10
Variance: 6.560583 × 10^13
Monotonicity: Not monotonic
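
Several of these figures are tied together by simple identities: the CV is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of observations. The sketch below checks the reported (rounded) values against those identities and adds a hypothetical `median_abs_deviation` helper. It computes the unscaled MAD, which is an assumption about the definition used in the report.

```python
import math
import statistics

# Figures as printed in this report (already rounded).
n = 673
mean = 1.0638321e8
std = 8099742.6

cv = std / mean        # coefficient of variation
variance = std ** 2    # variance is the squared standard deviation
total = mean * n       # sum of all values


def median_abs_deviation(values):
    """Unscaled median of absolute deviations from the median."""
    values = list(values)
    med = statistics.median(values)
    return statistics.median(abs(v - med) for v in values)


print(math.isclose(cv, 0.076137415, rel_tol=1e-6))
print(math.isclose(variance, 6.560583e13, rel_tol=1e-6))
print(math.isclose(total, 7.15959e10, rel_tol=1e-6))
```

A MAD of 559 next to a standard deviation of about 8.1 million suggests that most ROUTEID values sit close to the median while a smaller group lies far away, which fits the positive skewness.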
Histogram with fixed size bins (bins=50)
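
With fixed-size bins, the interval from the minimum to the maximum is split into 50 equal-width bins and each value is assigned to the bin it falls in. A minimal sketch, assuming the common convention that the last bin also includes the maximum. `bin_index` is a hypothetical helper, not the plotting library's API.

```python
def bin_index(value, lo, hi, bins=50):
    """Index of the equal-width bin containing `value`; the last bin is closed."""
    if not lo <= value <= hi:
        raise ValueError("value outside [lo, hi]")
    width = (hi - lo) / bins
    # Clamp so that value == hi lands in the final bin rather than one past it.
    return min(int((value - lo) / width), bins - 1)


# Smallest and largest ROUTEID listed in the report.
lo, hi = 100000017, 124900003
print(bin_index(100100124, lo, hi))
```

Each bin here is about 498,000 wide, so all IDs that share a prefix such as 1001xxxxx fall into the same bin.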

Common values
Value                 Count   Frequency (%)
100100124             1       0.1%
115900005             1       0.1%
108900004             1       0.1%
108900009             1       0.1%
108900001             1       0.1%
108900012             1       0.1%
115900006             1       0.1%
115900003             1       0.1%
115900004             1       0.1%
115900001             1       0.1%
Other values (663)    663     98.5%

Minimum 10 values
Value                 Count   Frequency (%)
100000017             1       0.1%
100000018             1       0.1%
100000020             1       0.1%
100100001             1       0.1%
100100006             1       0.1%
100100007             1       0.1%
100100008             1       0.1%
100100009             1       0.1%
100100010             1       0.1%
100100011             1       0.1%

Maximum 10 values
Value                 Count   Frequency (%)
124900003             1       0.1%
124900002             1       0.1%
124900001             1       0.1%
124000039             1       0.1%
124000038             1       0.1%
124000036             1       0.1%
124000016             1       0.1%
124000015             1       0.1%
124000014             1       0.1%
124000013             1       0.1%

Interactions


Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completeness.
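
Both plots are built from per-column counts of missing entries. A minimal sketch of that count, assuming records are held as dictionaries and a missing cell is represented by `None`. `missing_per_column` is a hypothetical helper, and the two rows are taken from the sample in this report.

```python
def missing_per_column(rows):
    """Count None entries per column in a list of dict records."""
    counts = {}
    for row in rows:
        for column, value in row.items():
            counts[column] = counts.get(column, 0) + (value is None)
    return counts


rows = [
    {"노선명": "0017", "ROUTEID": 100100124},
    {"노선명": "01", "ROUTEID": 100100001},
]
print(missing_per_column(rows))
```

For this dataset every count is zero, consistent with the 0 missing cells reported in the overview.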

Sample

First rows
Index   노선명                  ROUTEID
0       0017                    100100124
1       01                      100100001
2       0411                    104000012
3       100                     100100549
4       101                     100100006
5       1014                    100100129
6       1017                    100100130
7       102                     100100007
8       1020                    100100131
9       103                     100100008

Last rows
Index   노선명                  ROUTEID
663     종로03                  100900010
664     종로05                  100900011
665     종로08                  100900005
666     종로09                  100900003
667     종로11                  100900007
668     종로12                  100900009
669     종로13                  100900002
670     중랑01                  106900001
671     중랑02                  106900002
672     청와대A01(자율주행)     100000020