Overview

Dataset statistics

Number of variables: 9
Number of observations: 127
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): 0.8%
Total size in memory: 9.1 KiB
Average record size in memory: 73.0 B
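The percentages above are plain ratios over the row and cell counts. A minimal sketch, assuming rows are held as tuples and that every repeat of an already-seen row counts as one duplicate (the helper name `dataset_statistics` and the sample rows are hypothetical, not taken from the report):

```python
from collections import Counter


def dataset_statistics(rows, missing=None):
    """Summarise a list of equal-length tuples in the style of the overview."""
    n_obs = len(rows)
    n_vars = len(rows[0]) if rows else 0
    total_cells = n_obs * n_vars
    # Only the `missing` sentinel counts as missing; a literal "<NA>" string does not.
    missing_cells = sum(cell is missing for row in rows for cell in row)
    # Assumption: each repeat of an already-seen row counts as one duplicate.
    duplicates = sum(count - 1 for count in Counter(rows).values())
    return {
        "variables": n_vars,
        "observations": n_obs,
        "missing_cells": missing_cells,
        "missing_pct": round(100 * missing_cells / total_cells, 1) if total_cells else 0.0,
        "duplicate_rows": duplicates,
        "duplicate_pct": round(100 * duplicates / n_obs, 1) if n_obs else 0.0,
    }


# Hypothetical data: 126 distinct 9-column rows plus one exact repeat.
rows = [(f"station{i}",) + ("1",) * 8 for i in range(126)]
rows.append(rows[0])
print(dataset_statistics(rows))
```

With 127 rows and one repeat, the duplicate share is 1/127, which rounds to the 0.8% shown above.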

Variable types

Text: 1
Categorical: 8

Dataset

Description: File download
Author: Seoul Metro (서울교통공사)
URL: https://data.seoul.go.kr/dataList/OA-13321/F/1/datasetView.do

Alerts

Dataset has 1 (0.8%) duplicate rows [Duplicates]
유아수유실 is highly overall correlated with 자전거경사로 and 5 other fields [High correlation]
인출기 is highly overall correlated with 유아수유실 and 5 other fields [High correlation]
무 민원발급기설치역 is highly overall correlated with 유아수유실 and 5 other fields [High correlation]
무인택배보관함 is highly overall correlated with 유아수유실 and 5 other fields [High correlation]
자동칼라사진기 is highly overall correlated with 유아수유실 and 5 other fields [High correlation]
스넥자판기 is highly overall correlated with 유아수유실 and 5 other fields [High correlation]
자전거경사로 is highly overall correlated with 유아수유실 and 5 other fields [High correlation]
유아수유실 is highly imbalanced (61.4%) [Imbalance]
자전거경사로 is highly imbalanced (60.3%) [Imbalance]
자동칼라사진기 is highly imbalanced (59.0%) [Imbalance]
무 민원발급기설치역 is highly imbalanced (61.4%) [Imbalance]
무인민원발급기관할구청 is highly imbalanced (59.5%) [Imbalance]
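The imbalance percentages can be checked by hand if the score is taken to be one minus the Shannon entropy of the value counts, normalised by the logarithm of the number of distinct values. That definition is an assumption here, although it agrees with the figures in this report. A minimal sketch, using the counts listed under Common Values for 유아수유실 and 자전거경사로 (the function name is hypothetical):

```python
import math


def imbalance_score(counts):
    """Return 1 - H(counts) / log2(k), where k is the number of distinct values."""
    k = len(counts)
    if k <= 1:
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Counts copied from the Common Values tables of this report.
nursing_room = [94, 26, 2, 1, 1, 1, 1, 1]   # 유아수유실, 8 distinct values
bicycle_ramp = [96, 24, 2, 2, 1, 1, 1]      # 자전거경사로, 7 distinct values

print(f"{imbalance_score(nursing_room):.3f}")
print(f"{imbalance_score(bicycle_ramp):.3f}")
```

A score near 0 means the values are spread evenly; a score near 1 means a single value dominates, which is what the blank entries do in these columns.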

Reproduction

Analysis started: 2023-12-11 08:45:26.723110
Analysis finished: 2023-12-11 08:45:27.765567
Duration: 1.04 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
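A report of this kind is typically produced by handing a pandas DataFrame to `ProfileReport`. The sketch below is only a guess at the workflow: the CSV path, the output path, the title, and the choice of `dtype=str` (suggested by every column being typed Text or Categorical) are all assumptions, and the actual configuration lives in config.json. The third-party imports sit inside the function so the module loads even where pandas or ydata-profiling is not installed.

```python
def build_profile(csv_path, html_path="report.html", title="Profiling Report"):
    """Profile a CSV file and write the HTML report (paths are hypothetical)."""
    # Imported lazily: both packages are third-party dependencies.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: read every column as text so counts stay categorical.
    df = pd.read_csv(csv_path, dtype=str)
    profile = ProfileReport(df, title=title)
    profile.to_file(html_path)
    return profile


if __name__ == "__main__":
    # Hypothetical file name; replace with the file downloaded from the URL above.
    build_profile("station_facilities.csv")
```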

Variables

역명
Text

Distinct: 116
Distinct (%): 91.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 5
Median length: 4
Mean length: 2.8661417
Min length: 2

Characters and Unicode

Total characters: 364
Distinct characters: 144
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
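Python's standard `unicodedata` module exposes the general category of each code point, which is enough to sketch the category breakdown for a text column. The example below uses the five sample station names listed in this section; the function name and the two-entry label map are illustrative only, and scripts and blocks are not covered because the standard library does not expose them directly.

```python
import unicodedata
from collections import Counter

# Readable labels for the two general categories that appear in this column.
CATEGORY_NAMES = {"Lo": "Other Letter", "Nd": "Decimal Number"}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)  # e.g. "Lo" for a Hangul syllable
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


sample_names = ["전체계", "1호선", "서울역", "시청", "종각"]
print(category_counts(sample_names))
```

Hangul syllables fall under Other Letter and ASCII digits under Decimal Number, which is why the full column reports exactly two categories.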

Unique

Unique: 105
Unique (%): 82.7%
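Distinct and Unique measure different things: Distinct counts the different values present, while Unique counts the values that occur exactly once. A minimal sketch with synthetic values chosen to mirror the figures for this column (the names are placeholders, not real station names):

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Synthetic column: 105 values seen once and 11 values seen twice (127 rows).
values = [f"once{i}" for i in range(105)] + [f"twice{i}" for i in range(11)] * 2
print(len(values), distinct_and_unique(values))
```

The arithmetic is consistent with the report: 116 distinct values minus 105 unique ones leaves 11 repeated values, and 105 + 2 × 11 = 127 observations.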

Sample

1st row: 전체계
2nd row: 1호선
3rd row: 서울역
4th row: 시청
5th row: 종각
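
| Value | Count | Frequency (%) |
| --- | --- | --- |
| 충무로 | 2 | 1.6% |
| 문화공원 | 2 | 1.6% |
| 동대문역사 | 2 | 1.6% |
| 을지로3가 | 2 | 1.6% |
| 신설동 | 2 | 1.6% |
| 교대 | 2 | 1.6% |
| 서울역 | 2 | 1.6% |
| 종로3가 | 2 | 1.6% |
| 시청 | 2 | 1.6% |
| 동대문 | 2 | 1.6% |
| Other values (106) | 107 | 84.3% |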
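A frequency table of this shape keeps the most common entries and folds the rest into a single "Other values" row. A minimal sketch of that aggregation, assuming a cutoff of ten rows and percentages rounded to one decimal (the function name and the synthetic data are hypothetical):

```python
from collections import Counter


def frequency_table(values, top=10):
    """Return (value, count, percent) rows, folding the tail into one row."""
    total = len(values)
    ranked = Counter(values).most_common()  # ties keep first-seen order
    rows = [(v, c, round(100 * c / total, 1)) for v, c in ranked[:top]]
    tail = ranked[top:]
    if tail:
        tail_count = sum(c for _, c in tail)
        label = f"Other values ({len(tail)})"
        rows.append((label, tail_count, round(100 * tail_count / total, 1)))
    return rows


# Synthetic data: 11 entries seen twice, then 105 entries seen once.
data = [f"twice{i}" for i in range(11)] * 2 + [f"once{i}" for i in range(105)]
for row in frequency_table(data):
    print(row)
```

With 127 entries, the ten listed rows account for 20 of them, so the remaining 106 values cover 107 entries, or 84.3%.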

Most occurring characters

ValueCountFrequency (%)
21
 
5.8%
15
 
4.1%
13
 
3.6%
13
 
3.6%
11
 
3.0%
9
 
2.5%
7
 
1.9%
7
 
1.9%
7
 
1.9%
6
 
1.6%
Other values (134) 255
70.1%

Most occurring categories

| Value | Count | Frequency (%) |
| --- | --- | --- |
| Other Letter | 354 | 97.3% |
| Decimal Number | 10 | 2.7% |

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21
 
5.9%
15
 
4.2%
13
 
3.7%
13
 
3.7%
11
 
3.1%
9
 
2.5%
7
 
2.0%
7
 
2.0%
7
 
2.0%
6
 
1.7%
Other values (129) 245
69.2%
Decimal Number

| Value | Count | Frequency (%) |
| --- | --- | --- |
| 3 | 5 | 50.0% |
| 4 | 2 | 20.0% |
| 2 | 1 | 10.0% |
| 5 | 1 | 10.0% |
| 1 | 1 | 10.0% |

Most occurring scripts

| Value | Count | Frequency (%) |
| --- | --- | --- |
| Hangul | 354 | 97.3% |
| Common | 10 | 2.7% |

Most frequent character per script

Hangul
ValueCountFrequency (%)
21
 
5.9%
15
 
4.2%
13
 
3.7%
13
 
3.7%
11
 
3.1%
9
 
2.5%
7
 
2.0%
7
 
2.0%
7
 
2.0%
6
 
1.7%
Other values (129) 245
69.2%
Common

| Value | Count | Frequency (%) |
| --- | --- | --- |
| 3 | 5 | 50.0% |
| 4 | 2 | 20.0% |
| 2 | 1 | 10.0% |
| 5 | 1 | 10.0% |
| 1 | 1 | 10.0% |

Most occurring blocks

| Value | Count | Frequency (%) |
| --- | --- | --- |
| Hangul | 354 | 97.3% |
| ASCII | 10 | 2.7% |

Most frequent character per block

Hangul
ValueCountFrequency (%)
21
 
5.9%
15
 
4.2%
13
 
3.7%
13
 
3.7%
11
 
3.1%
9
 
2.5%
7
 
2.0%
7
 
2.0%
7
 
2.0%
6
 
1.7%
Other values (129) 245
69.2%
ASCII

| Value | Count | Frequency (%) |
| --- | --- | --- |
| 3 | 5 | 50.0% |
| 4 | 2 | 20.0% |
| 2 | 1 | 10.0% |
| 5 | 1 | 10.0% |
| 1 | 1 | 10.0% |

유아수유실
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 8
Distinct (%): 6.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 5
Median length: 1
Mean length: 1.0866142
Min length: 1
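These length statistics can be rebuilt from the value counts listed under Common Values for this column. The sketch assumes that the two values rendered without visible text are each one character long (consistent with a minimum length of 1) and that `<NA>` is stored as a literal four-character string; the stand-in characters `"x"` and `"y"` and the function name are hypothetical.

```python
from statistics import mean, median


def length_summary(value_counts):
    """Summarise string lengths for a {value: count} mapping."""
    lengths = [len(value) for value, count in value_counts.items() for _ in range(count)]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }


# Counts from the Common Values table; "x" and "y" stand in for the two
# values whose text is not visible in the report (assumed one character each).
nursing_room = {"x": 94, "y": 26, "<NA>": 2, "26(역)": 1, "2": 1, "11": 1, "6": 1, "7": 1}
print(length_summary(nursing_room))
```

Under those assumptions the total is 138 characters over 127 rows, which gives the mean of 1.0866142 reported above.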

Unique

Unique: 5
Unique (%): 3.9%

Sample

1st row: 26(역)
2nd row: 2
3rd row:
4th row:
5th row:

Common Values

| Value | Count | Frequency (%) |
| --- | --- | --- |
|  | 94 | 74.0% |
|  | 26 | 20.5% |
| <NA> | 2 | 1.6% |
| 26(역) | 1 | 0.8% |
| 2 | 1 | 0.8% |
| 11 | 1 | 0.8% |
| 6 | 1 | 0.8% |
| 7 | 1 | 0.8% |

Length

Histogram of lengths of the category

Common Values (Plot)

| Value | Count | Frequency (%) |
| --- | --- | --- |
|  | 26 | 78.8% |
| na | 2 | 6.1% |
| 26(역 | 1 | 3.0% |
| 2 | 1 | 3.0% |
| 11 | 1 | 3.0% |
| 6 | 1 | 3.0% |
| 7 | 1 | 3.0% |

자전거경사로
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 7
Distinct (%): 5.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 5
Median length: 1
Mean length: 1.0787402
Min length: 1

Unique

Unique: 3
Unique (%): 2.4%

Sample

1st row: 24(역)
2nd row: 3
3rd row:
4th row:
5th row:

Common Values

| Value | Count | Frequency (%) |
| --- | --- | --- |
|  | 96 | 75.6% |
|  | 24 | 18.9% |
| <NA> | 2 | 1.6% |
| 6 | 2 | 1.6% |
| 24(역) | 1 | 0.8% |
| 3 | 1 | 0.8% |
| 9 | 1 | 0.8% |

Length

Histogram of lengths of the category

Common Values (Plot)

| Value | Count | Frequency (%) |
| --- | --- | --- |
|  | 24 | 77.4% |
| na | 2 | 6.5% |
| 6 | 2 | 6.5% |
| 24(역 | 1 | 3.2% |
| 3 | 1 | 3.2% |
| 9 | 1 | 3.2% |

무인택배보관함
Categorical

HIGH CORRELATION 

Distinct: 12
Distinct (%): 9.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 6
Median length: 1
Mean length: 1.1181102
Min length: 1

Unique

Unique: 5
Unique (%): 3.9%

Sample

1st row: 171(대)
2nd row: 17
3rd row: 5
4th row: 2
5th row: 2

Common Values

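| Value | Count | Frequency (%) |
| --- | --- | --- |
| 1 | 60 | 47.2% |
| 2 | 25 | 19.7% |
|  | 23 | 18.1% |
| 3 | 6 | 4.7% |
| 4 | 4 | 3.1% |
| 5 | 2 | 1.6% |
| <NA> | 2 | 1.6% |
| 171(대) | 1 | 0.8% |
| 17 | 1 | 0.8% |
| 69 | 1 | 0.8% |
| Other values (2) | 2 | 1.6% |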

Length

Histogram of lengths of the category
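
| Value | Count | Frequency (%) |
| --- | --- | --- |
| 1 | 60 | 57.7% |
| 2 | 25 | 24.0% |
| 3 | 6 | 5.8% |
| 4 | 4 | 3.8% |
| 5 | 2 | 1.9% |
| na | 2 | 1.9% |
| 171(대 | 1 | 1.0% |
| 17 | 1 | 1.0% |
| 69 | 1 | 1.0% |
| 28 | 1 | 1.0% |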

인출기
Categorical

HIGH CORRELATION 

Distinct: 11
Distinct (%): 8.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 6
Median length: 1
Mean length: 1.1259843
Min length: 1

Unique

Unique: 6
Unique (%): 4.7%

Sample

1st row: 227(대)
2nd row: 17
3rd row: 2
4th row: 3
5th row: 3

Common Values

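| Value | Count | Frequency (%) |
| --- | --- | --- |
| 2 | 40 | 31.5% |
| 1 | 38 | 29.9% |
| 3 | 35 | 27.6% |
|  | 6 | 4.7% |
| <NA> | 2 | 1.6% |
| 227(대) | 1 | 0.8% |
| 17 | 1 | 0.8% |
| 110 | 1 | 0.8% |
| 4 | 1 | 0.8% |
| 49 | 1 | 0.8% |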

Length

Histogram of lengths of the category
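
| Value | Count | Frequency (%) |
| --- | --- | --- |
| 2 | 40 | 33.1% |
| 1 | 38 | 31.4% |
| 3 | 35 | 28.9% |
| na | 2 | 1.7% |
| 227(대 | 1 | 0.8% |
| 17 | 1 | 0.8% |
| 110 | 1 | 0.8% |
| 4 | 1 | 0.8% |
| 49 | 1 | 0.8% |
| 52 | 1 | 0.8% |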

스넥자판기
Categorical

HIGH CORRELATION 

Distinct: 10
Distinct (%): 7.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 6
Median length: 1
Mean length: 1.1181102
Min length: 1

Unique

Unique: 5
Unique (%): 3.9%

Sample

1st row: 190(대)
2nd row: 18
3rd row: 2
4th row: 2
5th row: 2

Common Values

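| Value | Count | Frequency (%) |
| --- | --- | --- |
| 2 | 77 | 60.6% |
| 1 | 30 | 23.6% |
|  | 11 | 8.7% |
| <NA> | 2 | 1.6% |
| 3 | 2 | 1.6% |
| 190(대) | 1 | 0.8% |
| 18 | 1 | 0.8% |
| 85 | 1 | 0.8% |
| 49 | 1 | 0.8% |
| 38 | 1 | 0.8% |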

Length

Histogram of lengths of the category

Common Values (Plot)

| Value | Count | Frequency (%) |
| --- | --- | --- |
| 2 | 77 | 66.4% |
| 1 | 30 | 25.9% |
| na | 2 | 1.7% |
| 3 | 2 | 1.7% |
| 190(대 | 1 | 0.9% |
| 18 | 1 | 0.9% |
| 85 | 1 | 0.9% |
| 49 | 1 | 0.9% |
| 38 | 1 | 0.9% |

자동칼라사진기
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 7
Distinct (%): 5.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 5
Median length: 1
Mean length: 1.0944882
Min length: 1

Unique

Unique: 4
Unique (%): 3.1%

Sample

1st row: 91(대)
2nd row: 8
3rd row: 1
4th row:
5th row: 1

Common Values

| Value | Count | Frequency (%) |
| --- | --- | --- |
| 1 | 91 | 71.7% |
|  | 30 | 23.6% |
| <NA> | 2 | 1.6% |
| 91(대) | 1 | 0.8% |
| 8 | 1 | 0.8% |
| 42 | 1 | 0.8% |
| 20 | 1 | 0.8% |

Length

Histogram of lengths of the category

Common Values (Plot)

| Value | Count | Frequency (%) |
| --- | --- | --- |
| 1 | 91 | 93.8% |
| na | 2 | 2.1% |
| 91(대 | 1 | 1.0% |
| 8 | 1 | 1.0% |
| 42 | 1 | 1.0% |
| 20 | 1 | 1.0% |

무 민원발급기설치역
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 8
Distinct (%): 6.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 5
Median length: 1
Mean length: 1.0866142
Min length: 1

Unique

Unique: 5
Unique (%): 3.9%

Sample

1st row: 26(역)
2nd row: 1
3rd row:
4th row:
5th row:

Common Values

| Value | Count | Frequency (%) |
| --- | --- | --- |
|  | 94 | 74.0% |
|  | 26 | 20.5% |
| <NA> | 2 | 1.6% |
| 26(역) | 1 | 0.8% |
| 1 | 1 | 0.8% |
| 13 | 1 | 0.8% |
| 7 | 1 | 0.8% |
| 5 | 1 | 0.8% |

Length

Histogram of lengths of the category

Common Values (Plot)

| Value | Count | Frequency (%) |
| --- | --- | --- |
|  | 26 | 78.8% |
| na | 2 | 6.1% |
| 26(역 | 1 | 3.0% |
| 1 | 1 | 3.0% |
| 13 | 1 | 3.0% |
| 7 | 1 | 3.0% |
| 5 | 1 | 3.0% |

무인민원발급기관할구청
Categorical

IMBALANCE 

Distinct: 13
Distinct (%): 10.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 4
Median length: 1
Mean length: 1.4566929
Min length: 1

Unique

Unique: 4
Unique (%): 3.1%

Sample

1st row:
2nd row:
3rd row: 중구
4th row:
5th row:

Common Values

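| Value | Count | Frequency (%) |
| --- | --- | --- |
|  | 99 | 78.0% |
| 강남구 | 6 | 4.7% |
| 관악구 | 4 | 3.1% |
| 중구 | 3 | 2.4% |
| 강북구 | 3 | 2.4% |
| <NA> | 2 | 1.6% |
| 서대문구 | 2 | 1.6% |
| 은평구 | 2 | 1.6% |
| 종로구 | 2 | 1.6% |
| 광진구 | 1 | 0.8% |
| Other values (3) | 3 | 2.4% |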

Length

Histogram of lengths of the category
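
| Value | Count | Frequency (%) |
| --- | --- | --- |
| 강남구 | 6 | 21.4% |
| 관악구 | 4 | 14.3% |
| 중구 | 3 | 10.7% |
| 강북구 | 3 | 10.7% |
| na | 2 | 7.1% |
| 서대문구 | 2 | 7.1% |
| 은평구 | 2 | 7.1% |
| 종로구 | 2 | 7.1% |
| 광진구 | 1 | 3.6% |
| 영등포구 | 1 | 3.6% |
| Other values (2) | 2 | 7.1% |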

Correlations

|  | 유아수유실 | 자전거경사로 | 무인택배보관함 | 인출기 | 스넥자판기 | 자동칼라사진기 | 무 민원발급기설치역 | 무인민원발급기관할구청 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 유아수유실 | 1.000 | 0.945 | 0.967 | 0.964 | 0.958 | 0.946 | 0.997 | 0.000 |
| 자전거경사로 | 0.945 | 1.000 | 0.961 | 0.958 | 0.983 | 0.987 | 0.944 | 0.000 |
| 무인택배보관함 | 0.967 | 0.961 | 1.000 | 0.946 | 0.931 | 0.971 | 0.970 | 0.000 |
| 인출기 | 0.964 | 0.958 | 0.946 | 1.000 | 0.934 | 0.970 | 0.965 | 0.000 |
| 스넥자판기 | 0.958 | 0.983 | 0.931 | 0.934 | 1.000 | 0.991 | 0.959 | 0.000 |
| 자동칼라사진기 | 0.946 | 0.987 | 0.971 | 0.970 | 0.991 | 1.000 | 0.949 | 0.000 |
| 무 민원발급기설치역 | 0.997 | 0.944 | 0.970 | 0.965 | 0.959 | 0.949 | 1.000 | 0.533 |
| 무인민원발급기관할구청 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.533 | 1.000 |

|  | 유아수유실 | 인출기 | 무 민원발급기설치역 | 무인택배보관함 | 무인민원발급기관할구청 | 자전거경사로 | 자동칼라사진기 | 스넥자판기 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 유아수유실 | 1.000 | 0.898 | 0.908 | 0.896 | 0.000 | 0.887 | 0.888 | 0.900 |
| 인출기 | 0.898 | 1.000 | 0.902 | 0.789 | 0.000 | 0.873 | 0.902 | 0.779 |
| 무 민원발급기설치역 | 0.908 | 0.902 | 1.000 | 0.904 | 0.286 | 0.885 | 0.895 | 0.902 |
| 무인택배보관함 | 0.896 | 0.789 | 0.904 | 1.000 | 0.000 | 0.873 | 0.902 | 0.779 |
| 무인민원발급기관할구청 | 0.000 | 0.000 | 0.286 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 |
| 자전거경사로 | 0.887 | 0.873 | 0.885 | 0.873 | 0.000 | 1.000 | 0.830 | 0.881 |
| 자동칼라사진기 | 0.888 | 0.902 | 0.895 | 0.902 | 0.000 | 0.830 | 1.000 | 0.908 |
| 스넥자판기 | 0.900 | 0.779 | 0.902 | 0.779 | 0.000 | 0.881 | 0.908 | 1.000 |

|  | 유아수유실 | 자전거경사로 | 무인택배보관함 | 인출기 | 스넥자판기 | 자동칼라사진기 | 무 민원발급기설치역 | 무인민원발급기관할구청 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 유아수유실 | 1.000 | 0.887 | 0.896 | 0.898 | 0.900 | 0.888 | 0.908 | 0.000 |
| 자전거경사로 | 0.887 | 1.000 | 0.873 | 0.873 | 0.881 | 0.830 | 0.885 | 0.000 |
| 무인택배보관함 | 0.896 | 0.873 | 1.000 | 0.789 | 0.779 | 0.902 | 0.904 | 0.000 |
| 인출기 | 0.898 | 0.873 | 0.789 | 1.000 | 0.779 | 0.902 | 0.902 | 0.000 |
| 스넥자판기 | 0.900 | 0.881 | 0.779 | 0.779 | 1.000 | 0.908 | 0.902 | 0.000 |
| 자동칼라사진기 | 0.888 | 0.830 | 0.902 | 0.902 | 0.908 | 1.000 | 0.895 | 0.000 |
| 무 민원발급기설치역 | 0.908 | 0.885 | 0.904 | 0.902 | 0.902 | 0.895 | 1.000 | 0.286 |
| 무인민원발급기관할구청 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.286 | 1.000 |
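
The report does not say which association measure fills these matrices. For pairs of categorical columns a common choice is Cramér's V, so the following is a minimal sketch of the plain, uncorrected statistic. It illustrates how such a coefficient is computed and is not expected to reproduce the values above; the function name and the toy inputs are hypothetical.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, row_total in row_totals.items():
        for y, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association
    return math.sqrt(chi2 / (n * k))


print(cramers_v(["a", "a", "b", "b"], ["p", "p", "q", "q"]))  # fully associated
print(cramers_v(["a", "a", "b", "b"], ["p", "q", "p", "q"]))  # independent
```

Values near 1 indicate that one column almost determines the other, which is the pattern flagged by the High correlation alerts.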

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

역명유아수유실자전거경사로무인택배보관함인출기스넥자판기자동칼라사진기무 민원발급기설치역무인민원발급기관할구청
0전체계26(역)24(역)171(대)227(대)190(대)91(대)26(역)
11호선2317171881
2서울역5221중구
3시청232
4종각2321
5종로3가2221
6종로5가1221
7동대문2121
8신설동1121
9제기동1121
역명유아수유실자전거경사로무인택배보관함인출기스넥자판기자동칼라사진기무 민원발급기설치역무인민원발급기관할구청
117회현2211
118서울역1311
119숙대입구1321
120삼각지112
121신용산1121
122이촌112
123동작11
124총신대입구2321
125사당2231
126남태령11

Duplicate rows

Most frequently occurring

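|  | 역명 | 유아수유실 | 자전거경사로 | 무인택배보관함 | 인출기 | 스넥자판기 | 자동칼라사진기 | 무 민원발급기설치역 | 무인민원발급기관할구청 | # duplicates |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 0 | 문화공원 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2 |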
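
A listing like this can be produced by grouping identical rows and keeping the groups that occur more than once. A minimal sketch, assuming rows are tuples of strings and that `<NA>` is an ordinary string value; the function name is hypothetical and the second station row is made up for contrast.

```python
from collections import Counter


def duplicate_rows(rows):
    """Return (row, occurrences) pairs for rows seen more than once, most frequent first."""
    counts = Counter(rows)
    repeated = [(row, count) for row, count in counts.items() if count > 1]
    return sorted(repeated, key=lambda pair: pair[1], reverse=True)


na_row = ("문화공원",) + ("<NA>",) * 8
other_row = ("예시역",) + ("1",) * 8  # hypothetical non-duplicated row
for row, count in duplicate_rows([na_row, other_row, na_row]):
    print(row[0], count)
```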