Overview

Dataset statistics

Number of variables5
Number of observations205
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.3 KiB
Average record size in memory41.6 B

Variable types

Numeric1
Categorical1
Text2
DateTime1

Dataset

Description파일 다운로드
Author서울교통공사
URLhttps://data.seoul.go.kr/dataList/OA-13235/F/1/datasetView.do

Alerts

연번 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
등록번호 has unique valuesUnique

Reproduction

Analysis started2024-04-29 16:47:53.588773
Analysis finished2024-04-29 16:47:54.126139
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct205
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean103
Minimum1
Maximum205
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2024-04-30T01:47:54.225515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile11.2
Q152
median103
Q3154
95-th percentile194.8
Maximum205
Range204
Interquartile range (IQR)102

Descriptive statistics

Standard deviation59.322565
Coefficient of variation (CV)0.57594723
Kurtosis-1.2
Mean103
Median Absolute Deviation (MAD)51
Skewness0
Sum21115
Variance3519.1667
MonotonicityStrictly increasing
2024-04-30T01:47:54.362143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.5%
142 1
 
0.5%
132 1
 
0.5%
133 1
 
0.5%
134 1
 
0.5%
135 1
 
0.5%
136 1
 
0.5%
137 1
 
0.5%
138 1
 
0.5%
139 1
 
0.5%
Other values (195) 195
95.1%
ValueCountFrequency (%)
1 1
0.5%
2 1
0.5%
3 1
0.5%
4 1
0.5%
5 1
0.5%
6 1
0.5%
7 1
0.5%
8 1
0.5%
9 1
0.5%
10 1
0.5%
ValueCountFrequency (%)
205 1
0.5%
204 1
0.5%
203 1
0.5%
202 1
0.5%
201 1
0.5%
200 1
0.5%
199 1
0.5%
198 1
0.5%
197 1
0.5%
196 1
0.5%

구분
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
특허
88 
상표
70 
서비스표
29 
디자인
15 
업무표장
 
2

Length

Max length4
Median length2
Mean length2.3853659
Min length2

Unique

Unique1 ?
Unique (%)0.5%

Sample

1st row특허
2nd row특허
3rd row특허
4th row특허
5th row특허

Common Values

ValueCountFrequency (%)
특허 88
42.9%
상표 70
34.1%
서비스표 29
 
14.1%
디자인 15
 
7.3%
업무표장 2
 
1.0%
실용신안 1
 
0.5%

Length

2024-04-30T01:47:54.523430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T01:47:54.636885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
특허 88
42.9%
상표 70
34.1%
서비스표 29
 
14.1%
디자인 15
 
7.3%
업무표장 2
 
1.0%
실용신안 1
 
0.5%
Distinct201
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2024-04-30T01:47:54.909970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length37
Mean length19.717073
Min length4

Characters and Unicode

Total characters4042
Distinct characters315
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique197 ?
Unique (%)96.1%

Sample

1st row레일의 체결구조
2nd row철도 레일 콘크리트도상 궤도구조 및 그 시공방법
3rd row지하철 역사 양방향 비상 게이트
4th row스크린도어용 헤드박스의 보강구조
5th row스크린도어용 조립식 수직 포스트
ValueCountFrequency (%)
32
 
3.6%
31
 
3.5%
27
 
3.0%
bi 26
 
2.9%
시스템 25
 
2.8%
방법 17
 
1.9%
seoul 16
 
1.8%
이용한 16
 
1.8%
장치 15
 
1.7%
39류 13
 
1.4%
Other values (411) 680
75.7%
2024-04-30T01:47:55.348217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
701
 
17.3%
137
 
3.4%
81
 
2.0%
69
 
1.7%
) 65
 
1.6%
( 65
 
1.6%
3 63
 
1.6%
61
 
1.5%
59
 
1.5%
56
 
1.4%
Other values (305) 2685
66.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2343
58.0%
Space Separator 701
 
17.3%
Decimal Number 300
 
7.4%
Uppercase Letter 252
 
6.2%
Lowercase Letter 194
 
4.8%
Close Punctuation 99
 
2.4%
Open Punctuation 99
 
2.4%
Other Punctuation 50
 
1.2%
Dash Punctuation 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
137
 
5.8%
81
 
3.5%
69
 
2.9%
61
 
2.6%
59
 
2.5%
56
 
2.4%
56
 
2.4%
53
 
2.3%
52
 
2.2%
51
 
2.2%
Other values (249) 1668
71.2%
Uppercase Letter
ValueCountFrequency (%)
S 47
18.7%
M 30
11.9%
T 30
11.9%
B 28
11.1%
I 28
11.1%
R 14
 
5.6%
O 13
 
5.2%
E 12
 
4.8%
L 10
 
4.0%
C 9
 
3.6%
Other values (8) 31
12.3%
Lowercase Letter
ValueCountFrequency (%)
e 44
22.7%
o 33
17.0%
t 17
 
8.8%
r 17
 
8.8%
i 14
 
7.2%
l 14
 
7.2%
u 12
 
6.2%
v 8
 
4.1%
y 8
 
4.1%
c 7
 
3.6%
Other values (6) 20
10.3%
Decimal Number
ValueCountFrequency (%)
3 63
21.0%
1 32
10.7%
2 31
10.3%
9 31
10.3%
7 30
10.0%
6 30
10.0%
5 25
 
8.3%
0 23
 
7.7%
4 19
 
6.3%
8 15
 
5.0%
Other Punctuation
ValueCountFrequency (%)
, 27
54.0%
/ 8
 
16.0%
: 8
 
16.0%
? 5
 
10.0%
· 2
 
4.0%
Close Punctuation
ValueCountFrequency (%)
) 65
65.7%
] 34
34.3%
Open Punctuation
ValueCountFrequency (%)
( 65
65.7%
[ 34
34.3%
Space Separator
ValueCountFrequency (%)
701
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2343
58.0%
Common 1253
31.0%
Latin 446
 
11.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
137
 
5.8%
81
 
3.5%
69
 
2.9%
61
 
2.6%
59
 
2.5%
56
 
2.4%
56
 
2.4%
53
 
2.3%
52
 
2.2%
51
 
2.2%
Other values (249) 1668
71.2%
Latin
ValueCountFrequency (%)
S 47
 
10.5%
e 44
 
9.9%
o 33
 
7.4%
M 30
 
6.7%
T 30
 
6.7%
B 28
 
6.3%
I 28
 
6.3%
t 17
 
3.8%
r 17
 
3.8%
i 14
 
3.1%
Other values (24) 158
35.4%
Common
ValueCountFrequency (%)
701
55.9%
) 65
 
5.2%
( 65
 
5.2%
3 63
 
5.0%
[ 34
 
2.7%
] 34
 
2.7%
1 32
 
2.6%
2 31
 
2.5%
9 31
 
2.5%
7 30
 
2.4%
Other values (12) 167
 
13.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2343
58.0%
ASCII 1696
42.0%
None 3
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
701
41.3%
) 65
 
3.8%
( 65
 
3.8%
3 63
 
3.7%
S 47
 
2.8%
e 44
 
2.6%
[ 34
 
2.0%
] 34
 
2.0%
o 33
 
1.9%
1 32
 
1.9%
Other values (44) 578
34.1%
Hangul
ValueCountFrequency (%)
137
 
5.8%
81
 
3.5%
69
 
2.9%
61
 
2.6%
59
 
2.5%
56
 
2.4%
56
 
2.4%
53
 
2.3%
52
 
2.2%
51
 
2.2%
Other values (249) 1668
71.2%
None
ValueCountFrequency (%)
· 2
66.7%
1
33.3%
Distinct121
Distinct (%)59.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
Minimum1995-12-29 00:00:00
Maximum2023-04-20 00:00:00
2024-04-30T01:47:55.499579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T01:47:55.627393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

등록번호
Text

UNIQUE 

Distinct205
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2024-04-30T01:47:55.788321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length16
Mean length15.970732
Min length10

Characters and Unicode

Total characters3274
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique205 ?
Unique (%)100.0%

Sample

1st row10-0474255-00-00
2nd row10-0595429-00-00
3rd row10-0756641-00-00
4th row10-0826234-00-00
5th row10-0839484-00-00
ValueCountFrequency (%)
10-0474255-00-00 1
 
0.5%
41-0132220-00-00 1
 
0.5%
41-0132222-00-00 1
 
0.5%
41-0132223-00-00 1
 
0.5%
41-0136992-00-00 1
 
0.5%
41-0132224-00-00 1
 
0.5%
41-0132225-00-00 1
 
0.5%
41-0132226-00-00 1
 
0.5%
41-0132227-00-00 1
 
0.5%
41-0170756-00-00 1
 
0.5%
Other values (195) 195
95.1%
2024-04-30T01:47:56.059623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1172
35.8%
- 613
18.7%
1 383
 
11.7%
4 221
 
6.8%
2 175
 
5.3%
5 138
 
4.2%
7 123
 
3.8%
6 122
 
3.7%
3 114
 
3.5%
8 108
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2661
81.3%
Dash Punctuation 613
 
18.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1172
44.0%
1 383
 
14.4%
4 221
 
8.3%
2 175
 
6.6%
5 138
 
5.2%
7 123
 
4.6%
6 122
 
4.6%
3 114
 
4.3%
8 108
 
4.1%
9 105
 
3.9%
Dash Punctuation
ValueCountFrequency (%)
- 613
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3274
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1172
35.8%
- 613
18.7%
1 383
 
11.7%
4 221
 
6.8%
2 175
 
5.3%
5 138
 
4.2%
7 123
 
3.8%
6 122
 
3.7%
3 114
 
3.5%
8 108
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3274
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1172
35.8%
- 613
18.7%
1 383
 
11.7%
4 221
 
6.8%
2 175
 
5.3%
5 138
 
4.2%
7 123
 
3.8%
6 122
 
3.7%
3 114
 
3.5%
8 108
 
3.3%

Interactions

2024-04-30T01:47:53.801986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-30T01:47:56.140476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구분
연번1.0000.836
구분0.8361.000
2024-04-30T01:47:56.207286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구분
연번1.0000.645
구분0.6451.000

Missing values

2024-04-30T01:47:53.928870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T01:47:54.072135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번구분발명의 명칭등록일자등록번호
01특허레일의 체결구조2005-02-2210-0474255-00-00
12특허철도 레일 콘크리트도상 궤도구조 및 그 시공방법2006-06-2310-0595429-00-00
23특허지하철 역사 양방향 비상 게이트2007-09-0310-0756641-00-00
34특허스크린도어용 헤드박스의 보강구조2008-04-2310-0826234-00-00
45특허스크린도어용 조립식 수직 포스트2008-06-1210-0839484-00-00
56특허PSD조립체 모듈2009-08-1010-0912491-00-00
67특허슬림형 자동집·개표기2010-04-2710-0956175-00-00
78특허계통설비별 색상정보를 이용한 변전소 모니터링 시스템2011-01-1110-1008956-00-00
89특허이동식 안내로봇 및 그 시스템2011-01-2610-1012288-00-00
910특허실시간 장애정보 수집 및 교통카드시스템 운영상황 디스플레이방식의 교통카드 원격정비시스템2011-08-0910-1057126-00-00
연번구분발명의 명칭등록일자등록번호
195196상표또타러기지 39류2020-09-1640-1643927-00-00
196197상표T Luggage 또타러기지 39류2020-09-1640-1643928-00-00
197198상표또타딜리버리 35류2021-01-2640-1686718-00-00
198199상표또타딜리버리 39류2020-09-1640-1643929-00-00
199200상표T Delivery 또타딜리버리 35류2021-01-2640-1686719-00-00
200201상표T Delivery 또타딜리버리 39류2020-09-1640-1643930-00-00
201202상표또타픽업 35류2021-01-2640-1686720-00-00
202203상표또타픽업 39류2020-09-1640-1643931-00-00
203204상표T Pick Up 또타픽업 35류2021-01-2640-1686721-00-00
204205상표T Pick Up 또타픽업 39류2020-09-1640-1643932-00-00