Overview

Dataset statistics

Number of variables5
Number of observations242
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.8 KiB
Average record size in memory41.5 B

Variable types

Numeric1
Categorical1
Text2
DateTime1

Dataset

Description파일 다운로드
Author서울교통공사
URLhttps://data.seoul.go.kr/dataList/OA-13235/F/1/datasetView.do

Alerts

연번 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
등록번호 has unique valuesUnique

Reproduction

Analysis started2024-04-29 16:47:49.946572
Analysis finished2024-04-29 16:47:50.341395
Duration0.39 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct242
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean121.5
Minimum1
Maximum242
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2024-04-30T01:47:50.404275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile13.05
Q161.25
median121.5
Q3181.75
95-th percentile229.95
Maximum242
Range241
Interquartile range (IQR)120.5

Descriptive statistics

Standard deviation70.003571
Coefficient of variation (CV)0.57616108
Kurtosis-1.2
Mean121.5
Median Absolute Deviation (MAD)60.5
Skewness0
Sum29403
Variance4900.5
MonotonicityStrictly increasing
2024-04-30T01:47:50.806616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
183 1
 
0.4%
155 1
 
0.4%
156 1
 
0.4%
157 1
 
0.4%
158 1
 
0.4%
159 1
 
0.4%
160 1
 
0.4%
161 1
 
0.4%
162 1
 
0.4%
Other values (232) 232
95.9%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
242 1
0.4%
241 1
0.4%
240 1
0.4%
239 1
0.4%
238 1
0.4%
237 1
0.4%
236 1
0.4%
235 1
0.4%
234 1
0.4%
233 1
0.4%

구분
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
특허
114 
상표
75 
서비스표
24 
디자인
18 
실용신안
 
9

Length

Max length4
Median length2
Mean length2.3636364
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row특허
2nd row특허
3rd row특허
4th row특허
5th row특허

Common Values

ValueCountFrequency (%)
특허 114
47.1%
상표 75
31.0%
서비스표 24
 
9.9%
디자인 18
 
7.4%
실용신안 9
 
3.7%
업무표장 2
 
0.8%

Length

2024-04-30T01:47:50.965736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T01:47:51.069914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
특허 114
47.1%
상표 75
31.0%
서비스표 24
 
9.9%
디자인 18
 
7.4%
실용신안 9
 
3.7%
업무표장 2
 
0.8%
Distinct236
Distinct (%)97.5%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2024-04-30T01:47:51.335710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length39
Mean length19.53719
Min length4

Characters and Unicode

Total characters4728
Distinct characters357
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique231 ?
Unique (%)95.5%

Sample

1st row레일의 체결구조
2nd row철도 레일 콘크리트도상 궤도구조 및 그 시공방법
3rd row지하철 역사 양방향 비상 게이트
4th row스크린도어용 헤드박스의 보강구조
5th row스크린도어용 조립식 수직 포스트
ValueCountFrequency (%)
45
 
4.2%
31
 
2.9%
시스템 31
 
2.9%
27
 
2.5%
bi 26
 
2.4%
방법 22
 
2.0%
이용한 21
 
1.9%
seoul 16
 
1.5%
장치 16
 
1.5%
39류 13
 
1.2%
Other values (526) 831
77.0%
2024-04-30T01:47:51.751101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
837
 
17.7%
138
 
2.9%
86
 
1.8%
79
 
1.7%
71
 
1.5%
69
 
1.5%
66
 
1.4%
66
 
1.4%
65
 
1.4%
) 65
 
1.4%
Other values (347) 3186
67.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2860
60.5%
Space Separator 837
 
17.7%
Decimal Number 300
 
6.3%
Uppercase Letter 267
 
5.6%
Lowercase Letter 216
 
4.6%
Close Punctuation 99
 
2.1%
Open Punctuation 99
 
2.1%
Other Punctuation 46
 
1.0%
Dash Punctuation 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
138
 
4.8%
86
 
3.0%
79
 
2.8%
71
 
2.5%
69
 
2.4%
66
 
2.3%
66
 
2.3%
65
 
2.3%
65
 
2.3%
60
 
2.1%
Other values (286) 2095
73.3%
Uppercase Letter
ValueCountFrequency (%)
S 49
18.4%
T 31
11.6%
M 30
11.2%
I 29
10.9%
B 28
10.5%
R 14
 
5.2%
O 13
 
4.9%
E 12
 
4.5%
L 10
 
3.7%
C 10
 
3.7%
Other values (11) 41
15.4%
Lowercase Letter
ValueCountFrequency (%)
e 45
20.8%
o 34
15.7%
t 20
9.3%
r 18
 
8.3%
i 17
 
7.9%
l 16
 
7.4%
u 13
 
6.0%
y 9
 
4.2%
c 8
 
3.7%
v 8
 
3.7%
Other values (9) 28
13.0%
Decimal Number
ValueCountFrequency (%)
3 63
21.0%
1 32
10.7%
2 31
10.3%
9 31
10.3%
6 30
10.0%
7 30
10.0%
5 25
 
8.3%
0 23
 
7.7%
4 19
 
6.3%
8 15
 
5.0%
Other Punctuation
ValueCountFrequency (%)
, 28
60.9%
/ 8
 
17.4%
: 8
 
17.4%
· 2
 
4.3%
Close Punctuation
ValueCountFrequency (%)
) 65
65.7%
] 34
34.3%
Open Punctuation
ValueCountFrequency (%)
( 65
65.7%
[ 34
34.3%
Space Separator
ValueCountFrequency (%)
837
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2860
60.5%
Common 1385
29.3%
Latin 483
 
10.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
138
 
4.8%
86
 
3.0%
79
 
2.8%
71
 
2.5%
69
 
2.4%
66
 
2.3%
66
 
2.3%
65
 
2.3%
65
 
2.3%
60
 
2.1%
Other values (286) 2095
73.3%
Latin
ValueCountFrequency (%)
S 49
 
10.1%
e 45
 
9.3%
o 34
 
7.0%
T 31
 
6.4%
M 30
 
6.2%
I 29
 
6.0%
B 28
 
5.8%
t 20
 
4.1%
r 18
 
3.7%
i 17
 
3.5%
Other values (30) 182
37.7%
Common
ValueCountFrequency (%)
837
60.4%
) 65
 
4.7%
( 65
 
4.7%
3 63
 
4.5%
[ 34
 
2.5%
] 34
 
2.5%
1 32
 
2.3%
2 31
 
2.2%
9 31
 
2.2%
6 30
 
2.2%
Other values (11) 163
 
11.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2860
60.5%
ASCII 1863
39.4%
None 5
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
837
44.9%
) 65
 
3.5%
( 65
 
3.5%
3 63
 
3.4%
S 49
 
2.6%
e 45
 
2.4%
[ 34
 
1.8%
] 34
 
1.8%
o 34
 
1.8%
1 32
 
1.7%
Other values (47) 605
32.5%
Hangul
ValueCountFrequency (%)
138
 
4.8%
86
 
3.0%
79
 
2.8%
71
 
2.5%
69
 
2.4%
66
 
2.3%
66
 
2.3%
65
 
2.3%
65
 
2.3%
60
 
2.1%
Other values (286) 2095
73.3%
None
ValueCountFrequency (%)
· 2
40.0%
1
20.0%
1
20.0%
1
20.0%

등록번호
Text

UNIQUE 

Distinct242
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2024-04-30T01:47:51.933281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length16
Mean length16
Min length16

Characters and Unicode

Total characters3872
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique242 ?
Unique (%)100.0%

Sample

1st row10-0474255-00-00
2nd row10-0595429-00-00
3rd row10-0756641-00-00
4th row10-0826234-00-00
5th row10-0839484-00-00
ValueCountFrequency (%)
10-0474255-00-00 1
 
0.4%
41-0212726-00-00 1
 
0.4%
41-0125784-00-00 1
 
0.4%
41-0132219-00-00 1
 
0.4%
41-0125785-00-00 1
 
0.4%
41-0125786-00-00 1
 
0.4%
41-0125787-00-00 1
 
0.4%
41-0125788-00-00 1
 
0.4%
41-0125789-00-00 1
 
0.4%
40-0660797-00-00 1
 
0.4%
Other values (232) 232
95.9%
2024-04-30T01:47:52.241350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1403
36.2%
- 726
18.8%
1 466
 
12.0%
4 242
 
6.2%
2 209
 
5.4%
5 150
 
3.9%
7 147
 
3.8%
6 145
 
3.7%
3 136
 
3.5%
8 125
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3146
81.2%
Dash Punctuation 726
 
18.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1403
44.6%
1 466
 
14.8%
4 242
 
7.7%
2 209
 
6.6%
5 150
 
4.8%
7 147
 
4.7%
6 145
 
4.6%
3 136
 
4.3%
8 125
 
4.0%
9 123
 
3.9%
Dash Punctuation
ValueCountFrequency (%)
- 726
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3872
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1403
36.2%
- 726
18.8%
1 466
 
12.0%
4 242
 
6.2%
2 209
 
5.4%
5 150
 
3.9%
7 147
 
3.8%
6 145
 
3.7%
3 136
 
3.5%
8 125
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3872
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1403
36.2%
- 726
18.8%
1 466
 
12.0%
4 242
 
6.2%
2 209
 
5.4%
5 150
 
3.9%
7 147
 
3.8%
6 145
 
3.7%
3 136
 
3.5%
8 125
 
3.2%
Distinct155
Distinct (%)64.0%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
Minimum1995-12-29 00:00:00
Maximum2021-06-02 00:00:00
2024-04-30T01:47:52.367863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T01:47:52.509023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2024-04-30T01:47:50.145039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-30T01:47:52.602562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구분
연번1.0000.858
구분0.8581.000
2024-04-30T01:47:52.677917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구분
연번1.0000.672
구분0.6721.000

Missing values

2024-04-30T01:47:50.234646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T01:47:50.311132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번구분발명의명칭등록번호등록일자
01특허레일의 체결구조10-0474255-00-002005-02-22
12특허철도 레일 콘크리트도상 궤도구조 및 그 시공방법10-0595429-00-002006-06-23
23특허지하철 역사 양방향 비상 게이트10-0756641-00-002007-09-03
34특허스크린도어용 헤드박스의 보강구조10-0826234-00-002008-04-23
45특허스크린도어용 조립식 수직 포스트10-0839484-00-002008-06-12
56특허PSD조립체 모듈10-0912491-00-002009-08-10
67특허ADU시뮬레이터(Aspect Display Unit Simulator)10-0932003-00-002009-12-07
78특허슬림형 자동집·개표기10-0956175-00-002010-04-27
89특허프리캐스트구조물 및 이를 이용한 지중 승강 및 연결통로 터널구조물10-0990429-00-002010-10-21
910특허계통설비별 색상정보를 이용한 변전소 모니터링 시스템10-1008956-00-002011-01-11
연번구분발명의명칭등록번호등록일자
232233상표또타러기지 39류40-1643927-00-002020-09-16
233234상표T Luggage 또타러기지 39류40-1643928-00-002020-09-16
234235상표또타딜리버리 35류40-1686718-00-002021-01-26
235236상표또타딜리버리 39류40-1643929-00-002020-09-16
236237상표T Delivery 또타딜리버리 35류40-1686719-00-002021-01-26
237238상표T Delivery 또타딜리버리 39류40-1643930-00-002020-09-16
238239상표또타픽업 35류40-1686720-00-002021-01-26
239240상표또타픽업 39류40-1643931-00-002020-09-16
240241상표T Pick Up 또타픽업 35류40-1686721-00-002021-01-26
241242상표T Pick Up 또타픽업 39류40-1643932-00-002020-09-16