Overview

Dataset statistics

Number of variables4
Number of observations2607
Missing cells5
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory81.6 KiB
Average record size in memory32.1 B

Variable types

Categorical2
Text2

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15249/F/1/datasetView.do

Alerts

대여소별 대여내역 is highly overall correlated with Unnamed: 1High correlation
Unnamed: 1 is highly overall correlated with 대여소별 대여내역High correlation

Reproduction

Analysis started2024-03-13 09:54:32.876628
Analysis finished2024-03-13 09:54:33.536381
Duration0.66 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

대여소별 대여내역
Categorical

HIGH CORRELATION 

Distinct30
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size20.5 KiB
송파구
207 
강서구
 
179
강남구
 
149
영등포구
 
141
서초구
 
141
Other values (25)
1790 

Length

Max length7
Median length3
Mean length3.0847718
Min length2

Unique

Unique5 ?
Unique (%)0.2%

Sample

1st row<NA>
2nd row구분
3rd row조회구분
4th row총 2602건
5th row대여소 그룹

Common Values

ValueCountFrequency (%)
송파구 207
 
7.9%
강서구 179
 
6.9%
강남구 149
 
5.7%
영등포구 141
 
5.4%
서초구 141
 
5.4%
노원구 124
 
4.8%
마포구 121
 
4.6%
강동구 117
 
4.5%
양천구 105
 
4.0%
종로구 104
 
4.0%
Other values (20) 1219
46.8%

Length

2024-03-13T18:54:33.595911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
송파구 207
 
7.9%
강서구 179
 
6.9%
강남구 149
 
5.7%
영등포구 141
 
5.4%
서초구 141
 
5.4%
노원구 124
 
4.8%
마포구 121
 
4.6%
강동구 117
 
4.5%
양천구 105
 
4.0%
종로구 104
 
4.0%
Other values (22) 1221
46.8%

Unnamed: 1
Categorical

HIGH CORRELATION 

Distinct40
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size20.5 KiB
상암1팀
 
150
학여울2팀
 
125
천호1팀
 
121
천호2팀
 
115
학여울1팀
 
95
Other values (35)
2001 

Length

Max length6
Median length4
Mean length4.2696586
Min length1

Unique

Unique4 ?
Unique (%)0.2%

Sample

1st row<NA>
2nd row
3rd row대여소별
4th row<NA>
5th row팀명

Common Values

ValueCountFrequency (%)
상암1팀 150
 
5.8%
학여울2팀 125
 
4.8%
천호1팀 121
 
4.6%
천호2팀 115
 
4.4%
학여울1팀 95
 
3.6%
중랑3팀 94
 
3.6%
영남3팀 87
 
3.3%
중랑2팀 87
 
3.3%
중랑1팀 85
 
3.3%
이수3팀 84
 
3.2%
Other values (30) 1564
60.0%

Length

2024-03-13T18:54:34.987512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
상암1팀 150
 
5.6%
학여울2팀 125
 
4.7%
천호1팀 121
 
4.6%
천호2팀 115
 
4.3%
학여울1팀 95
 
3.6%
중랑3팀 94
 
3.5%
영남3팀 87
 
3.3%
중랑2팀 87
 
3.3%
중랑1팀 85
 
3.2%
이수3팀 84
 
3.2%
Other values (31) 1615
60.8%
Distinct2605
Distinct (%)100.0%
Missing2
Missing (%)0.1%
Memory size20.5 KiB
2024-03-13T18:54:35.277711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length32
Mean length15.566219
Min length3

Characters and Unicode

Total characters40550
Distinct characters588
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2605 ?
Unique (%)100.0%

Sample

1st row일/월
2nd row대여소
3rd row대여소 명
4th row2391. 구룡터널 입구(개포1단지아파트)
5th row9980. 에이텍
ValueCountFrequency (%)
683
 
9.0%
출구 104
 
1.4%
96
 
1.3%
입구 70
 
0.9%
교차로 62
 
0.8%
1번출구 61
 
0.8%
사거리 57
 
0.8%
2번출구 47
 
0.6%
3번출구 44
 
0.6%
43
 
0.6%
Other values (5195) 6306
83.3%
2024-03-13T18:54:35.689387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4968
 
12.3%
. 2612
 
6.4%
1 1967
 
4.9%
2 1595
 
3.9%
3 1306
 
3.2%
4 1280
 
3.2%
5 952
 
2.3%
0 932
 
2.3%
6 897
 
2.2%
7 811
 
2.0%
Other values (578) 23230
57.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20947
51.7%
Decimal Number 11134
27.5%
Space Separator 4968
 
12.3%
Other Punctuation 2645
 
6.5%
Uppercase Letter 333
 
0.8%
Open Punctuation 225
 
0.6%
Close Punctuation 225
 
0.6%
Lowercase Letter 46
 
0.1%
Dash Punctuation 18
 
< 0.1%
Math Symbol 4
 
< 0.1%
Other values (3) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
810
 
3.9%
792
 
3.8%
580
 
2.8%
579
 
2.8%
503
 
2.4%
490
 
2.3%
436
 
2.1%
382
 
1.8%
371
 
1.8%
349
 
1.7%
Other values (513) 15655
74.7%
Uppercase Letter
ValueCountFrequency (%)
K 41
12.3%
S 39
11.7%
T 33
9.9%
C 30
 
9.0%
A 22
 
6.6%
D 21
 
6.3%
G 20
 
6.0%
P 17
 
5.1%
M 17
 
5.1%
B 17
 
5.1%
Other values (13) 76
22.8%
Lowercase Letter
ValueCountFrequency (%)
e 15
32.6%
k 7
15.2%
s 7
15.2%
t 3
 
6.5%
l 2
 
4.3%
n 2
 
4.3%
v 1
 
2.2%
y 1
 
2.2%
a 1
 
2.2%
g 1
 
2.2%
Other values (6) 6
 
13.0%
Decimal Number
ValueCountFrequency (%)
1 1967
17.7%
2 1595
14.3%
3 1306
11.7%
4 1280
11.5%
5 952
8.6%
0 932
8.4%
6 897
8.1%
7 811
7.3%
8 757
 
6.8%
9 637
 
5.7%
Other Punctuation
ValueCountFrequency (%)
. 2612
98.8%
, 18
 
0.7%
& 7
 
0.3%
· 4
 
0.2%
? 3
 
0.1%
/ 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 3
75.0%
+ 1
 
25.0%
Other Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
4968
100.0%
Open Punctuation
ValueCountFrequency (%)
( 225
100.0%
Close Punctuation
ValueCountFrequency (%)
) 225
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20948
51.7%
Common 19223
47.4%
Latin 379
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
810
 
3.9%
792
 
3.8%
580
 
2.8%
579
 
2.8%
503
 
2.4%
490
 
2.3%
436
 
2.1%
382
 
1.8%
371
 
1.8%
349
 
1.7%
Other values (514) 15656
74.7%
Latin
ValueCountFrequency (%)
K 41
 
10.8%
S 39
 
10.3%
T 33
 
8.7%
C 30
 
7.9%
A 22
 
5.8%
D 21
 
5.5%
G 20
 
5.3%
P 17
 
4.5%
M 17
 
4.5%
B 17
 
4.5%
Other values (29) 122
32.2%
Common
ValueCountFrequency (%)
4968
25.8%
. 2612
13.6%
1 1967
 
10.2%
2 1595
 
8.3%
3 1306
 
6.8%
4 1280
 
6.7%
5 952
 
5.0%
0 932
 
4.8%
6 897
 
4.7%
7 811
 
4.2%
Other values (15) 1903
 
9.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20947
51.7%
ASCII 19596
48.3%
None 5
 
< 0.1%
Enclosed Alphanum 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4968
25.4%
. 2612
13.3%
1 1967
 
10.0%
2 1595
 
8.1%
3 1306
 
6.7%
4 1280
 
6.5%
5 952
 
4.9%
0 932
 
4.8%
6 897
 
4.6%
7 811
 
4.1%
Other values (51) 2276
11.6%
Hangul
ValueCountFrequency (%)
810
 
3.9%
792
 
3.8%
580
 
2.8%
579
 
2.8%
503
 
2.4%
490
 
2.3%
436
 
2.1%
382
 
1.8%
371
 
1.8%
349
 
1.7%
Other values (513) 15655
74.7%
None
ValueCountFrequency (%)
· 4
80.0%
1
 
20.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct1781
Distinct (%)68.4%
Missing3
Missing (%)0.1%
Memory size20.5 KiB
2024-03-13T18:54:36.066040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length4
Mean length3.5821813
Min length1

Characters and Unicode

Total characters9328
Distinct characters16
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1198 ?
Unique (%)46.0%

Sample

1st row202204 ~ 202204
2nd row대여 건수
3rd row7
4th row21
5th row24
ValueCountFrequency (%)
1117 6
 
0.2%
583 6
 
0.2%
972 5
 
0.2%
726 5
 
0.2%
769 5
 
0.2%
266 5
 
0.2%
488 5
 
0.2%
1051 5
 
0.2%
251 5
 
0.2%
994 5
 
0.2%
Other values (1773) 2555
98.0%
2024-03-13T18:54:36.554068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 1663
17.8%
2 1184
12.7%
3 978
10.5%
4 891
9.6%
5 826
8.9%
6 791
8.5%
7 771
8.3%
9 751
8.1%
0 745
8.0%
8 720
7.7%
Other values (6) 8
 
0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 9320
99.9%
Other Letter 4
 
< 0.1%
Space Separator 3
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 1663
17.8%
2 1184
12.7%
3 978
10.5%
4 891
9.6%
5 826
8.9%
6 791
8.5%
7 771
8.3%
9 751
8.1%
0 745
8.0%
8 720
7.7%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Space Separator
ValueCountFrequency (%)
3
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 9324
> 99.9%
Hangul 4
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
1 1663
17.8%
2 1184
12.7%
3 978
10.5%
4 891
9.6%
5 826
8.9%
6 791
8.5%
7 771
8.3%
9 751
8.1%
0 745
8.0%
8 720
7.7%
Other values (2) 4
 
< 0.1%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9324
> 99.9%
Hangul 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 1663
17.8%
2 1184
12.7%
3 978
10.5%
4 891
9.6%
5 826
8.9%
6 791
8.5%
7 771
8.3%
9 751
8.1%
0 745
8.0%
8 720
7.7%
Other values (2) 4
 
< 0.1%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Correlations

2024-03-13T18:54:36.643566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대여소별 대여내역Unnamed: 1
대여소별 대여내역1.0000.996
Unnamed: 10.9961.000
2024-03-13T18:54:36.717094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대여소별 대여내역Unnamed: 1
대여소별 대여내역1.0000.906
Unnamed: 10.9061.000
2024-03-13T18:54:36.808199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대여소별 대여내역Unnamed: 1
대여소별 대여내역1.0000.906
Unnamed: 10.9061.000

Missing values

2024-03-13T18:54:33.307480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-13T18:54:33.383428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-13T18:54:33.475205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

대여소별 대여내역Unnamed: 1Unnamed: 2Unnamed: 3
0<NA><NA><NA><NA>
1구분일/월202204 ~ 202204
2조회구분대여소별대여소<NA>
3총 2602건<NA><NA><NA>
4대여소 그룹팀명대여소 명대여 건수
5강남구학여울1팀2391. 구룡터널 입구(개포1단지아파트)7
6성동구테스트9980. 에이텍21
7서초구이수1팀4314. 탑성마을 버스정거장 옆24
8서초구이수1팀4322.서울추모공원 입구25
9서초구이수1팀2286. 탑성마을입구32
대여소별 대여내역Unnamed: 1Unnamed: 2Unnamed: 3
2597송파구천호2팀2622. 올림픽공원역 3번출구9044
2598관악구영남2팀2177. 신대방역 2번 출구9150
2599영등포구영남1팀272. 당산육갑문9355
2600영등포구영남3팀210. IFC몰10877
2601관악구영남2팀2102. 봉림교 교통섬11060
2602송파구천호2팀1210. 롯데월드타워(잠실역2번출구 쪽)11117
2603강서구개화1팀2715.마곡나루역 2번 출구12962
2604마포구상암1팀4217. 한강공원 망원나들목17282
2605광진구중랑1팀502. 뚝섬유원지역 1번출구 앞18598
2606영등포구영남3팀207. 여의나루역 1번출구 앞20291