Overview

Dataset statistics

Number of variables: 4
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 410.2 KiB
Average record size in memory: 42.0 B
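
The average record size can be checked from the two figures above it: total size in memory divided by the number of observations. A minimal sketch, assuming the report's KiB means 1024 bytes (the function and variable names are illustrative, not part of the report):

```python
# Values copied from the Dataset statistics table.
total_size_kib = 410.2
n_observations = 10_000

# Assumption: 1 KiB = 1024 bytes.
BYTES_PER_KIB = 1024


def average_record_size(total_kib: float, n_rows: int) -> float:
    """Return the mean number of bytes held in memory per row."""
    return total_kib * BYTES_PER_KIB / n_rows


avg_bytes = average_record_size(total_size_kib, n_observations)
print(f"{avg_bytes:.1f} B")
```

Under that assumption the result rounds to the 42.0 B shown in the table.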

Variable types

Categorical: 1
Text: 1
Numeric: 2

Dataset

Description: File download
Author: Seoul Metropolitan Government (서울특별시)
URL: https://data.seoul.go.kr/dataList/OA-15249/F/1/datasetView.do

Reproduction

Analysis started: 2024-03-13 09:53:20.568603
Analysis finished: 2024-03-13 09:53:21.799380
Duration: 1.23 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
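
The Duration figure is the difference between the two timestamps above. A short sketch using only the Python standard library, assuming both timestamps are in the same time zone (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the Reproduction panel.
started = datetime.fromisoformat("2024-03-13 09:53:20.568603")
finished = datetime.fromisoformat("2024-03-13 09:53:21.799380")

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```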

Variables

대여소 그룹 (rental station group)
Categorical

Distinct: 27
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

송파구 | 644
강서구 | 618
서초구 | 556
강남구 | 554
영등포구 | 518
Other values (22) | 7110

Length

Max length: 6
Median length: 3
Mean length: 3.0903
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 성북구
2nd row: 성동구
3rd row: 강서구
4th row: 양천구
5th row: 동작구

Common Values

Value | Count | Frequency (%)
송파구 | 644 | 6.4%
강서구 | 618 | 6.2%
서초구 | 556 | 5.6%
강남구 | 554 | 5.5%
영등포구 | 518 | 5.2%
종로구 | 471 | 4.7%
마포구 | 446 | 4.5%
노원구 | 432 | 4.3%
구로구 | 395 | 4.0%
양천구 | 387 | 3.9%
Other values (17) | 4979 | 49.8%
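
Each Frequency (%) entry is the count divided by the 10000 observations, and the "Other values" row is whatever remains after the ten listed districts. A minimal sketch of that arithmetic (the helper name is hypothetical; the counts are copied from the table):

```python
# Counts copied from the Common Values table for 대여소 그룹.
top_counts = {
    "송파구": 644, "강서구": 618, "서초구": 556, "강남구": 554, "영등포구": 518,
    "종로구": 471, "마포구": 446, "노원구": 432, "구로구": 395, "양천구": 387,
}
n_observations = 10_000  # from the Dataset statistics table


def share_percent(count: int, total: int) -> float:
    """Return count as a percentage of total."""
    return 100 * count / total


# Rows not covered by the ten most common districts.
other_count = n_observations - sum(top_counts.values())

for district, count in top_counts.items():
    print(f"{district}: {share_percent(count, n_observations):.1f}%")
print(f"Other values: {other_count} "
      f"({share_percent(other_count, n_observations):.1f}%)")
```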

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
송파구 | 644 | 6.4%
강서구 | 618 | 6.2%
서초구 | 556 | 5.6%
강남구 | 554 | 5.5%
영등포구 | 518 | 5.2%
종로구 | 471 | 4.7%
마포구 | 446 | 4.5%
노원구 | 432 | 4.3%
구로구 | 395 | 3.9%
양천구 | 387 | 3.9%
Other values (18) | 4985 | 49.8%

대여소 명 (rental station name)
Text

Distinct: 2192
Distinct (%): 21.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 47
Median length: 31
Mean length: 15.3644
Min length: 3

Characters and Unicode

Total characters: 153644
Distinct characters: 564
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
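
As an illustration of how such character properties can be read, the sketch below tallies the Unicode general category of each character in one sample value using Python's standard `unicodedata` module. This is not necessarily how the report computes its figures; the label mapping and function name are assumptions made for the example.

```python
import unicodedata
from collections import Counter

# Report-style labels for the general-category codes that occur in the sample.
CATEGORY_LABELS = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
}


def category_counts(text: str) -> Counter:
    """Count characters in text by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# First sample row of the 대여소 명 column.
sample = "1302. 한성대입구역6번출구 뒤"
counts = category_counts(sample)

for code, count in counts.most_common():
    print(CATEGORY_LABELS.get(code, code), count)
```

Hangul syllables fall under "Other Letter" (Lo), which is why that category dominates the column-level table.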

Unique

Unique: 82
Unique (%): 0.8%

Sample

1st row: 1302. 한성대입구역6번출구 뒤
2nd row: 585. 성수2가1동 공영주차장 인근
3rd row: 1196. 서울식물원(문화센터) 건너편
4th row: 733. 신정이펜하우스314동
5th row: 2091.이수역9번출구(맥도날드)

Value | Count | Frequency (%)
[unreadable] | 2588 | 9.0%
[unreadable] | 430 | 1.5%
출구 | 393 | 1.4%
1번출구 | 277 | 1.0%
입구 | 267 | 0.9%
사거리 | 219 | 0.8%
교차로 | 218 | 0.8%
2번출구 | 196 | 0.7%
3번출구 | 195 | 0.7%
[unreadable] | 195 | 0.7%
Other values (4307) | 23756 | 82.7%

Most occurring characters

Value | Count | Frequency (%)
(space) | 18734 | 12.2%
. | 10005 | 6.5%
1 | 8258 | 5.4%
2 | 6753 | 4.4%
3 | 4771 | 3.1%
4 | 3539 | 2.3%
5 | 3536 | 2.3%
0 | 3420 | 2.2%
[unreadable] | 3242 | 2.1%
6 | 3149 | 2.0%
Other values (554) | 88237 | 57.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 79774 | 51.9%
Decimal Number | 41429 | 27.0%
Space Separator | 18734 | 12.2%
Other Punctuation | 10099 | 6.6%
Uppercase Letter | 1433 | 0.9%
Open Punctuation | 967 | 0.6%
Close Punctuation | 967 | 0.6%
Lowercase Letter | 138 | 0.1%
Dash Punctuation | 70 | < 0.1%
Math Symbol | 22 | < 0.1%
Other values (2) | 11 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[unreadable] | 3242 | 4.1%
[unreadable] | 3148 | 3.9%
[unreadable] | 2329 | 2.9%
[unreadable] | 2102 | 2.6%
[unreadable] | 2059 | 2.6%
[unreadable] | 2045 | 2.6%
[unreadable] | 1663 | 2.1%
[unreadable] | 1345 | 1.7%
[unreadable] | 1341 | 1.7%
[unreadable] | 1262 | 1.6%
Other values (496) | 59238 | 74.3%

Uppercase Letter
Value | Count | Frequency (%)
K | 159 | 11.1%
S | 157 | 11.0%
C | 137 | 9.6%
T | 119 | 8.3%
A | 104 | 7.3%
G | 89 | 6.2%
L | 88 | 6.1%
M | 86 | 6.0%
D | 84 | 5.9%
P | 82 | 5.7%
Other values (14) | 328 | 22.9%

Lowercase Letter
Value | Count | Frequency (%)
e | 50 | 36.2%
k | 18 | 13.0%
s | 13 | 9.4%
n | 12 | 8.7%
l | 11 | 8.0%
t | 10 | 7.2%
y | 6 | 4.3%
m | 5 | 3.6%
c | 5 | 3.6%
o | 5 | 3.6%

Decimal Number
Value | Count | Frequency (%)
1 | 8258 | 19.9%
2 | 6753 | 16.3%
3 | 4771 | 11.5%
4 | 3539 | 8.5%
5 | 3536 | 8.5%
0 | 3420 | 8.3%
6 | 3149 | 7.6%
7 | 2969 | 7.2%
8 | 2542 | 6.1%
9 | 2492 | 6.0%

Other Punctuation
Value | Count | Frequency (%)
. | 10005 | 99.1%
, | 58 | 0.6%
? | 18 | 0.2%
& | 14 | 0.1%
· | 4 | < 0.1%

Math Symbol
Value | Count | Frequency (%)
~ | 17 | 77.3%
+ | 5 | 22.7%

Space Separator
Value | Count | Frequency (%)
(space) | 18734 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 967 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 967 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 70 | 100.0%

Other Symbol
Value | Count | Frequency (%)
[unreadable] | 7 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 4 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 79781 | 51.9%
Common | 72292 | 47.1%
Latin | 1571 | 1.0%

Most frequent character per script

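Hangul
Value | Count | Frequency (%)
[unreadable] | 3242 | 4.1%
[unreadable] | 3148 | 3.9%
[unreadable] | 2329 | 2.9%
[unreadable] | 2102 | 2.6%
[unreadable] | 2059 | 2.6%
[unreadable] | 2045 | 2.6%
[unreadable] | 1663 | 2.1%
[unreadable] | 1345 | 1.7%
[unreadable] | 1341 | 1.7%
[unreadable] | 1262 | 1.6%
Other values (497) | 59245 | 74.3%

Latin
Value | Count | Frequency (%)
K | 159 | 10.1%
S | 157 | 10.0%
C | 137 | 8.7%
T | 119 | 7.6%
A | 104 | 6.6%
G | 89 | 5.7%
L | 88 | 5.6%
M | 86 | 5.5%
D | 84 | 5.3%
P | 82 | 5.2%
Other values (25) | 466 | 29.7%

Common
Value | Count | Frequency (%)
(space) | 18734 | 25.9%
. | 10005 | 13.8%
1 | 8258 | 11.4%
2 | 6753 | 9.3%
3 | 4771 | 6.6%
4 | 3539 | 4.9%
5 | 3536 | 4.9%
0 | 3420 | 4.7%
6 | 3149 | 4.4%
7 | 2969 | 4.1%
Other values (12) | 7158 | 9.9%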

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 79774 | 51.9%
ASCII | 73859 | 48.1%
None | 11 | < 0.1%

Most frequent character per block

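ASCII
Value | Count | Frequency (%)
(space) | 18734 | 25.4%
. | 10005 | 13.5%
1 | 8258 | 11.2%
2 | 6753 | 9.1%
3 | 4771 | 6.5%
4 | 3539 | 4.8%
5 | 3536 | 4.8%
0 | 3420 | 4.6%
6 | 3149 | 4.3%
7 | 2969 | 4.0%
Other values (46) | 8725 | 11.8%

Hangul
Value | Count | Frequency (%)
[unreadable] | 3242 | 4.1%
[unreadable] | 3148 | 3.9%
[unreadable] | 2329 | 2.9%
[unreadable] | 2102 | 2.6%
[unreadable] | 2059 | 2.6%
[unreadable] | 2045 | 2.6%
[unreadable] | 1663 | 2.1%
[unreadable] | 1345 | 1.7%
[unreadable] | 1341 | 1.7%
[unreadable] | 1262 | 1.6%
Other values (496) | 59238 | 74.3%

None
Value | Count | Frequency (%)
[unreadable] | 7 | 63.6%
· | 4 | 36.4%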

대여 일자 / 월 (rental date / month)
Real number (ℝ)

Distinct: 7
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 202022.89
Minimum: 202007
Maximum: 202101
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 202007
5-th percentile: 202007
Q1: 202008
Median: 202010
Q3: 202012
95-th percentile: 202101
Maximum: 202101
Range: 94
Interquartile range (IQR): 4
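
The values in this column look like year and month packed into one integer, so the Range of 94 is an arithmetic difference between two such integers rather than a number of months. A small sketch of the conversion, assuming every value is a six-digit YYYYMM integer (the helper name is hypothetical):

```python
def yyyymm_to_month_index(yyyymm: int) -> int:
    """Map a YYYYMM integer to a running month count (assumes YYYYMM encoding)."""
    year, month = divmod(yyyymm, 100)
    return year * 12 + (month - 1)


minimum, maximum = 202007, 202101  # from the Quantile statistics panel

# Difference of the raw integers, as reported in the Range row.
numeric_range = maximum - minimum

# Difference in calendar months under the YYYYMM assumption.
calendar_span_months = (
    yyyymm_to_month_index(maximum) - yyyymm_to_month_index(minimum)
)

print(numeric_range, calendar_span_months)
```

The gap between the two results comes from the jump from 202012 to 202101, which also affects the mean, variance, and skewness reported for this column.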

Descriptive statistics

Standard deviation: 32.375682
Coefficient of variation (CV): 0.00016025749
Kurtosis: 1.9867593
Mean: 202022.89
Median Absolute Deviation (MAD): 2
Skewness: 1.9918041
Sum: 2.0202289 × 10^9
Variance: 1048.1848
Monotonicity: Not monotonic
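
With only seven distinct values, the mean, variance, standard deviation, and CV can be reproduced from the value counts listed for this column. The sketch below assumes the reported variance is the sample variance (divisor n - 1); that choice is an assumption that happens to match the figures, not something the report states.

```python
import math

# Value counts for 대여 일자 / 월, copied from the report's frequency table.
counts = {
    202007: 1444, 202008: 1397, 202009: 1416, 202010: 1414,
    202011: 1424, 202012: 1442, 202101: 1463,
}

n = sum(counts.values())
total = sum(value * count for value, count in counts.items())
mean = total / n

# Sample variance (divisor n - 1), assumed to match the report's convention.
variance = sum(count * (value - mean) ** 2
               for value, count in counts.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean

print(n, total, round(mean, 2), round(variance, 4), round(std, 6))
```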
Histogram with fixed size bins (bins=7)

Value | Count | Frequency (%)
202101 | 1463 | 14.6%
202007 | 1444 | 14.4%
202012 | 1442 | 14.4%
202011 | 1424 | 14.2%
202009 | 1416 | 14.2%
202010 | 1414 | 14.1%
202008 | 1397 | 14.0%

Value | Count | Frequency (%)
202007 | 1444 | 14.4%
202008 | 1397 | 14.0%
202009 | 1416 | 14.2%
202010 | 1414 | 14.1%
202011 | 1424 | 14.2%
202012 | 1442 | 14.4%
202101 | 1463 | 14.6%

Value | Count | Frequency (%)
202101 | 1463 | 14.6%
202012 | 1442 | 14.4%
202011 | 1424 | 14.2%
202010 | 1414 | 14.1%
202009 | 1416 | 14.2%
202008 | 1397 | 14.0%
202007 | 1444 | 14.4%

대여 건수 (rental count)
Real number (ℝ)

Distinct: 2663
Distinct (%): 26.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 952.8477
Minimum: 0
Maximum: 17916
Zeros: 12
Zeros (%): 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 91
Q1: 336
Median: 676
Q3: 1245
95-th percentile: 2683.15
Maximum: 17916
Range: 17916
Interquartile range (IQR): 909

Descriptive statistics

Standard deviation: 986.82575
Coefficient of variation (CV): 1.0356595
Kurtosis: 31.431505
Mean: 952.8477
Median Absolute Deviation (MAD): 405
Skewness: 3.7678315
Sum: 9528477
Variance: 973825.05
Monotonicity: Not monotonic
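
Several of these figures are tied together by simple identities: CV is the standard deviation divided by the mean, variance is the standard deviation squared, IQR is Q3 minus Q1, and Sum is the mean times the number of observations. The sketch below checks them from the reported values. The upper fence at Q3 + 1.5 × IQR is the common Tukey rule of thumb and is added here only as an illustration; the report itself does not compute it.

```python
# Figures copied from the 대여 건수 panels.
mean = 952.8477
std = 986.82575
q1, q3 = 336, 1245
n_observations = 10_000

cv = std / mean                    # coefficient of variation
variance = std ** 2                # variance from the standard deviation
iqr = q3 - q1                      # interquartile range
total = round(mean * n_observations)  # sum of all rental counts

# Assumption: Tukey's 1.5 * IQR rule, not a value taken from the report.
upper_fence = q3 + 1.5 * iqr

print(round(cv, 7), round(variance, 2), iqr, total, upper_fence)
```

A CV above 1 together with skewness of about 3.77 points to a long right tail, consistent with a maximum of 17916 against a median of 676.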
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
370 | 19 | 0.2%
255 | 18 | 0.2%
1 | 17 | 0.2%
263 | 17 | 0.2%
252 | 16 | 0.2%
197 | 16 | 0.2%
253 | 15 | 0.1%
776 | 15 | 0.1%
375 | 15 | 0.1%
271 | 15 | 0.1%
Other values (2653) | 9837 | 98.4%

Value | Count | Frequency (%)
0 | 12 | 0.1%
1 | 17 | 0.2%
2 | 5 | 0.1%
3 | 5 | 0.1%
4 | 3 | < 0.1%
5 | 3 | < 0.1%
6 | 9 | 0.1%
7 | 4 | < 0.1%
8 | 3 | < 0.1%
9 | 5 | 0.1%

Value | Count | Frequency (%)
17916 | 1 | < 0.1%
15613 | 1 | < 0.1%
14779 | 1 | < 0.1%
12790 | 1 | < 0.1%
12696 | 1 | < 0.1%
11167 | 1 | < 0.1%
10809 | 1 | < 0.1%
10418 | 1 | < 0.1%
10264 | 1 | < 0.1%
9909 | 1 | < 0.1%

Interactions


Correlations

 | 대여소 그룹 | 대여 일자 / 월 | 대여 건수
대여소 그룹 | 1.000 | 0.000 | 0.209
대여 일자 / 월 | 0.000 | 1.000 | 0.159
대여 건수 | 0.209 | 0.159 | 1.000

 | 대여 일자 / 월 | 대여 건수 | 대여소 그룹
대여 일자 / 월 | 1.000 | -0.339 | 0.000
대여 건수 | -0.339 | 1.000 | 0.077
대여소 그룹 | 0.000 | 0.077 | 1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

(index) | 대여소 그룹 | 대여소 명 | 대여 일자 / 월 | 대여 건수
3383 | 성북구 | 1302. 한성대입구역6번출구 뒤 | 202008 | 1074
3377 | 성동구 | 585. 성수2가1동 공영주차장 인근 | 202008 | 1627
8671 | 강서구 | 1196. 서울식물원(문화센터) 건너편 | 202011 | 452
1538 | 양천구 | 733. 신정이펜하우스314동 | 202007 | 519
929 | 동작구 | 2091.이수역9번출구(맥도날드) | 202007 | 574
2144 | 강남구 | 2372. 대치역 사거리 | 202008 | 726
1054 | 서대문구 | 164. 북가좌1동 주민센터 | 202007 | 525
10715 | 강서구 | 1135. 강서구의회 | 202012 | 826
11994 | 양천구 | 720. 서울강월초등학교 앞 | 202012 | 304
4769 | 금천구 | 1804. 독산역 2번출구 자전거주차장 | 202009 | 1940

(index) | 대여소 그룹 | 대여소 명 | 대여 일자 / 월 | 대여 건수
5359 | 서초구 | 2515.서초초등학교 후문 | 202009 | 588
982 | 마포구 | 199. 서울 월드컵 경기장 | 202007 | 1027
11038 | 구로구 | 2813.항동지구 3단지 302동 앞 | 202012 | 62
5420 | 성동구 | 507. 성수아이에스비즈타워 앞 | 202009 | 1353
6201 | 중랑구 | 1428. 원묵고등학교 | 202009 | 523
2837 | 도봉구 | 1716. 하나로마트 창동점 | 202008 | 1625
7228 | 마포구 | 181. 망원초록길 입구 | 202010 | 486
13667 | 마포구 | 497.합정동주민센터 앞 | 202101 | 802
9675 | 성북구 | 1343. 한성대7번출구 앞 | 202011 | 629
9838 | 송파구 | 2644.성내5교 (GS주유소) | 202011 | 625