Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells2
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory390.6 KiB
Average record size in memory40.0 B

Variable types

Categorical2
Text2

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15249/F/1/datasetView.do

Reproduction

Analysis started2024-04-17 19:02:59.051347
Analysis finished2024-04-17 19:02:59.634736
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct28
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
송파구
754 
강서구
691 
강남구
 
590
영등포구
 
565
서초구
 
545
Other values (23)
6855 

Length

Max length6
Median length3
Mean length3.0886
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row성동구
2nd row영등포구
3rd row강서구
4th row구로구
5th row강서구

Common Values

ValueCountFrequency (%)
송파구 754
 
7.5%
강서구 691
 
6.9%
강남구 590
 
5.9%
영등포구 565
 
5.7%
서초구 545
 
5.5%
강동구 461
 
4.6%
노원구 452
 
4.5%
마포구 437
 
4.4%
종로구 421
 
4.2%
양천구 413
 
4.1%
Other values (18) 4671
46.7%

Length

2024-04-18T04:02:59.691958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
송파구 754
 
7.5%
강서구 691
 
6.9%
강남구 590
 
5.9%
영등포구 565
 
5.6%
서초구 545
 
5.4%
강동구 461
 
4.6%
노원구 452
 
4.5%
마포구 437
 
4.4%
종로구 421
 
4.2%
양천구 413
 
4.1%
Other values (20) 4676
46.7%


Text

Distinct2602
Distinct (%)26.0%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2024-04-18T04:02:59.902699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length29
Mean length15.389639
Min length3

Characters and Unicode

Total characters153881
Distinct characters599
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique113 ?
Unique (%)1.1%

Sample

1st row557. 도선동 주민센터 앞
2nd row283. 아크로타워 스퀘어(영등포시장)
3rd row1113. 서남환경공원 버스정류장
4th row2809.항동지구 11단지 1103동 앞
5th row5059. 화곡2동주민센터
ValueCountFrequency (%)
2655
 
9.2%
출구 374
 
1.3%
368
 
1.3%
입구 267
 
0.9%
1번출구 236
 
0.8%
사거리 221
 
0.8%
교차로 221
 
0.8%
2번출구 183
 
0.6%
168
 
0.6%
4번출구 163
 
0.6%
Other values (5173) 23971
83.2%
2024-04-18T04:03:00.245898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18828
 
12.2%
. 10017
 
6.5%
1 7539
 
4.9%
2 6117
 
4.0%
3 4918
 
3.2%
4 4677
 
3.0%
5 3659
 
2.4%
0 3533
 
2.3%
6 3464
 
2.3%
3239
 
2.1%
Other values (589) 87890
57.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 79345
51.6%
Decimal Number 42257
27.5%
Space Separator 18828
 
12.2%
Other Punctuation 10113
 
6.6%
Uppercase Letter 1405
 
0.9%
Open Punctuation 831
 
0.5%
Close Punctuation 831
 
0.5%
Lowercase Letter 170
 
0.1%
Dash Punctuation 63
 
< 0.1%
Math Symbol 22
 
< 0.1%
Other values (3) 16
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3239
 
4.1%
2973
 
3.7%
2208
 
2.8%
2127
 
2.7%
1913
 
2.4%
1883
 
2.4%
1605
 
2.0%
1463
 
1.8%
1378
 
1.7%
1350
 
1.7%
Other values (524) 59206
74.6%
Uppercase Letter
ValueCountFrequency (%)
K 168
12.0%
S 162
11.5%
T 126
 
9.0%
C 117
 
8.3%
G 98
 
7.0%
A 95
 
6.8%
L 88
 
6.3%
D 77
 
5.5%
P 75
 
5.3%
M 65
 
4.6%
Other values (14) 334
23.8%
Lowercase Letter
ValueCountFrequency (%)
e 52
30.6%
k 34
20.0%
s 29
17.1%
t 13
 
7.6%
n 8
 
4.7%
l 7
 
4.1%
g 4
 
2.4%
a 4
 
2.4%
y 4
 
2.4%
v 3
 
1.8%
Other values (6) 12
 
7.1%
Decimal Number
ValueCountFrequency (%)
1 7539
17.8%
2 6117
14.5%
3 4918
11.6%
4 4677
11.1%
5 3659
8.7%
0 3533
8.4%
6 3464
8.2%
7 3209
7.6%
8 2705
 
6.4%
9 2436
 
5.8%
Other Punctuation
ValueCountFrequency (%)
. 10017
99.1%
, 56
 
0.6%
& 21
 
0.2%
? 12
 
0.1%
· 7
 
0.1%
Math Symbol
ValueCountFrequency (%)
~ 17
77.3%
+ 5
 
22.7%
Other Number
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
18828
100.0%
Open Punctuation
ValueCountFrequency (%)
( 831
100.0%
Close Punctuation
ValueCountFrequency (%)
) 831
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 63
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 8
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 79350
51.6%
Common 72956
47.4%
Latin 1575
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3239
 
4.1%
2973
 
3.7%
2208
 
2.8%
2127
 
2.7%
1913
 
2.4%
1883
 
2.4%
1605
 
2.0%
1463
 
1.8%
1378
 
1.7%
1350
 
1.7%
Other values (525) 59211
74.6%
Latin
ValueCountFrequency (%)
K 168
 
10.7%
S 162
 
10.3%
T 126
 
8.0%
C 117
 
7.4%
G 98
 
6.2%
A 95
 
6.0%
L 88
 
5.6%
D 77
 
4.9%
P 75
 
4.8%
M 65
 
4.1%
Other values (30) 504
32.0%
Common
ValueCountFrequency (%)
18828
25.8%
. 10017
13.7%
1 7539
10.3%
2 6117
 
8.4%
3 4918
 
6.7%
4 4677
 
6.4%
5 3659
 
5.0%
0 3533
 
4.8%
6 3464
 
4.7%
7 3209
 
4.4%
Other values (14) 6995
 
9.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 79345
51.6%
ASCII 74521
48.4%
None 12
 
< 0.1%
Enclosed Alphanum 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18828
25.3%
. 10017
13.4%
1 7539
10.1%
2 6117
 
8.2%
3 4918
 
6.6%
4 4677
 
6.3%
5 3659
 
4.9%
0 3533
 
4.7%
6 3464
 
4.6%
7 3209
 
4.3%
Other values (51) 8560
11.5%
Hangul
ValueCountFrequency (%)
3239
 
4.1%
2973
 
3.7%
2208
 
2.8%
2127
 
2.7%
1913
 
2.4%
1883
 
2.4%
1605
 
2.0%
1463
 
1.8%
1378
 
1.7%
1350
 
1.7%
Other values (524) 59206
74.6%
None
ValueCountFrequency (%)
· 7
58.3%
5
41.7%
Enclosed Alphanum
ValueCountFrequency (%)
2
66.7%
1
33.3%

일/월
Categorical

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
202112
1722 
202111
1684 
202109
1682 
202110
1664 
202108
1647 
Other values (3)
1601 

Length

Max length9
Median length6
Mean length6
Min length3

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row202112
2nd row202112
3rd row202112
4th row202107
5th row202112

Common Values

ValueCountFrequency (%)
202112 1722
17.2%
202111 1684
16.8%
202109 1682
16.8%
202110 1664
16.6%
202108 1647
16.5%
202107 1599
16.0%
대여소 1
 
< 0.1%
대여 일자 / 월 1
 
< 0.1%

Length

2024-04-18T04:03:00.350602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T04:03:00.441448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
202112 1722
17.2%
202111 1684
16.8%
202109 1682
16.8%
202110 1664
16.6%
202108 1647
16.5%
202107 1599
16.0%
대여소 1
 
< 0.1%
대여 1
 
< 0.1%
일자 1
 
< 0.1%
1
 
< 0.1%
Distinct3098
Distinct (%)31.0%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2024-04-18T04:03:00.767416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length3
Mean length3.4306431
Min length1

Characters and Unicode

Total characters34303
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1052 ?
Unique (%)10.5%

Sample

1st row766
2nd row2077
3rd row1050
4th row241
5th row455
ValueCountFrequency (%)
382 15
 
0.1%
442 15
 
0.1%
418 14
 
0.1%
373 14
 
0.1%
765 14
 
0.1%
253 13
 
0.1%
332 13
 
0.1%
748 13
 
0.1%
487 12
 
0.1%
305 12
 
0.1%
Other values (3089) 9865
98.7%
2024-04-18T04:03:01.221767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 6101
17.8%
2 4204
12.3%
3 3540
10.3%
5 3102
9.0%
4 3088
9.0%
6 3012
8.8%
7 2987
8.7%
9 2804
8.2%
8 2749
8.0%
0 2711
7.9%
Other values (5) 5
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 34298
> 99.9%
Other Letter 4
 
< 0.1%
Space Separator 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 6101
17.8%
2 4204
12.3%
3 3540
10.3%
5 3102
9.0%
4 3088
9.0%
6 3012
8.8%
7 2987
8.7%
9 2804
8.2%
8 2749
8.0%
0 2711
7.9%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 34299
> 99.9%
Hangul 4
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
1 6101
17.8%
2 4204
12.3%
3 3540
10.3%
5 3102
9.0%
4 3088
9.0%
6 3012
8.8%
7 2987
8.7%
9 2804
8.2%
8 2749
8.0%
0 2711
7.9%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 34299
> 99.9%
Hangul 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 6101
17.8%
2 4204
12.3%
3 3540
10.3%
5 3102
9.0%
4 3088
9.0%
6 3012
8.8%
7 2987
8.7%
9 2804
8.2%
8 2749
8.0%
0 2711
7.9%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Correlations

2024-04-18T04:03:01.308925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분일/월
구분1.0000.720
일/월0.7201.000
2024-04-18T04:03:01.372731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분일/월
구분1.0000.375
일/월0.3751.000
2024-04-18T04:03:01.441313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분일/월
구분1.0000.375
일/월0.3751.000

Missing values

2024-04-18T04:02:59.449403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-18T04:02:59.512359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-18T04:02:59.590187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

구분일/월202107 ~ 202112
14137성동구557. 도선동 주민센터 앞202112766
14619영등포구283. 아크로타워 스퀘어(영등포시장)2021122077
12899강서구1113. 서남환경공원 버스정류장2021121050
705구로구2809.항동지구 11단지 1103동 앞202107241
13055강서구5059. 화곡2동주민센터202112455
3543동대문구674.고대앞사거리 교통섬2021081776
11810송파구4480. 가락시장역 롯데마트2202111721
10446강서구2729.넥센 유니버시티 앞2021111722
13427노원구1654. 당고개입구 오거리202112491
8985성동구3560.성동구 견인차량 보관소 앞2021101301
구분일/월202107 ~ 202112
12237은평구948. 디지털미디어 시티역 4번출구(DMC역)202111911
14534양천구785.양천구청, 보건소 사잇길2021123367
3454도봉구1772.시립도봉노인복지관 버스정류소202108891
7448중랑구1462.동부시장 북문 앞2021092120
9076성북구1346. 길음8골어린이공원 옆2021101047
3523동대문구650. 중랑교사거리2021081871
13748마포구107. 신한은행 서교동금융센터점 앞2021121098
14564영등포구221. 여의도초교 앞202112845
11586성북구1306. 한성대입구역2번출구2021111709
2597강남구2429.압구정로데오역 6번출구202108823