Overview

Dataset statistics

Number of variables3
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows113
Duplicate rows (%)1.1%
Total size in memory312.5 KiB
Average record size in memory32.0 B

Variable types

DateTime2
Text1

Dataset

Description파일 다운로드
Author강서구
URLhttps://data.seoul.go.kr/dataList/OA-21795/F/1/datasetView.do

Alerts

Dataset has 113 (1.1%) duplicate rowsDuplicates

Reproduction

Analysis started2024-04-20 15:45:04.748018
Analysis finished2024-04-20 15:45:06.779908
Duration2.03 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct217
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2020-01-01 00:00:00
Maximum2020-08-04 00:00:00
2024-04-21T00:45:06.903571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T00:45:07.154461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct1054
Distinct (%)10.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2024-04-21 00:00:00
Maximum2024-04-21 23:59:00
2024-04-21T00:45:07.422477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T00:45:07.655340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2549
Distinct (%)25.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-21T00:45:08.633374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length9.5589
Min length3

Characters and Unicode

Total characters95589
Distinct characters418
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1985 ?
Unique (%)19.9%

Sample

1st row엠밸리15단지주변
2nd row서울식물원주변
3rd row국내선2층
4th row엠밸리712동건너편주변
5th row1165-1
ValueCountFrequency (%)
주변 1508
 
9.4%
부근 1067
 
6.7%
보타닉파크타워 402
 
2.5%
좋은책신사고주변 238
 
1.5%
마곡필네이처오피스텔주변 213
 
1.3%
마곡사이언스타워 194
 
1.2%
트라이콤택주변 194
 
1.2%
886 185
 
1.2%
마곡지엠지타워 180
 
1.1%
1373 170
 
1.1%
Other values (2098) 11660
72.8%
2024-04-21T00:45:09.676000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6356
 
6.6%
6031
 
6.3%
6019
 
6.3%
3425
 
3.6%
1 2988
 
3.1%
2548
 
2.7%
2102
 
2.2%
3 2040
 
2.1%
2 1634
 
1.7%
6 1470
 
1.5%
Other values (408) 60976
63.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 72289
75.6%
Decimal Number 15125
 
15.8%
Space Separator 6031
 
6.3%
Dash Punctuation 911
 
1.0%
Uppercase Letter 810
 
0.8%
Close Punctuation 178
 
0.2%
Open Punctuation 178
 
0.2%
Lowercase Letter 30
 
< 0.1%
Math Symbol 20
 
< 0.1%
Other Punctuation 15
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6356
 
8.8%
6019
 
8.3%
3425
 
4.7%
2548
 
3.5%
2102
 
2.9%
1371
 
1.9%
1348
 
1.9%
1342
 
1.9%
1298
 
1.8%
1237
 
1.7%
Other values (361) 45243
62.6%
Uppercase Letter
ValueCountFrequency (%)
T 282
34.8%
I 213
26.3%
F 107
 
13.2%
S 97
 
12.0%
K 90
 
11.1%
H 4
 
0.5%
C 3
 
0.4%
G 2
 
0.2%
E 2
 
0.2%
L 2
 
0.2%
Other values (7) 8
 
1.0%
Lowercase Letter
ValueCountFrequency (%)
t 6
20.0%
g 5
16.7%
k 5
16.7%
l 4
13.3%
i 3
10.0%
o 2
 
6.7%
h 1
 
3.3%
u 1
 
3.3%
e 1
 
3.3%
m 1
 
3.3%
Decimal Number
ValueCountFrequency (%)
1 2988
19.8%
3 2040
13.5%
2 1634
10.8%
6 1470
9.7%
8 1455
9.6%
4 1449
9.6%
7 1394
9.2%
5 1377
9.1%
0 691
 
4.6%
9 627
 
4.1%
Other Punctuation
ValueCountFrequency (%)
. 9
60.0%
, 5
33.3%
@ 1
 
6.7%
Space Separator
ValueCountFrequency (%)
6031
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 911
100.0%
Close Punctuation
ValueCountFrequency (%)
) 178
100.0%
Open Punctuation
ValueCountFrequency (%)
( 178
100.0%
Math Symbol
ValueCountFrequency (%)
~ 20
100.0%
Other Number
ValueCountFrequency (%)
¹ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 72289
75.6%
Common 22460
 
23.5%
Latin 840
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6356
 
8.8%
6019
 
8.3%
3425
 
4.7%
2548
 
3.5%
2102
 
2.9%
1371
 
1.9%
1348
 
1.9%
1342
 
1.9%
1298
 
1.8%
1237
 
1.7%
Other values (361) 45243
62.6%
Latin
ValueCountFrequency (%)
T 282
33.6%
I 213
25.4%
F 107
 
12.7%
S 97
 
11.5%
K 90
 
10.7%
t 6
 
0.7%
g 5
 
0.6%
k 5
 
0.6%
H 4
 
0.5%
l 4
 
0.5%
Other values (18) 27
 
3.2%
Common
ValueCountFrequency (%)
6031
26.9%
1 2988
13.3%
3 2040
 
9.1%
2 1634
 
7.3%
6 1470
 
6.5%
8 1455
 
6.5%
4 1449
 
6.5%
7 1394
 
6.2%
5 1377
 
6.1%
- 911
 
4.1%
Other values (9) 1711
 
7.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 72289
75.6%
ASCII 23298
 
24.4%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6356
 
8.8%
6019
 
8.3%
3425
 
4.7%
2548
 
3.5%
2102
 
2.9%
1371
 
1.9%
1348
 
1.9%
1342
 
1.9%
1298
 
1.8%
1237
 
1.7%
Other values (361) 45243
62.6%
ASCII
ValueCountFrequency (%)
6031
25.9%
1 2988
12.8%
3 2040
 
8.8%
2 1634
 
7.0%
6 1470
 
6.3%
8 1455
 
6.2%
4 1449
 
6.2%
7 1394
 
6.0%
5 1377
 
5.9%
- 911
 
3.9%
Other values (36) 2549
10.9%
None
ValueCountFrequency (%)
¹ 2
100.0%

Missing values

2024-04-21T00:45:06.593310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T00:45:06.722337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

단속일단속시간단속장소
34852020-01-1014:39엠밸리15단지주변
597032020-05-1314:56서울식물원주변
548442020-05-0214:21국내선2층
837512020-07-0110:39엠밸리712동건너편주변
958672020-07-2517:361165-1
126112020-02-0322:35강서로7길 28
258022020-03-0111:27FITI시험연구원주변
22722020-01-0813:18초록마을로 66-7
404612020-03-3108:55마곡동로4길 마곡힐스테이트
766432020-06-1712:08보타닉파크타워 주변
단속일단속시간단속장소
93552020-01-2416:19357-1
221542020-02-2219:38의약품수출입협회주변
976322020-07-3010:36발산역1번출구주변
227472020-02-2413:11서울식물원주변
784072020-06-2019:39마곡사이언스타워 주변
122602020-02-0310:42252-14
58572020-01-1519:25가양시니어스타워 주변
553212020-05-0407:38마곡필네이처오피스텔주변
534012020-04-2908:05가양시니어스타워 주변
370672020-03-2411:01국내선2층

Duplicate rows

Most frequently occurring

단속일단속시간단속장소# duplicates
82020-01-1613:53마곡동로8길 부근4
472020-03-1613:20812 부근4
732020-04-2214:43마곡중앙8로3길 부근4
1102020-07-2808:52강서로 부근4
32020-01-1020:50마곡중앙로 부근3
72020-01-1613:50마곡중앙8로3길 부근3
242020-02-1815:02마곡중앙8로3길 부근3
292020-02-2721:371373 부근3
342020-03-0409:43마곡중앙12로 부근3
562020-03-2315:26마곡중앙12로 부근3