Overview

Dataset statistics

Number of variables: 3
Number of observations: 657
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 9
Duplicate rows (%): 1.4%
Total size in memory: 15.5 KiB
Average record size in memory: 24.2 B
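
The two derived figures above can be recomputed from the raw counts. The following is a minimal sketch, assuming 1 KiB = 1024 bytes and rounding to one decimal place; the variable names are illustrative and not part of the report.

```python
# Recompute the derived dataset statistics from the raw counts in the report.
n_rows = 657            # number of observations
n_duplicates = 9        # duplicate rows
total_size_kib = 15.5   # total size in memory (assumption: 1 KiB = 1024 B)

# Share of duplicate rows, as a percentage rounded to one decimal place.
duplicate_pct = round(100 * n_duplicates / n_rows, 1)

# Average record size in bytes, rounded to one decimal place.
avg_record_bytes = round(total_size_kib * 1024 / n_rows, 1)

print(f"Duplicate rows (%): {duplicate_pct}%")
print(f"Average record size in memory: {avg_record_bytes} B")
```

Both results agree with the values listed in the statistics above.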

Variable types

Categorical: 1
Text: 1
DateTime: 1

Dataset

Description: File download
Author: 구로구 (Guro-gu)
URL: https://data.seoul.go.kr/dataList/OA-21855/F/1/datasetView.do

Alerts

기준일자 has a constant value, 2023-03-31 [Constant]
Dataset has 9 (1.4%) duplicate rows [Duplicates]

Reproduction

Analysis started: 2024-04-20 23:51:06.915310
Analysis finished: 2024-04-20 23:51:07.432453
Duration: 0.52 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

행정동 (administrative dong)
Categorical

Distinct: 13
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 5.3 KiB

Value             Count
수궁동             101
구로2동            83
고척2동            56
구로5동            55
개봉1동            54
Other values (8)  308

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 구로2동
2nd row: 구로2동
3rd row: 구로2동
4th row: 구로2동
5th row: 구로2동

Common Values

Value             Count  Frequency (%)
수궁동             101    15.4%
구로2동            83     12.6%
고척2동            56     8.5%
구로5동            55     8.4%
개봉1동            54     8.2%
오류1동            53     8.1%
구로3동            50     7.6%
오류2동            47     7.2%
구로4동            39     5.9%
개봉3동            39     5.9%
Other values (3)  80     12.2%

Length

[Figure: Histogram of lengths of the category]

위치 (location)
Text

Distinct: 648
Distinct (%): 98.6%
Missing: 0
Missing (%): 0.0%
Memory size: 5.3 KiB

Length

Max length: 15
Median length: 13
Mean length: 10.004566
Min length: 5

Characters and Unicode

Total characters: 6573
Distinct characters: 64
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
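
As an illustration of that idea, the sketch below counts characters by Unicode general category using Python's standard `unicodedata` module, applied to the five sample values of 위치 shown in this report. It is not the profiler's own implementation. The function name `category_counts` is hypothetical. The two-letter codes map to the category names used in this report: `Lo` is Other Letter, `Nd` is Decimal Number, `Zs` is Space Separator, and `Pd` is Dash Punctuation.

```python
import unicodedata
from collections import Counter

# The five sample values of the 위치 column shown in this report.
samples = [
    "가마산로20다길 3",
    "구로동로28길 58",
    "가마산로20다길 28-14",
    "가마산로20다길 22-10",
    "가마산로20나길 22-2",
]


def category_counts(values):
    """Count characters by Unicode general category (hypothetical helper)."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)


counts = category_counts(samples)
for code, n in counts.most_common():
    print(code, n)
```

On these five values the Hangul syllables fall under `Lo`, the digits under `Nd`, the spaces under `Zs`, and the hyphens under `Pd`. These are the same four categories the report lists for the full column.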

Unique

Unique: 639
Unique (%): 97.3%

Sample

1st row: 가마산로20다길 3
2nd row: 구로동로28길 58
3rd row: 가마산로20다길 28-14
4th row: 가마산로20다길 22-10
5th row: 가마산로20나길 22-2

Words

Value               Count  Frequency (%)
13                  14     1.1%
16                  13     1.0%
부일로1길            13     1.0%
고척로               12     0.9%
오리로21길           11     0.8%
고척로3길            10     0.8%
경인로35길           10     0.8%
18                  9      0.7%
14                  9      0.7%
23                  9      0.7%
Other values (572)  1200   91.6%
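
A frequency table like the one above can be approximated by splitting each value on whitespace and counting the resulting tokens. This is a minimal sketch that assumes plain whitespace tokenization. The profiler's actual tokenization is not stated in the report and may differ, for example in how it treats hyphenated building numbers.

```python
from collections import Counter

# The five sample values of the 위치 column shown in this report.
samples = [
    "가마산로20다길 3",
    "구로동로28길 58",
    "가마산로20다길 28-14",
    "가마산로20다길 22-10",
    "가마산로20나길 22-2",
]

# Assumption: tokens are separated by whitespace only.
word_counts = Counter(word for value in samples for word in value.split())
total = sum(word_counts.values())

for word, n in word_counts.most_common(3):
    print(f"{word}\t{n}\t{100 * n / total:.1f}%")
```

Each address yields a road-name token and a building-number token, which is why both street names and bare numbers appear in the table.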

Most occurring characters

Value              Count  Frequency (%)
(glyph lost)       764    11.6%
(space)            658    10.0%
1                  624    9.5%
(glyph lost)       568    8.6%
2                  446    6.8%
3                  314    4.8%
4                  243    3.7%
5                  236    3.6%
-                  203    3.1%
6                  194    3.0%
Other values (54)  2323   35.3%

Most occurring categories

Value             Count  Frequency (%)
Other Letter      3015   45.9%
Decimal Number    2697   41.0%
Space Separator   658    10.0%
Dash Punctuation  203    3.1%

Most frequent character per category

Other Letter
Value              Count  Frequency (%)
(glyph lost)       764    25.3%
(glyph lost)       568    18.8%
(glyph lost)       116    3.8%
(glyph lost)       112    3.7%
(glyph lost)       112    3.7%
(glyph lost)       99     3.3%
(glyph lost)       74     2.5%
(glyph lost)       73     2.4%
(glyph lost)       69     2.3%
(glyph lost)       69     2.3%
Other values (42)  959    31.8%

Decimal Number
Value  Count  Frequency (%)
1      624    23.1%
2      446    16.5%
3      314    11.6%
4      243    9.0%
5      236    8.8%
6      194    7.2%
7      184    6.8%
8      171    6.3%
0      162    6.0%
9      123    4.6%

Space Separator
Value    Count  Frequency (%)
(space)  658    100.0%

Dash Punctuation
Value  Count  Frequency (%)
-      203    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  3558   54.1%
Hangul  3015   45.9%

Most frequent character per script

Hangul
Value              Count  Frequency (%)
(glyph lost)       764    25.3%
(glyph lost)       568    18.8%
(glyph lost)       116    3.8%
(glyph lost)       112    3.7%
(glyph lost)       112    3.7%
(glyph lost)       99     3.3%
(glyph lost)       74     2.5%
(glyph lost)       73     2.4%
(glyph lost)       69     2.3%
(glyph lost)       69     2.3%
Other values (42)  959    31.8%

Common
Value             Count  Frequency (%)
(space)           658    18.5%
1                 624    17.5%
2                 446    12.5%
3                 314    8.8%
4                 243    6.8%
5                 236    6.6%
-                 203    5.7%
6                 194    5.5%
7                 184    5.2%
8                 171    4.8%
Other values (2)  285    8.0%

Most occurring blocks

Value   Count  Frequency (%)
ASCII   3558   54.1%
Hangul  3015   45.9%

Most frequent character per block

Hangul
Value              Count  Frequency (%)
(glyph lost)       764    25.3%
(glyph lost)       568    18.8%
(glyph lost)       116    3.8%
(glyph lost)       112    3.7%
(glyph lost)       112    3.7%
(glyph lost)       99     3.3%
(glyph lost)       74     2.5%
(glyph lost)       73     2.4%
(glyph lost)       69     2.3%
(glyph lost)       69     2.3%
Other values (42)  959    31.8%

ASCII
Value             Count  Frequency (%)
(space)           658    18.5%
1                 624    17.5%
2                 446    12.5%
3                 314    8.8%
4                 243    6.8%
5                 236    6.6%
-                 203    5.7%
6                 194    5.5%
7                 184    5.2%
8                 171    4.8%
Other values (2)  285    8.0%

기준일자 (reference date)
Date

Constant

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 5.3 KiB
Minimum: 2023-03-31 00:00:00
Maximum: 2023-03-31 00:00:00

[Figure: Histogram with fixed size bins (bins=1)]

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display that lets you quickly spot patterns in data completion.]

Sample

First rows

Index  행정동   위치                  기준일자
0      구로2동  가마산로20다길 3       2023-03-31
1      구로2동  구로동로28길 58        2023-03-31
2      구로2동  가마산로20다길 28-14   2023-03-31
3      구로2동  가마산로20다길 22-10   2023-03-31
4      구로2동  가마산로20나길 22-2    2023-03-31
5      구로2동  구로동로 221-1        2023-03-31
6      구로2동  구로동로 205          2023-03-31
7      구로2동  구로동로43길 24        2023-03-31
8      구로2동  구로동로43길 40        2023-03-31
9      구로2동  가마산로 134          2023-03-31

Last rows

Index  행정동   위치                  기준일자
647    개봉3동  개봉로1길 106         2023-03-31
648    개봉3동  개봉로1길 40          2023-03-31
649    개봉3동  개봉로12길 9-14       2023-03-31
650    개봉3동  개봉로2길 75          2023-03-31
651    개봉3동  개봉로1나길 1         2023-03-31
652    개봉3동  개봉로1길 22          2023-03-31
653    개봉3동  개봉로10길 27         2023-03-31
654    개봉3동  개봉로6길 11-2        2023-03-31
655    개봉3동  개봉로16길 26         2023-03-31
656    개봉3동  개봉로15길 91         2023-03-31

Duplicate rows

Most frequently occurring

Index  행정동   위치                  기준일자     # duplicates
0      고척2동  경인로35길 104-7      2023-03-31  2
1      고척2동  경인로35길 94-7       2023-03-31  2
2      고척2동  경인로35길 98-1       2023-03-31  2
3      구로2동  가마산로20나길 22-2    2023-03-31  2
4      구로2동  가마산로20다길 22-10   2023-03-31  2
5      구로2동  가마산로20다길 3       2023-03-31  2
6      구로5동  공원로7길 12          2023-03-31  2
7      수궁동   부일로17길 50         2023-03-31  2
8      수궁동   부일로5길 30          2023-03-31  2
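
A table of this kind can be derived by counting identical rows and keeping those that occur more than once. The sketch below uses only the Python standard library on a small illustrative subset built from rows shown in this report. The subset and its ordering are assumptions made for the example, not the actual dataset.

```python
from collections import Counter

# Illustrative subset (assumption): rows taken from this report, two of them
# repeated to mimic the duplicates listed above.
rows = [
    ("구로2동", "가마산로20다길 3", "2023-03-31"),
    ("구로2동", "구로동로28길 58", "2023-03-31"),
    ("구로2동", "가마산로20다길 3", "2023-03-31"),
    ("수궁동", "부일로5길 30", "2023-03-31"),
    ("수궁동", "부일로5길 30", "2023-03-31"),
]

# Count how often each full row occurs, then keep the repeated ones.
occurrences = Counter(rows)
duplicates = {row: n for row, n in occurrences.items() if n > 1}

for (dong, location, date), n in duplicates.items():
    print(dong, location, date, n)
```

Comparing whole rows rather than only the 위치 column matters here, because the same street address could in principle appear under different 행정동 values.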