Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 7065 |
Missing cells | 1539 |
Missing cells (%) | 3.6% |
Duplicate rows | 6 |
Duplicate rows (%) | 0.1% |
Total size in memory | 352.0 KiB |
Average record size in memory | 51.0 B |
Variable types
Categorical | 1 |
---|---|
DateTime | 1 |
Text | 2 |
Numeric | 2 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-22190/F/1/datasetView.do |
단속일 has constant value "" | Constant |
Dataset has 6 (0.1%) duplicate rows | Duplicates |
도로명 has 1531 (21.7%) missing values | Missing |
경도 is highly skewed (γ1 = 59.41379953) | Skewed |
위도 is highly skewed (γ1 = 83.71093753) | Skewed |
Reproduction
Analysis started | 2024-04-21 00:07:02.278536 |
---|---|
Analysis finished | 2024-04-21 00:07:05.121040 |
Duration | 2.84 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
단속일
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 55.3 KiB |
20210929 |
---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20210929 |
---|---|
2nd row | 20210929 |
3rd row | 20210929 |
4th row | 20210929 |
5th row | 20210929 |
Common Values
Value | Count | Frequency (%) |
20210929 | 7065 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20210929 | 7065 |
단속시간
Date
Distinct | 6473 |
---|---|
Distinct (%) | 91.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 55.3 KiB |
Minimum | 2024-04-21 00:00:44 |
---|---|
Maximum | 2024-04-21 20:50:27 |
구주소
Text
Distinct | 2613 |
---|---|
Distinct (%) | 37.0% |
Missing | 4 |
Missing (%) | 0.1% |
Memory size | 55.3 KiB |
Length
Max length | 56 |
---|---|
Median length | 41 |
Mean length | 13.83685 |
Min length | 2 |
Characters and Unicode
Total characters | 97702 |
---|---|
Distinct characters | 408 |
Distinct categories | 11 ? |
Distinct scripts | 3 ? |
Distinct blocks | 3 ? |
Unique
Unique | 1312 ? |
---|---|
Unique (%) | 18.6% |
Sample
1st row | 서울특별시 동대문구 장한로18길 9 |
---|---|
2nd row | 서울 강북구 수유동 142 |
3rd row | 서울 중구 흥인동 160 |
4th row | 서울 중구 흥인동 160 |
5th row | 서울 중구 흥인동 160 |
Value | Count | Frequency (%) |
서울 | 2600 | 12.4% |
서울특별시 | 675 | 3.2% |
중구 | 426 | 2.0% |
서초구 | 305 | 1.5% |
종로구 | 233 | 1.1% |
마포구 | 229 | 1.1% |
동대문구 | 201 | 1.0% |
양천구 | 188 | 0.9% |
강서구 | 187 | 0.9% |
신당동 | 175 | 0.8% |
Other values (3238) | 15728 |
Most occurring characters
Value | Count | Frequency (%) |
14502 | 14.8% | |
동 | 6857 | 7.0% |
1 | 5806 | 5.9% |
- | 5262 | 5.4% |
서 | 4145 | 4.2% |
2 | 4033 | 4.1% |
구 | 3592 | 3.7% |
3 | 3488 | 3.6% |
울 | 3297 | 3.4% |
4 | 2929 | 3.0% |
Other values (398) | 43791 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 46866 | |
Decimal Number | 29990 | |
Space Separator | 14502 | 14.8% |
Dash Punctuation | 5262 | 5.4% |
Open Punctuation | 381 | 0.4% |
Close Punctuation | 378 | 0.4% |
Other Punctuation | 294 | 0.3% |
Uppercase Letter | 16 | < 0.1% |
Math Symbol | 6 | < 0.1% |
Lowercase Letter | 6 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 6857 | 14.6% |
서 | 4145 | 8.8% |
구 | 3592 | 7.7% |
울 | 3297 | 7.0% |
로 | 1613 | 3.4% |
가 | 1225 | 2.6% |
시 | 758 | 1.6% |
중 | 722 | 1.5% |
신 | 706 | 1.5% |
별 | 675 | 1.4% |
Other values (362) | 23276 |
Decimal Number
Value | Count | Frequency (%) |
1 | 5806 | |
2 | 4033 | |
3 | 3488 | |
4 | 2929 | |
5 | 2828 | |
7 | 2517 | |
6 | 2372 | |
9 | 2108 | 7.0% |
0 | 2046 | 6.8% |
8 | 1863 | 6.2% |
Uppercase Letter
Value | Count | Frequency (%) |
S | 5 | |
B | 2 | 12.5% |
G | 2 | 12.5% |
K | 1 | 6.2% |
I | 1 | 6.2% |
L | 1 | 6.2% |
C | 1 | 6.2% |
U | 1 | 6.2% |
E | 1 | 6.2% |
T | 1 | 6.2% |
Lowercase Letter
Value | Count | Frequency (%) |
o | 1 | |
w | 1 | |
e | 1 | |
r | 1 | |
d | 1 | |
b | 1 |
Other Punctuation
Value | Count | Frequency (%) |
, | 289 | |
/ | 3 | 1.0% |
: | 1 | 0.3% |
@ | 1 | 0.3% |
Space Separator
Value | Count | Frequency (%) |
14502 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 5262 |
Open Punctuation
Value | Count | Frequency (%) |
( | 381 |
Close Punctuation
Value | Count | Frequency (%) |
) | 378 |
Math Symbol
Value | Count | Frequency (%) |
~ | 6 |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 50813 | |
Hangul | 46866 | |
Latin | 23 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 6857 | 14.6% |
서 | 4145 | 8.8% |
구 | 3592 | 7.7% |
울 | 3297 | 7.0% |
로 | 1613 | 3.4% |
가 | 1225 | 2.6% |
시 | 758 | 1.6% |
중 | 722 | 1.5% |
신 | 706 | 1.5% |
별 | 675 | 1.4% |
Other values (362) | 23276 |
Common
Value | Count | Frequency (%) |
14502 | ||
1 | 5806 | |
- | 5262 | 10.4% |
2 | 4033 | 7.9% |
3 | 3488 | 6.9% |
4 | 2929 | 5.8% |
5 | 2828 | 5.6% |
7 | 2517 | 5.0% |
6 | 2372 | 4.7% |
9 | 2108 | 4.1% |
Other values (9) | 4968 | 9.8% |
Latin
Value | Count | Frequency (%) |
S | 5 | |
B | 2 | 8.7% |
G | 2 | 8.7% |
K | 1 | 4.3% |
Ⅱ | 1 | 4.3% |
I | 1 | 4.3% |
L | 1 | 4.3% |
C | 1 | 4.3% |
U | 1 | 4.3% |
E | 1 | 4.3% |
Other values (7) | 7 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 50835 | |
Hangul | 46866 | |
Number Forms | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
14502 | ||
1 | 5806 | |
- | 5262 | 10.4% |
2 | 4033 | 7.9% |
3 | 3488 | 6.9% |
4 | 2929 | 5.8% |
5 | 2828 | 5.6% |
7 | 2517 | 5.0% |
6 | 2372 | 4.7% |
9 | 2108 | 4.1% |
Other values (25) | 4990 | 9.8% |
Hangul
Value | Count | Frequency (%) |
동 | 6857 | 14.6% |
서 | 4145 | 8.8% |
구 | 3592 | 7.7% |
울 | 3297 | 7.0% |
로 | 1613 | 3.4% |
가 | 1225 | 2.6% |
시 | 758 | 1.6% |
중 | 722 | 1.5% |
신 | 706 | 1.5% |
별 | 675 | 1.4% |
Other values (362) | 23276 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 1 |
도로명
Text
MISSING
 
Distinct | 1781 |
---|---|
Distinct (%) | 32.2% |
Missing | 1531 |
Missing (%) | 21.7% |
Memory size | 55.3 KiB |
Length
Max length | 29 |
---|---|
Median length | 24 |
Mean length | 11.244308 |
Min length | 2 |
Characters and Unicode
Total characters | 62226 |
---|---|
Distinct characters | 327 |
Distinct categories | 8 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 793 ? |
---|---|
Unique (%) | 14.3% |
Sample
1st row | 서울특별시 동대문구 장한로18길 9 |
---|---|
2nd row | 서울특별시 강북구 도봉로 337 |
3rd row | 서울특별시 강북구 솔매로 99 |
4th row | 퇴계로 307 (광희동1가) |
5th row | 을지로 238 (을지로6가) |
Value | Count | Frequency (%) |
서울 | 1339 | 9.3% |
서울특별시 | 409 | 2.8% |
중구 | 226 | 1.6% |
양천구 | 182 | 1.3% |
마포구 | 152 | 1.1% |
청계천로 | 145 | 1.0% |
퇴계로 | 112 | 0.8% |
은평구 | 106 | 0.7% |
11 | 104 | 0.7% |
광진구 | 100 | 0.7% |
Other values (1864) | 11546 |
Most occurring characters
Value | Count | Frequency (%) |
9118 | 14.7% | |
로 | 5533 | 8.9% |
1 | 3896 | 6.3% |
2 | 2452 | 3.9% |
길 | 2308 | 3.7% |
3 | 2224 | 3.6% |
서 | 2049 | 3.3% |
구 | 1971 | 3.2% |
울 | 1775 | 2.9% |
4 | 1641 | 2.6% |
Other values (317) | 29259 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 34050 | |
Decimal Number | 17720 | |
Space Separator | 9118 | 14.7% |
Dash Punctuation | 606 | 1.0% |
Open Punctuation | 333 | 0.5% |
Close Punctuation | 333 | 0.5% |
Uppercase Letter | 62 | 0.1% |
Other Punctuation | 4 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
로 | 5533 | 16.2% |
길 | 2308 | 6.8% |
서 | 2049 | 6.0% |
구 | 1971 | 5.8% |
울 | 1775 | 5.2% |
동 | 988 | 2.9% |
대 | 675 | 2.0% |
천 | 633 | 1.9% |
산 | 506 | 1.5% |
시 | 474 | 1.4% |
Other values (298) | 17138 |
Decimal Number
Value | Count | Frequency (%) |
1 | 3896 | |
2 | 2452 | |
3 | 2224 | |
4 | 1641 | |
6 | 1453 | 8.2% |
5 | 1431 | 8.1% |
7 | 1304 | 7.4% |
0 | 1258 | 7.1% |
8 | 1127 | 6.4% |
9 | 934 | 5.3% |
Uppercase Letter
Value | Count | Frequency (%) |
L | 30 | |
G | 30 | |
C | 1 | 1.6% |
S | 1 | 1.6% |
Space Separator
Value | Count | Frequency (%) |
9118 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 606 |
Open Punctuation
Value | Count | Frequency (%) |
( | 333 |
Close Punctuation
Value | Count | Frequency (%) |
) | 333 |
Other Punctuation
Value | Count | Frequency (%) |
, | 4 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 34050 | |
Common | 28114 | |
Latin | 62 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
로 | 5533 | 16.2% |
길 | 2308 | 6.8% |
서 | 2049 | 6.0% |
구 | 1971 | 5.8% |
울 | 1775 | 5.2% |
동 | 988 | 2.9% |
대 | 675 | 2.0% |
천 | 633 | 1.9% |
산 | 506 | 1.5% |
시 | 474 | 1.4% |
Other values (298) | 17138 |
Common
Value | Count | Frequency (%) |
9118 | ||
1 | 3896 | |
2 | 2452 | 8.7% |
3 | 2224 | 7.9% |
4 | 1641 | 5.8% |
6 | 1453 | 5.2% |
5 | 1431 | 5.1% |
7 | 1304 | 4.6% |
0 | 1258 | 4.5% |
8 | 1127 | 4.0% |
Other values (5) | 2210 | 7.9% |
Latin
Value | Count | Frequency (%) |
L | 30 | |
G | 30 | |
C | 1 | 1.6% |
S | 1 | 1.6% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 34050 | |
ASCII | 28176 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
9118 | ||
1 | 3896 | |
2 | 2452 | 8.7% |
3 | 2224 | 7.9% |
4 | 1641 | 5.8% |
6 | 1453 | 5.2% |
5 | 1431 | 5.1% |
7 | 1304 | 4.6% |
0 | 1258 | 4.5% |
8 | 1127 | 4.0% |
Other values (9) | 2272 | 8.1% |
Hangul
Value | Count | Frequency (%) |
로 | 5533 | 16.2% |
길 | 2308 | 6.8% |
서 | 2049 | 6.0% |
구 | 1971 | 5.8% |
울 | 1775 | 5.2% |
동 | 988 | 2.9% |
대 | 675 | 2.0% |
천 | 633 | 1.9% |
산 | 506 | 1.5% |
시 | 474 | 1.4% |
Other values (298) | 17138 |
경도
Real number (ℝ)
SKEWED
 
Distinct | 3383 |
---|---|
Distinct (%) | 47.9% |
Missing | 2 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 358020.13 |
Minimum | 37.543484 |
---|---|
Maximum | 1.2638998 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 62.2 KiB |
Quantile statistics
Minimum | 37.543484 |
---|---|
5-th percentile | 126.86325 |
Q1 | 126.91909 |
median | 126.98556 |
Q3 | 127.02596 |
95-th percentile | 127.0759 |
Maximum | 1.2638998 × 109 |
Range | 1.2638998 × 109 |
Interquartile range (IQR) | 0.1068695 |
Descriptive statistics
Standard deviation | 21266806 |
---|---|
Coefficient of variation (CV) | 59.401144 |
Kurtosis | 3528.9989 |
Mean | 358020.13 |
Median Absolute Deviation (MAD) | 0.054280531 |
Skewness | 59.4138 |
Sum | 2.5286962 × 109 |
Variance | 4.5227702 × 1014 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
126.9158448 | 59 | 0.8% |
126.9786163 | 53 | 0.8% |
126.93173 | 44 | 0.6% |
126.97281501547188 | 42 | 0.6% |
126.97925943831896 | 40 | 0.6% |
126.924697 | 34 | 0.5% |
126.91878 | 30 | 0.4% |
126.98750515671884 | 29 | 0.4% |
126.9149868 | 23 | 0.3% |
126.86665492510193 | 21 | 0.3% |
Other values (3373) | 6688 |
Value | Count | Frequency (%) |
37.5434837341309 | 1 | |
126.807648069139 | 1 | |
126.80785727929504 | 1 | |
126.807877655171 | 1 | |
126.80794662702776 | 1 | |
126.8081155843709 | 1 | |
126.80856970039116 | 1 | |
126.80861714276212 | 1 | |
126.80878027490792 | 1 | |
126.80880908682272 | 1 |
Value | Count | Frequency (%) |
1263899849.0 | 2 | < 0.1% |
127.17401708848259 | 2 | < 0.1% |
127.1734772 | 3 | |
127.1734543 | 2 | < 0.1% |
127.1733246 | 1 | < 0.1% |
127.1729889 | 3 | |
127.1723099 | 1 | < 0.1% |
127.168663 | 5 | |
127.166030545194 | 2 | < 0.1% |
127.165508 | 4 |
위도
Real number (ℝ)
SKEWED
 
Distinct | 3381 |
---|---|
Distinct (%) | 47.9% |
Missing | 2 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37.564385 |
Minimum | 35.517833 |
---|---|
Maximum | 127.01531 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 62.2 KiB |
Quantile statistics
Minimum | 35.517833 |
---|---|
5-th percentile | 37.476667 |
Q1 | 37.519244 |
median | 37.557256 |
Q3 | 37.573069 |
95-th percentile | 37.640806 |
Maximum | 127.01531 |
Range | 91.497479 |
Interquartile range (IQR) | 0.053825178 |
Descriptive statistics
Standard deviation | 1.0659104 |
---|---|
Coefficient of variation (CV) | 0.02837556 |
Kurtosis | 7026.0692 |
Mean | 37.564385 |
Median Absolute Deviation (MAD) | 0.028588932 |
Skewness | 83.710938 |
Sum | 265317.25 |
Variance | 1.1361651 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
37.5859655 | 59 | 0.8% |
37.5744654 | 53 | 0.8% |
37.520769 | 44 | 0.6% |
37.57306915349144 | 42 | 0.6% |
37.57122342169176 | 40 | 0.6% |
37.491633 | 34 | 0.5% |
37.529953 | 30 | 0.4% |
37.55846872864774 | 29 | 0.4% |
37.5693973 | 23 | 0.3% |
37.5695626 | 21 | 0.3% |
Other values (3371) | 6688 |
Value | Count | Frequency (%) |
35.517833 | 1 | |
37.44122280819926 | 2 | |
37.44410730254636 | 1 | |
37.444220068329805 | 1 | |
37.446707 | 1 | |
37.447218 | 1 | |
37.4477544 | 1 | |
37.44893 | 2 | |
37.4489924636289 | 1 | |
37.450028 | 2 |
Value | Count | Frequency (%) |
127.015312194824 | 1 | < 0.1% |
37.69013773708887 | 4 | |
37.683274044176514 | 1 | < 0.1% |
37.6792296666667 | 1 | < 0.1% |
37.6792063333333 | 1 | < 0.1% |
37.6791726666667 | 1 | < 0.1% |
37.6791491666667 | 1 | < 0.1% |
37.6790831666667 | 1 | < 0.1% |
37.6790255 | 1 | < 0.1% |
37.6785601666667 | 1 | < 0.1% |
경도 | 위도 | |
---|---|---|
경도 | 1.000 | 0.000 |
위도 | 0.000 | 1.000 |
경도 | 위도 | |
---|---|---|
경도 | 1.000 | 0.467 |
위도 | 0.467 | 1.000 |
단속일 | 단속시간 | 구주소 | 도로명 | 경도 | 위도 | |
---|---|---|---|---|---|---|
0 | 20210929 | 00:00:44 | 서울특별시 동대문구 장한로18길 9 | 서울특별시 동대문구 장한로18길 9 | 127.069473 | 37.567763 |
1 | 20210929 | 00:03:36 | 서울 강북구 수유동 142 | 서울특별시 강북구 도봉로 337 | 127.025343 | 37.638049 |
2 | 20210929 | 00:04:20 | 서울 중구 흥인동 160 | <NA> | 127.014166 | 37.565367 |
3 | 20210929 | 00:04:53 | 서울 중구 흥인동 160 | <NA> | 127.014206 | 37.565343 |
4 | 20210929 | 00:05:23 | 서울 중구 흥인동 160 | <NA> | 127.014194 | 37.565294 |
5 | 20210929 | 00:07:03 | 서울 중구 을지로7가 2-36 | <NA> | 127.010355 | 37.565875 |
6 | 20210929 | 00:07:05 | 서울 중구 을지로7가 2-36 | <NA> | 127.010485 | 37.565923 |
7 | 20210929 | 00:07:09 | 서울 중구 을지로7가 2-36 | <NA> | 127.01078 | 37.56601 |
8 | 20210929 | 00:07:10 | 서울 중구 을지로7가 2-36 | <NA> | 127.010922 | 37.566052 |
9 | 20210929 | 00:07:11 | 서울 중구 을지로7가 2-36 | <NA> | 127.010922 | 37.566052 |
단속일 | 단속시간 | 구주소 | 도로명 | 경도 | 위도 | |
---|---|---|---|---|---|---|
7055 | 20210929 | 20:49:12 | 서울특별시 서초구 서초대로77길 15 (서초동, 대경빌딩) | <NA> | 127.026295 | 37.499184 |
7056 | 20210929 | 20:49:31 | 서울특별시 서초구 서초대로77길 15 (서초동, 대경빌딩) | <NA> | 127.026268 | 37.499128 |
7057 | 20210929 | 20:49:32 | 서울특별시 서초구 청두곶길 50 (방배동) | <NA> | 126.985357 | 37.481141 |
7058 | 20210929 | 20:49:41 | 신당동 217-92번지앞 | 청계천로 318 | 127.01403 | 37.569563 |
7059 | 20210929 | 20:49:41 | 서울 중구 신당동 372-12 | <NA> | 127.01135 | 37.5534 |
7060 | 20210929 | 20:49:54 | 방이동 89-28 | 양재대로 1233 | 127.130372 | 37.515717 |
7061 | 20210929 | 20:50:00 | 서울 금천구 독산동 901-4 | 서울특별시 금천구 남부순환로 1424 | 126.908784 | 37.480488 |
7062 | 20210929 | 20:50:01 | 서울특별시 서초구 서초대로77길 15 (서초동, 대경빌딩) | <NA> | 127.026246 | 37.498826 |
7063 | 20210929 | 20:50:16 | 신월3동 166-7 | 남부순환로 364 | 126.828895 | 37.534737 |
7064 | 20210929 | 20:50:27 | 서울특별시 중구 동호로 171 | 서울특별시 중구 동호로 171 | 127.011243 | 37.553442 |
Most frequently occurring
단속일 | 단속시간 | 구주소 | 도로명 | 경도 | 위도 | # duplicates | |
---|---|---|---|---|---|---|---|
0 | 20210929 | 04:10:00 | 서울 관악구 신림동 475-103 | 서울 관악구 조원로31길 57 | 126.918917 | 37.48845 | 2 |
1 | 20210929 | 13:31:51 | 서울 강서구 마곡동 758 | <NA> | 126.825353 | 37.569363 | 2 |
2 | 20210929 | 13:36:00 | 서울 동대문구 장안동 465-1 | 서울특별시 동대문구 천호대로 427-4 | 127.066761 | 37.561834 | 2 |
3 | 20210929 | 14:36:00 | 서울 관악구 봉천동 893-29 | 서울 관악구 남부순환로 1769 | 126.946944 | 37.482138 | 2 |
4 | 20210929 | 18:55:00 | 서울 강서구 염창동 281 | 서울특별시 강서구 공항대로 607 | 126.872405 | 37.547476 | 2 |
5 | 20210929 | 19:27:38 | 서울특별시 동대문구 홍릉로12길 24 | <NA> | 127.044429 | 37.585782 | 2 |