Overview

Dataset statistics

Number of variables: 6
Number of observations: 7007
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 349.1 KiB
Average record size in memory: 51.0 B
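The two memory figures are linked by simple arithmetic: total size divided by the number of observations gives the average record size. The following is a minimal Python check, assuming 1 KiB = 1024 bytes; the variable names are illustrative and not part of the report.

```python
# Values copied from the dataset statistics in this report.
total_size_kib = 349.1
n_observations = 7007

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B")
```

Rounded to one decimal place, the result agrees with the reported average record size.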

Variable types

Categorical: 2
Text: 1
Numeric: 2
DateTime: 1

Dataset

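Description: 자치구 (district), 안심 주소 (safe address), 위도 (latitude), 경도 (longitude), CCTV 수량 (CCTV count), 수정 일시 (modification date/time)
Author: 강남구 (Gangnam-gu)
URL: https://data.seoul.go.kr/dataList/OA-20946/S/1/datasetView.do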

Alerts

자치구 has constant value "강남구" [Constant]
수정 일시 has constant value "2022-12-01 00:00:00" [Constant]
위도 is highly overall correlated with 경도 [High correlation]
경도 is highly overall correlated with 위도 [High correlation]
CCTV 수량 is highly imbalanced (99.6%) [Imbalance]
안심 주소 has unique values [Unique]

Reproduction

Analysis started: 2024-03-13 08:02:31.092696
Analysis finished: 2024-03-13 08:02:31.841359
Duration: 0.75 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
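The reported duration follows directly from the two timestamps. As a small sketch using only the Python standard library (the format string and variable names are assumptions for illustration):

```python
from datetime import datetime

# Timestamp layout used by the "Analysis started/finished" fields.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-03-13 08:02:31.092696", FMT)
finished = datetime.strptime("2024-03-13 08:02:31.841359", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```

The difference is a little under 0.75 s, which the report rounds to two decimal places.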

Variables

자치구
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 54.9 KiB
강남구: 7007

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 강남구
2nd row: 강남구
3rd row: 강남구
4th row: 강남구
5th row: 강남구

Common Values

Value | Count | Frequency (%)
강남구 | 7007 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
강남구 | 7007 | 100.0%

안심 주소
Text

UNIQUE 

Distinct: 7007
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 54.9 KiB

Length

Max length: 12
Median length: 10
Mean length: 9.8090481
Min length: 9

Characters and Unicode

Total characters: 68732
Distinct characters: 57
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
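
To make the character-property idea concrete, the sketch below uses Python's standard `unicodedata` module to count the general category of every character in two sample values of 안심 주소. The two-letter codes map to the category names used in this section: `Nd` is Decimal Number, `Lo` is Other Letter, `Pd` is Dash Punctuation and `Zs` is Space Separator. The helper name `category_counts` is hypothetical, and this is only an illustration of the idea, not necessarily how the profiling tool computes its counts.

```python
import unicodedata
from collections import Counter

# Two sample values of 안심 주소 copied from this report; the first one
# starts with a space character.
samples = [" 삼성1-291-02", "개포1-101-00"]


def category_counts(text):
    """Count characters per Unicode general category (two-letter codes)."""
    return Counter(unicodedata.category(ch) for ch in text)


for value in samples:
    print(repr(value), dict(category_counts(value)))

# Category totals reported in this section, keyed by general category code.
reported_totals = {"Nd": 40063, "Lo": 14608, "Pd": 14060, "Zs": 1}

# The four totals should add up to the reported total character count.
print(sum(reported_totals.values()))
```

The single Space Separator character in the column is consistent with the leading space in the first sample row.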

Unique

Unique: 7007
Unique (%): 100.0%

Sample

1st row:  삼성1-291-02
2nd row: 개포1-101-00
3rd row: 개포1-101-01
4th row: 개포1-101-02
5th row: 개포1-202-00

Value | Count | Frequency (%)
삼성1-291-02 | 1 | < 0.1%
압구정-217-04 | 1 | < 0.1%
압구정-225-01 | 1 | < 0.1%
압구정-225-00 | 1 | < 0.1%
압구정-224-03 | 1 | < 0.1%
압구정-224-02 | 1 | < 0.1%
압구정-224-01 | 1 | < 0.1%
압구정-224-00 | 1 | < 0.1%
압구정-223-02 | 1 | < 0.1%
압구정-223-01 | 1 | < 0.1%
Other values (6997) | 6997 | 99.9%

Most occurring characters

Value | Count | Frequency (%)
- | 14060 | 20.5%
0 | 11072 | 16.1%
2 | 10847 | 15.8%
1 | 6479 | 9.4%
4 | 3278 | 4.8%
3 | 2628 | 3.8%
 | 1767 | 2.6%
5 | 1337 | 1.9%
6 | 1269 | 1.8%
7 | 1211 | 1.8%
Other values (47) | 14784 | 21.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 40063 | 58.3%
Other Letter | 14608 | 21.3%
Dash Punctuation | 14060 | 20.5%
Space Separator | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 1767 | 12.1%
 | 1034 | 7.1%
 | 997 | 6.8%
 | 997 | 6.8%
 | 865 | 5.9%
 | 829 | 5.7%
 | 799 | 5.5%
 | 733 | 5.0%
 | 673 | 4.6%
 | 673 | 4.6%
Other values (35) | 5241 | 35.9%

Decimal Number
Value | Count | Frequency (%)
0 | 11072 | 27.6%
2 | 10847 | 27.1%
1 | 6479 | 16.2%
4 | 3278 | 8.2%
3 | 2628 | 6.6%
5 | 1337 | 3.3%
6 | 1269 | 3.2%
7 | 1211 | 3.0%
8 | 1087 | 2.7%
9 | 855 | 2.1%

Dash Punctuation
Value | Count | Frequency (%)
- | 14060 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 54124 | 78.7%
Hangul | 14608 | 21.3%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 1767 | 12.1%
 | 1034 | 7.1%
 | 997 | 6.8%
 | 997 | 6.8%
 | 865 | 5.9%
 | 829 | 5.7%
 | 799 | 5.5%
 | 733 | 5.0%
 | 673 | 4.6%
 | 673 | 4.6%
Other values (35) | 5241 | 35.9%

Common
Value | Count | Frequency (%)
- | 14060 | 26.0%
0 | 11072 | 20.5%
2 | 10847 | 20.0%
1 | 6479 | 12.0%
4 | 3278 | 6.1%
3 | 2628 | 4.9%
5 | 1337 | 2.5%
6 | 1269 | 2.3%
7 | 1211 | 2.2%
8 | 1087 | 2.0%
Other values (2) | 856 | 1.6%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 54124 | 78.7%
Hangul | 14608 | 21.3%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 14060 | 26.0%
0 | 11072 | 20.5%
2 | 10847 | 20.0%
1 | 6479 | 12.0%
4 | 3278 | 6.1%
3 | 2628 | 4.9%
5 | 1337 | 2.5%
6 | 1269 | 2.3%
7 | 1211 | 2.2%
8 | 1087 | 2.0%
Other values (2) | 856 | 1.6%

Hangul
Value | Count | Frequency (%)
 | 1767 | 12.1%
 | 1034 | 7.1%
 | 997 | 6.8%
 | 997 | 6.8%
 | 865 | 5.9%
 | 829 | 5.7%
 | 799 | 5.5%
 | 733 | 5.0%
 | 673 | 4.6%
 | 673 | 4.6%
Other values (35) | 5241 | 35.9%

위도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 635
Distinct (%): 9.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 37.501856
Minimum: 37.4605
Maximum: 37.5327
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 61.7 KiB

Quantile statistics

Minimum: 37.4605
5-th percentile: 37.4733
Q1: 37.4897
Median: 37.503
Q3: 37.5153
95-th percentile: 37.5259
Maximum: 37.5327
Range: 0.0722
Interquartile range (IQR): 0.0256
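
The range and interquartile range are derived from the other quantiles: range is maximum minus minimum, and IQR is Q3 minus Q1. A brief Python check with the 위도 values listed here (variable names are illustrative):

```python
# Quantile statistics for 위도 (latitude), copied from this report.
minimum, q1, q3, maximum = 37.4605, 37.4897, 37.5153, 37.5327

value_range = maximum - minimum  # spread of the whole column
iqr = q3 - q1                    # spread of the middle 50% of values

print(f"Range: {value_range:.4f}")
print(f"IQR: {iqr:.4f}")
```

Both results agree with the reported figures to four decimal places.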

Descriptive statistics

Standard deviation: 0.016262226
Coefficient of variation (CV): 0.00043363791
Kurtosis: -0.75859127
Mean: 37.501856
Median Absolute Deviation (MAD): 0.0126
Skewness: -0.30086522
Sum: 262775.5
Variance: 0.00026446001
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
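
Several of the descriptive statistics for 위도 can be cross-checked against each other: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of observations. The sketch below recomputes them from the rounded values printed in this report, so small differences in the last digits are expected.

```python
# Values for 위도 (latitude), copied from this report (already rounded).
n_observations = 7007
mean = 37.501856
std = 0.016262226

cv = std / mean                # coefficient of variation
variance = std ** 2            # variance as squared standard deviation
total = mean * n_observations  # sum of all values

print(f"CV: {cv:.8g}")
print(f"Variance: {variance:.8g}")
print(f"Sum: {total:.1f}")
```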
Common Values
Value | Count | Frequency (%)
37.5115 | 34 | 0.5%
37.5 | 33 | 0.5%
37.5122 | 32 | 0.5%
37.5109 | 32 | 0.5%
37.4957 | 32 | 0.5%
37.4941 | 31 | 0.4%
37.4885 | 29 | 0.4%
37.4931 | 28 | 0.4%
37.5063 | 28 | 0.4%
37.5202 | 28 | 0.4%
Other values (625) | 6700 | 95.6%

Minimum 10 values
Value | Count | Frequency (%)
37.4605 | 2 | < 0.1%
37.461 | 1 | < 0.1%
37.4614 | 4 | 0.1%
37.4615 | 1 | < 0.1%
37.4622 | 1 | < 0.1%
37.4623 | 1 | < 0.1%
37.4624 | 1 | < 0.1%
37.4625 | 1 | < 0.1%
37.4626 | 4 | 0.1%
37.4629 | 4 | 0.1%

Maximum 10 values
Value | Count | Frequency (%)
37.5327 | 6 | 0.1%
37.5323 | 3 | < 0.1%
37.5322 | 1 | < 0.1%
37.5321 | 1 | < 0.1%
37.5319 | 3 | < 0.1%
37.5318 | 4 | 0.1%
37.5317 | 1 | < 0.1%
37.5316 | 1 | < 0.1%
37.5315 | 4 | 0.1%
37.5311 | 4 | 0.1%

경도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 718
Distinct (%): 10.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 127.05175
Minimum: 127.0184
Maximum: 127.1219
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 61.7 KiB

Quantile statistics

Minimum: 127.0184
5-th percentile: 127.0246
Q1: 127.0358
Median: 127.0477
Q3: 127.0605
95-th percentile: 127.10107
Maximum: 127.1219
Range: 0.1035
Interquartile range (IQR): 0.0247

Descriptive statistics

Standard deviation: 0.021868179
Coefficient of variation (CV): 0.00017212025
Kurtosis: 0.4700173
Mean: 127.05175
Median Absolute Deviation (MAD): 0.0123
Skewness: 1.0144397
Sum: 890251.59
Variance: 0.00047821725
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common Values
Value | Count | Frequency (%)
127.0443 | 36 | 0.5%
127.0308 | 33 | 0.5%
127.0471 | 31 | 0.4%
127.0491 | 31 | 0.4%
127.0439 | 29 | 0.4%
127.0524 | 28 | 0.4%
127.0523 | 28 | 0.4%
127.0434 | 28 | 0.4%
127.0371 | 27 | 0.4%
127.0375 | 27 | 0.4%
Other values (708) | 6709 | 95.7%

Minimum 10 values
Value | Count | Frequency (%)
127.0184 | 4 | 0.1%
127.0186 | 2 | < 0.1%
127.0188 | 1 | < 0.1%
127.0192 | 1 | < 0.1%
127.0193 | 4 | 0.1%
127.0194 | 10 | 0.1%
127.0195 | 5 | 0.1%
127.0198 | 5 | 0.1%
127.0199 | 1 | < 0.1%
127.02 | 5 | 0.1%

Maximum 10 values
Value | Count | Frequency (%)
127.1219 | 3 | < 0.1%
127.1208 | 1 | < 0.1%
127.1201 | 1 | < 0.1%
127.1199 | 1 | < 0.1%
127.1196 | 3 | < 0.1%
127.1192 | 1 | < 0.1%
127.1186 | 1 | < 0.1%
127.1182 | 1 | < 0.1%
127.118 | 2 | < 0.1%
127.1179 | 2 | < 0.1%

CCTV 수량
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 54.9 KiB
1: 7005
2: 2

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value | Count | Frequency (%)
1 | 7005 | > 99.9%
2 | 2 | < 0.1%
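
The 99.6% imbalance figure in the Alerts section can be reproduced from these two counts. The sketch below assumes the score is defined as one minus the Shannon entropy of the class distribution, normalized by the maximum possible entropy for that number of classes; this definition matches the reported value, but it is stated here as an assumption rather than taken from the report. The function name `imbalance_score` is hypothetical.

```python
import math

# Value counts of CCTV 수량 copied from the Common Values table.
counts = [7005, 2]


def imbalance_score(counts):
    """Return 1 - normalized Shannon entropy of the class counts.

    0 means perfectly balanced classes; values near 1 mean that one
    class dominates. Columns with fewer than two classes score 0.
    """
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)


score = imbalance_score(counts)
print(f"{score:.1%}")
```

With only 2 of 7007 rows holding the value 2, the entropy is close to zero, so the score is close to 1.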

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
1 | 7005 | > 99.9%
2 | 2 | < 0.1%

수정 일시
Date

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 54.9 KiB
Minimum: 2022-12-01 00:00:00
Maximum: 2022-12-01 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

 | 위도 | 경도 | CCTV 수량
위도 | 1.000 | 0.751 | 0.000
경도 | 0.751 | 1.000 | 0.000
CCTV 수량 | 0.000 | 0.000 | 1.000

 | 위도 | 경도 | CCTV 수량
위도 | 1.000 | -0.561 | 0.000
경도 | -0.561 | 1.000 | 0.000
CCTV 수량 | 0.000 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 자치구 | 안심 주소 | 위도 | 경도 | CCTV 수량 | 수정 일시
0 | 강남구 | 삼성1-291-02 | 37.5148 | 127.0559 | 1 | 2022-12-01
1 | 강남구 | 개포1-101-00 | 37.4866 | 127.0567 | 1 | 2022-12-01
2 | 강남구 | 개포1-101-01 | 37.4866 | 127.0567 | 1 | 2022-12-01
3 | 강남구 | 개포1-101-02 | 37.4866 | 127.0567 | 1 | 2022-12-01
4 | 강남구 | 개포1-202-00 | 37.4834 | 127.0527 | 1 | 2022-12-01
5 | 강남구 | 개포1-202-01 | 37.4834 | 127.0527 | 1 | 2022-12-01
6 | 강남구 | 개포1-202-02 | 37.4834 | 127.0527 | 1 | 2022-12-01
7 | 강남구 | 개포1-202-03 | 37.4834 | 127.0527 | 1 | 2022-12-01
8 | 강남구 | 개포1-203-00 | 37.4854 | 127.0561 | 1 | 2022-12-01
9 | 강남구 | 개포1-203-01 | 37.4854 | 127.0561 | 1 | 2022-12-01

Last rows
Index | 자치구 | 안심 주소 | 위도 | 경도 | CCTV 수량 | 수정 일시
6997 | 강남구 | 치수-124-3-00 | 37.4834 | 127.1074 | 1 | 2022-12-01
6998 | 강남구 | 치수-124-4-00 | 37.4834 | 127.1073 | 1 | 2022-12-01
6999 | 강남구 | 치수-124-5-00 | 37.4834 | 127.1074 | 1 | 2022-12-01
7000 | 강남구 | 치수-124-6-00 | 37.4835 | 127.1074 | 1 | 2022-12-01
7001 | 강남구 | 치수-125-00 | 37.4941 | 127.073 | 1 | 2022-12-01
7002 | 강남구 | 치수-126-00 | 37.5287 | 127.0524 | 1 | 2022-12-01
7003 | 강남구 | 치수-127-00 | 37.5285 | 127.0528 | 1 | 2022-12-01
7004 | 강남구 | 치수-128-00 | 37.5283 | 127.0532 | 1 | 2022-12-01
7005 | 강남구 | 치수-128-01 | 37.5283 | 127.0532 | 1 | 2022-12-01
7006 | 강남구 | 치수-128-02 | 37.5283 | 127.0532 | 1 | 2022-12-01