Overview

Dataset statistics

Number of variables: 4
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 410.2 KiB
Average record size in memory: 42.0 B
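
The average record size follows from the two figures above it. As a small arithmetic sketch (the variable names are illustrative and not part of the report), dividing the total size in bytes by the number of observations gives the per-record figure:

```python
# Recompute the average record size from the reported totals.
KIB = 1024                      # bytes per kibibyte
total_bytes = 410.2 * KIB       # "Total size in memory"
n_rows = 10_000                 # "Number of observations"

avg_record_bytes = total_bytes / n_rows
print(f"{avg_record_bytes:.1f} B")
```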

Variable types

Categorical: 1
Text: 1
Numeric: 2

Dataset

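Description: 부산광역시_연제구_CCTV심볼정보_20230901 (Busan Metropolitan City, Yeonje-gu, CCTV symbol information, 2023-09-01)
Author: 부산광역시 연제구 (Yeonje-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15039757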

Alerts

경도 (longitude) is highly skewed (γ1 = -98.36114416) [Skewed]
위도 (latitude) is highly skewed (γ1 = -99.72209187) [Skewed]
심볼 명칭 (symbol name) has unique values [Unique]
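
Skewness this extreme often points to a few far-off values rather than a genuinely lopsided distribution. The sketch below uses synthetic data (not the real columns) and the plain moment-based estimator; the report's own estimator may apply a small-sample correction, so treat the numbers as illustrative only. It shows how one low outlier among 10,000 tightly clustered values drives γ1 strongly negative.

```python
def skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5 (no small-sample correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5

# Synthetic stand-in for a tight cluster of longitudes (assumed, not real data).
cluster = [129.08 + 0.0001 * (i % 200 - 100) for i in range(9_999)]
# The same cluster plus one value far below it.
with_outlier = cluster + [117.99]

print("cluster only :", skewness(cluster))
print("with outlier :", skewness(with_outlier))
```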

Reproduction

Analysis started: 2023-12-10 16:11:28.105083
Analysis finished: 2023-12-10 16:11:29.157692
Duration: 1.05 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
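
The duration can be cross-checked against the two timestamps. A minimal sketch using only the standard library (the format string is an assumption about how the timestamps are written):

```python
from datetime import datetime

# Timestamp layout assumed from the two values shown in the report.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-10 16:11:28.105083", FMT)
finished = datetime.strptime("2023-12-10 16:11:29.157692", FMT)

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```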

Variables

레이어 명칭 (layer name)
Categorical

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value                        Count
새주소 (new address)          4911
지번 (lot number)             4628
건물 (building)                461

Length

Max length: 3
Median length: 2
Mean length: 2.4911
Min length: 2
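
The mean length can be reproduced from the category counts, since 새주소 has three characters and the other two values have two. A short sketch under that reading (the dictionary simply restates the counts reported for this variable):

```python
# Category counts as reported for 레이어 명칭.
counts = {"새주소": 4911, "지번": 4628, "건물": 461}

total = sum(counts.values())
# Weighted mean of the string lengths, weighted by how often each value occurs.
mean_length = sum(len(value) * n for value, n in counts.items()) / total
# Share of each category, as a percentage rounded to one decimal.
shares = {value: round(100 * n / total, 1) for value, n in counts.items()}

print(total, mean_length, shares)
```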

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 건물
2nd row: 지번
3rd row: 지번
4th row: 새주소
5th row: 지번

Common Values

Value     Count   Frequency (%)
새주소     4911    49.1%
지번       4628    46.3%
건물        461     4.6%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot (새주소 4911, 49.1%; 지번 4628, 46.3%; 건물 461, 4.6%)]

심볼 명칭 (symbol name)
Text

Unique

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 36
Median length: 34
Mean length: 20.1212
Min length: 2

Characters and Unicode

Total characters: 201212
Distinct characters: 395
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
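
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one sample value from this variable; the mapping from two-letter codes to the long names used in the report is written out by hand and covers only the codes that occur in this string.

```python
import unicodedata
from collections import Counter

# One of the sample values shown for 심볼 명칭.
sample = "부산광역시 연제구 거제동 760-32"

# General category code for every character, e.g. "Lo" for a Hangul syllable.
category_counts = Counter(unicodedata.category(ch) for ch in sample)

# Long names for the codes that occur here (hand-written subset).
LONG_NAMES = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
}

for code, count in category_counts.most_common():
    print(LONG_NAMES.get(code, code), count)
```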

Unique

Unique: 10000
Unique (%): 100.0%

Sample

1st row: 보광사
2nd row: 부산광역시 연제구 거제동 760-32
3rd row: 부산광역시 연제구 거제동 676-66 춘당빌라
4th row: 부산광역시 연제구 중앙천로25번길 26-3
5th row: 부산광역시 연제구 연산동 19

Value                  Count   Frequency (%)
연제구                  9540    24.3%
부산광역시              9540    24.3%
연산동                  3528     9.0%
거제동                  1104     2.8%
과정로                    98     0.2%
쌍미천로                  94     0.2%
거제천로                  89     0.2%
14                        81     0.2%
13                        78     0.2%
10                        76     0.2%
Other values (7035)    15016    38.3%

Most occurring characters

Value                 Count   Frequency (%)
(space)               29244    14.5%
(missing)             13702     6.8%
(missing)             13450     6.7%
(missing)             11228     5.6%
1                      9953     4.9%
(missing)              9749     4.8%
(missing)              9588     4.8%
(missing)              9571     4.8%
(missing)              9558     4.8%
(missing)              9553     4.7%
Other values (385)    75616    37.6%

Most occurring categories

Value                Count    Frequency (%)
Other Letter         121009   60.1%
Decimal Number        44495   22.1%
Space Separator       29244   14.5%
Dash Punctuation       6427    3.2%
Uppercase Letter         29   < 0.1%
Other Punctuation         3   < 0.1%
Close Punctuation         2   < 0.1%
Open Punctuation          2   < 0.1%
Lowercase Letter          1   < 0.1%

Most frequent character per category

Other Letter
Value                 Count   Frequency (%)
(missing)             13702    11.3%
(missing)             13450    11.1%
(missing)             11228     9.3%
(missing)              9749     8.1%
(missing)              9588     7.9%
(missing)              9571     7.9%
(missing)              9558     7.9%
(missing)              9553     7.9%
(missing)              4930     4.1%
(missing)              4849     4.0%
Other values (354)    24831    20.5%

Uppercase Letter
Value               Count   Frequency (%)
A                       5    17.2%
B                       4    13.8%
S                       3    10.3%
C                       3    10.3%
W                       2     6.9%
K                       2     6.9%
J                       2     6.9%
M                       2     6.9%
I                       1     3.4%
V                       1     3.4%
Other values (4)        4    13.8%

Decimal Number
Value   Count   Frequency (%)
1        9953    22.4%
2        6234    14.0%
3        4895    11.0%
4        4241     9.5%
6        3677     8.3%
5        3634     8.2%
7        3325     7.5%
8        3202     7.2%
0        2778     6.2%
9        2556     5.7%

Other Punctuation
Value   Count   Frequency (%)
,           2    66.7%
/           1    33.3%

Space Separator
Value     Count   Frequency (%)
(space)   29244   100.0%

Dash Punctuation
Value   Count   Frequency (%)
-        6427   100.0%

Close Punctuation
Value   Count   Frequency (%)
)           2   100.0%

Open Punctuation
Value   Count   Frequency (%)
(           2   100.0%

Lowercase Letter
Value   Count   Frequency (%)
e           1   100.0%

Most occurring scripts

Value     Count    Frequency (%)
Hangul    121009   60.1%
Common     80173   39.8%
Latin         30   < 0.1%

Most frequent character per script

Hangul
Value                 Count   Frequency (%)
(missing)             13702    11.3%
(missing)             13450    11.1%
(missing)             11228     9.3%
(missing)              9749     8.1%
(missing)              9588     7.9%
(missing)              9571     7.9%
(missing)              9558     7.9%
(missing)              9553     7.9%
(missing)              4930     4.1%
(missing)              4849     4.0%
Other values (354)    24831    20.5%

Common
Value               Count   Frequency (%)
(space)             29244    36.5%
1                    9953    12.4%
-                    6427     8.0%
2                    6234     7.8%
3                    4895     6.1%
4                    4241     5.3%
6                    3677     4.6%
5                    3634     4.5%
7                    3325     4.1%
8                    3202     4.0%
Other values (6)     5341     6.7%

Latin
Value               Count   Frequency (%)
A                       5    16.7%
B                       4    13.3%
S                       3    10.0%
C                       3    10.0%
W                       2     6.7%
K                       2     6.7%
J                       2     6.7%
M                       2     6.7%
I                       1     3.3%
V                       1     3.3%
Other values (5)        5    16.7%

Most occurring blocks

Value     Count    Frequency (%)
Hangul    121009   60.1%
ASCII      80203   39.9%

Most frequent character per block

ASCII
Value                Count   Frequency (%)
(space)              29244    36.5%
1                     9953    12.4%
-                     6427     8.0%
2                     6234     7.8%
3                     4895     6.1%
4                     4241     5.3%
6                     3677     4.6%
5                     3634     4.5%
7                     3325     4.1%
8                     3202     4.0%
Other values (21)     5371     6.7%

Hangul
Value                 Count   Frequency (%)
(missing)             13702    11.3%
(missing)             13450    11.1%
(missing)             11228     9.3%
(missing)              9749     8.1%
(missing)              9588     7.9%
(missing)              9571     7.9%
(missing)              9558     7.9%
(missing)              9553     7.9%
(missing)              4930     4.1%
(missing)              4849     4.0%
Other values (354)    24831    20.5%

경도 (longitude)
Real number (ℝ)

Skewed

Distinct: 7458
Distinct (%): 74.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 129.08489
Minimum: 117.9926
Maximum: 129.11421
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 117.9926
5-th percentile: 129.06688
Q1: 129.07757
median: 129.08654
Q3: 129.09414
95-th percentile: 129.10785
Maximum: 129.11421
Range: 11.121607
Interquartile range (IQR): 0.01656725
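
Almost the entire range comes from the minimum, which sits far below the 5-th percentile. One common way to flag such values is Tukey's fence rule built on the quartiles. The sketch below is not something the report itself computes; the 1.5 multiplier is the conventional choice and is an assumption here, as are the function and variable names.

```python
# Quartiles as reported for 경도 (longitude).
Q1, Q3 = 129.07757, 129.09414
K = 1.5  # conventional Tukey multiplier (assumed, not taken from the report)

iqr = Q3 - Q1
lower_fence = Q1 - K * iqr
upper_fence = Q3 + K * iqr

def is_outlier(value):
    """Return True when the value lies outside the Tukey fences."""
    return value < lower_fence or value > upper_fence

print(lower_fence, upper_fence)
print(is_outlier(117.992603))  # reported minimum
print(is_outlier(129.08654))   # reported median
```

A stricter or looser multiplier changes how many borderline points are flagged, so the choice of K should follow whatever tolerance suits the downstream use.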

Descriptive statistics

Standard deviation: 0.11154651
Coefficient of variation (CV): 0.00086413296
Kurtosis: 9782.0793
Mean: 129.08489
Median Absolute Deviation (MAD): 0.008132
Skewness: -98.361144
Sum: 1290848.9
Variance: 0.012442623
Monotonicity: Not monotonic

[Figure: histogram with fixed size bins (bins=50)]
Common values
Value                  Count   Frequency (%)
129.08723                  6    0.1%
129.108622                 6    0.1%
129.087527                 6    0.1%
129.088858                 5    0.1%
129.087191                 5    0.1%
129.086229                 5    0.1%
129.095906                 5    0.1%
129.086737                 5    0.1%
129.066575                 5    0.1%
129.107291                 5    0.1%
Other values (7448)     9947   99.5%

Minimum 10 values
Value         Count   Frequency (%)
117.992603        1   < 0.1%
129.050077        1   < 0.1%
129.053432        1   < 0.1%
129.053838        1   < 0.1%
129.054129        1   < 0.1%
129.054186        1   < 0.1%
129.054271        1   < 0.1%
129.054551        1   < 0.1%
129.055756        2   < 0.1%
129.055829        2   < 0.1%

Maximum 10 values
Value         Count   Frequency (%)
129.11421         1   < 0.1%
129.11412         2   < 0.1%
129.114106        1   < 0.1%
129.113942        1   < 0.1%
129.113934        2   < 0.1%
129.113843        1   < 0.1%
129.113834        2   < 0.1%
129.113794        1   < 0.1%
129.113575        1   < 0.1%
129.113525        2   < 0.1%

위도 (latitude)
Real number (ℝ)

Skewed

Distinct: 7051
Distinct (%): 70.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 35.180877
Minimum: 19.694477
Maximum: 35.199215
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 19.694477
5-th percentile: 35.171281
Q1: 35.17721
median: 35.183034
Q3: 35.187401
95-th percentile: 35.192303
Maximum: 35.199215
Range: 15.504738
Interquartile range (IQR): 0.0101915

Descriptive statistics

Standard deviation: 0.15502321
Coefficient of variation (CV): 0.0044064624
Kurtosis: 9962.9567
Mean: 35.180877
Median Absolute Deviation (MAD): 0.0050305
Skewness: -99.722092
Sum: 351808.77
Variance: 0.024032196
Monotonicity: Not monotonic
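
These figures are consistent with a single extreme point dominating the spread. As a rough check (a sketch with illustrative names, using only the rounded statistics printed above), the z-score of the minimum can be compared with the value that moment-based skewness approaches when one point out of n carries essentially all the variance, namely -(n - 2) / sqrt(n - 1):

```python
import math

# Statistics as reported for 위도 (latitude), rounded as printed.
mean = 35.180877
std = 0.15502321
minimum = 19.694477
n = 10_000

# How many standard deviations the minimum lies below the mean.
z_min = (minimum - mean) / std

# Skewness in the idealized case of n - 1 identical values plus one outlier below.
single_outlier_limit = -(n - 2) / math.sqrt(n - 1)

print(z_min, single_outlier_limit)
```

Both numbers land close to the reported skewness of -99.722092, which suggests the minimum deserves a manual look before the column is used for mapping.
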
[Figure: histogram with fixed size bins (bins=50)]
Common values
Value                  Count   Frequency (%)
35.187339                  6    0.1%
35.186105                  6    0.1%
35.188932                  6    0.1%
35.185307                  6    0.1%
35.173749                  6    0.1%
35.18974                   5    0.1%
35.175299                  5    0.1%
35.186664                  5    0.1%
35.172298                  5    0.1%
35.180388                  5    0.1%
Other values (7041)     9945   99.5%

Minimum 10 values
Value        Count   Frequency (%)
19.694477        1   < 0.1%
35.162364        3   < 0.1%
35.163781        1   < 0.1%
35.164009        1   < 0.1%
35.164041        1   < 0.1%
35.164068        2   < 0.1%
35.164118        1   < 0.1%
35.164186        1   < 0.1%
35.164205        1   < 0.1%
35.164213        1   < 0.1%

Maximum 10 values
Value        Count   Frequency (%)
35.199215        1   < 0.1%
35.199061        1   < 0.1%
35.199025        1   < 0.1%
35.198946        1   < 0.1%
35.19894         2   < 0.1%
35.198873        1   < 0.1%
35.198853        1   < 0.1%
35.198777        1   < 0.1%
35.198768        1   < 0.1%
35.198743        1   < 0.1%

Interactions

[Figures: four interaction plots; images not preserved]

Correlations

              레이어 명칭   경도     위도
레이어 명칭    1.000       0.000    0.000
경도           0.000       1.000    0.707
위도           0.000       0.707    1.000

              경도      위도      레이어 명칭
경도          1.000    -0.136    0.000
위도         -0.136     1.000    0.000
레이어 명칭   0.000     0.000    1.000
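
The two matrices disagree on the 경도/위도 pair (0.707 versus -0.136), and the text does not say which coefficient each one uses. One possible explanation, offered here only as an assumption, is that one measure is sensitive to a shared extreme point while the other is rank-based. The sketch below uses synthetic, independent coordinates plus one far-off point to show how Pearson and Spearman coefficients can diverge in that situation; none of the values come from the real dataset.

```python
import math
import random

def pearson(xs, ys):
    """Pearson product-moment correlation."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(vx * vy)

def ranks(values):
    """1-based ranks, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=values.__getitem__)
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg
        i = j + 1
    return result

def spearman(xs, ys):
    """Spearman rank correlation: Pearson applied to the ranks."""
    return pearson(ranks(xs), ranks(ys))

rng = random.Random(0)  # fixed seed so the sketch is repeatable
xs = [129.08 + rng.gauss(0, 0.01) for _ in range(2000)]
ys = [35.18 + rng.gauss(0, 0.005) for _ in range(2000)]  # independent of xs
xs.append(117.99)  # one point far below the cluster on both axes
ys.append(19.69)

r_pearson = pearson(xs, ys)
r_spearman = spearman(xs, ys)
print(r_pearson, r_spearman)
```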

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index   레이어 명칭   심볼 명칭                                    경도          위도
551     건물          보광사                                       129.082215    35.162364
21586   지번          부산광역시 연제구 거제동 760-32              129.065987    35.180318
21259   지번          부산광역시 연제구 거제동 676-66 춘당빌라     129.067025    35.180074
16707   새주소        부산광역시 연제구 중앙천로25번길 26-3        129.081967    35.177463
26252   지번          부산광역시 연제구 연산동 19                  129.103168    35.181894
16836   새주소        부산광역시 연제구 중앙천로37번가길 5         129.082219    35.177611
32746   지번          부산광역시 연제구 연산동 660-42 스위트빌     129.085874    35.179257
25077   지번          부산광역시 연제구 연산동 1811-211            129.095906    35.173027
8094    새주소        부산광역시 연제구 대리로5번길 86-12          129.084165    35.184781
8313    새주소        부산광역시 연제구 마곡천로 8                 129.087671    35.173672

Index   레이어 명칭   심볼 명칭                                    경도          위도
17701   새주소        부산광역시 연제구 해맞이로77번길 46          129.066289    35.181055
10442   새주소        부산광역시 연제구 쌍미천로135번길 11-1       129.086459    35.184584
11843   새주소        부산광역시 연제구 아시아드대로65번가길 15    129.066541    35.189823
32740   지번          부산광역시 연제구 연산동 660-37              129.086733    35.179143
915     건물          양지빌라                                     129.064993    35.181678
26731   지번          부산광역시 연제구 연산동 2003-25             129.086159    35.173553
16362   새주소        부산광역시 연제구 중앙대로1251번길 22-1      129.077119    35.1986
20406   지번          부산광역시 연제구 거제동 453-24              129.072177    35.184112
16614   새주소        부산광역시 연제구 중앙천로19번길 19-5        129.082423    35.176129
14802   새주소        부산광역시 연제구 월드컵대로226번길 7        129.072933    35.189565