Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows181
Duplicate rows (%)1.8%
Total size in memory410.2 KiB
Average record size in memory42.0 B

Variable types

Text2
Numeric2

Dataset

Description경상남도 창원시 5개 구(의창, 성산, 마산합포, 마산회원, 진해구) 소재 가로등 위치 정보입니다. 항목은 표찰번호, 가로등위치주소, 위도, 경도 입니다.
Author경상남도 창원시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15074277

Alerts

Dataset has 181 (1.8%) duplicate rowsDuplicates
경도 is highly overall correlated with 위도High correlation
위도 is highly overall correlated with 경도High correlation

Reproduction

Analysis started2023-12-11 00:19:41.616497
Analysis finished2023-12-11 00:19:42.599728
Duration0.98 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct9810
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T09:19:42.841823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length17
Mean length11.14
Min length3

Characters and Unicode

Total characters111400
Distinct characters208
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9627 ?
Unique (%)96.3%

Sample

1st row신항로-03 R4-02
2nd row수정1길2L-4
3rd row공단로473-02 좌-07
4th row외동반림로-05 우2-01
5th row중동중앙로LP-63 좌07
ValueCountFrequency (%)
좌-01 125
 
0.8%
좌-04 107
 
0.6%
좌-03 104
 
0.6%
좌-02 103
 
0.6%
우-03 99
 
0.6%
우-01 98
 
0.6%
우-04 95
 
0.6%
우-02 92
 
0.6%
우1-01 88
 
0.5%
r-01 83
 
0.5%
Other values (4906) 15604
94.0%
2023-12-11T09:19:43.290062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 15728
 
14.1%
0 12502
 
11.2%
1 9748
 
8.8%
8739
 
7.8%
6601
 
5.9%
2 5479
 
4.9%
3 3623
 
3.3%
4 2998
 
2.7%
L 2697
 
2.4%
R 2696
 
2.4%
Other values (198) 40589
36.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 42983
38.6%
Other Letter 40360
36.2%
Dash Punctuation 15728
 
14.1%
Space Separator 6601
 
5.9%
Uppercase Letter 5542
 
5.0%
Other Punctuation 170
 
0.2%
Open Punctuation 8
 
< 0.1%
Close Punctuation 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8739
21.7%
2330
 
5.8%
2320
 
5.7%
2255
 
5.6%
1154
 
2.9%
994
 
2.5%
957
 
2.4%
831
 
2.1%
781
 
1.9%
636
 
1.6%
Other values (178) 19363
48.0%
Decimal Number
ValueCountFrequency (%)
0 12502
29.1%
1 9748
22.7%
2 5479
12.7%
3 3623
 
8.4%
4 2998
 
7.0%
5 2417
 
5.6%
7 1881
 
4.4%
6 1791
 
4.2%
8 1357
 
3.2%
9 1187
 
2.8%
Uppercase Letter
ValueCountFrequency (%)
L 2697
48.7%
R 2696
48.6%
P 147
 
2.7%
U 2
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
. 168
98.8%
/ 2
 
1.2%
Dash Punctuation
ValueCountFrequency (%)
- 15728
100.0%
Space Separator
ValueCountFrequency (%)
6601
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 65498
58.8%
Hangul 40360
36.2%
Latin 5542
 
5.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8739
21.7%
2330
 
5.8%
2320
 
5.7%
2255
 
5.6%
1154
 
2.9%
994
 
2.5%
957
 
2.4%
831
 
2.1%
781
 
1.9%
636
 
1.6%
Other values (178) 19363
48.0%
Common
ValueCountFrequency (%)
- 15728
24.0%
0 12502
19.1%
1 9748
14.9%
6601
10.1%
2 5479
 
8.4%
3 3623
 
5.5%
4 2998
 
4.6%
5 2417
 
3.7%
7 1881
 
2.9%
6 1791
 
2.7%
Other values (6) 2730
 
4.2%
Latin
ValueCountFrequency (%)
L 2697
48.7%
R 2696
48.6%
P 147
 
2.7%
U 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 71040
63.8%
Hangul 40360
36.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 15728
22.1%
0 12502
17.6%
1 9748
13.7%
6601
9.3%
2 5479
 
7.7%
3 3623
 
5.1%
4 2998
 
4.2%
L 2697
 
3.8%
R 2696
 
3.8%
5 2417
 
3.4%
Other values (10) 6551
9.2%
Hangul
ValueCountFrequency (%)
8739
21.7%
2330
 
5.8%
2320
 
5.7%
2255
 
5.6%
1154
 
2.9%
994
 
2.5%
957
 
2.4%
831
 
2.1%
781
 
1.9%
636
 
1.6%
Other values (178) 19363
48.0%

주소
Text

Distinct3960
Distinct (%)39.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T09:19:43.610074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length24
Mean length19.2962
Min length11

Characters and Unicode

Total characters192962
Distinct characters144
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2494 ?
Unique (%)24.9%

Sample

1st row진해구 용원동 1326 도
2nd row경상남도 창원시 마산합포구 구산면 229-1
3rd row성산구 성산동 74-5 장
4th row성산구 두대동 506-1 도
5th row경상남도 창원시 의창구 중동 795-5
ValueCountFrequency (%)
경상남도 5458
 
12.4%
창원시 5458
 
12.4%
성산구 2745
 
6.2%
진해구 2107
 
4.8%
마산합포구 2026
 
4.6%
의창구 1827
 
4.2%
1478
 
3.4%
마산회원구 1295
 
2.9%
용원동 470
 
1.1%
내서읍 469
 
1.1%
Other values (3675) 20597
46.9%
2023-12-11T09:19:44.084946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39943
20.7%
10201
 
5.3%
9055
 
4.7%
8814
 
4.6%
7546
 
3.9%
7442
 
3.9%
7323
 
3.8%
1 6900
 
3.6%
6227
 
3.2%
- 5853
 
3.0%
Other values (134) 83658
43.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 110966
57.5%
Space Separator 39943
 
20.7%
Decimal Number 36200
 
18.8%
Dash Punctuation 5853
 
3.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10201
 
9.2%
9055
 
8.2%
8814
 
7.9%
7546
 
6.8%
7442
 
6.7%
7323
 
6.6%
6227
 
5.6%
5831
 
5.3%
5576
 
5.0%
5458
 
4.9%
Other values (122) 37493
33.8%
Decimal Number
ValueCountFrequency (%)
1 6900
19.1%
5 4898
13.5%
2 4165
11.5%
3 3878
10.7%
0 3508
9.7%
4 3011
8.3%
7 2755
 
7.6%
6 2704
 
7.5%
8 2225
 
6.1%
9 2156
 
6.0%
Space Separator
ValueCountFrequency (%)
39943
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5853
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 110966
57.5%
Common 81996
42.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10201
 
9.2%
9055
 
8.2%
8814
 
7.9%
7546
 
6.8%
7442
 
6.7%
7323
 
6.6%
6227
 
5.6%
5831
 
5.3%
5576
 
5.0%
5458
 
4.9%
Other values (122) 37493
33.8%
Common
ValueCountFrequency (%)
39943
48.7%
1 6900
 
8.4%
- 5853
 
7.1%
5 4898
 
6.0%
2 4165
 
5.1%
3 3878
 
4.7%
0 3508
 
4.3%
4 3011
 
3.7%
7 2755
 
3.4%
6 2704
 
3.3%
Other values (2) 4381
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 110966
57.5%
ASCII 81996
42.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
39943
48.7%
1 6900
 
8.4%
- 5853
 
7.1%
5 4898
 
6.0%
2 4165
 
5.1%
3 3878
 
4.7%
0 3508
 
4.3%
4 3011
 
3.7%
7 2755
 
3.4%
6 2704
 
3.3%
Other values (2) 4381
 
5.3%
Hangul
ValueCountFrequency (%)
10201
 
9.2%
9055
 
8.2%
8814
 
7.9%
7546
 
6.8%
7442
 
6.7%
7323
 
6.6%
6227
 
5.6%
5831
 
5.3%
5576
 
5.0%
5458
 
4.9%
Other values (122) 37493
33.8%

경도
Real number (ℝ)

HIGH CORRELATION 

Distinct9166
Distinct (%)91.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean83.25887
Minimum35.083353
Maximum128.71756
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T09:19:44.225363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum35.083353
5-th percentile35.100183
Q135.192801
median128.47243
Q3128.58006
95-th percentile128.65043
Maximum128.71756
Range93.634202
Interquartile range (IQR)93.387258

Descriptive statistics

Standard deviation46.685397
Coefficient of variation (CV)0.56072581
Kurtosis-1.9968855
Mean83.25887
Median Absolute Deviation (MAD)0.22840265
Skewness-0.059233997
Sum832588.7
Variance2179.5263
MonotonicityNot monotonic
2023-12-11T09:19:44.348255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
128.5695953 6
 
0.1%
128.5562134 5
 
0.1%
128.5791016 5
 
0.1%
128.5602112 5
 
0.1%
128.5583038 5
 
0.1%
128.5786591 4
 
< 0.1%
128.5619812 4
 
< 0.1%
128.5636597 4
 
< 0.1%
128.4864197 4
 
< 0.1%
128.5644226 4
 
< 0.1%
Other values (9156) 9954
99.5%
ValueCountFrequency (%)
35.08335335 1
< 0.1%
35.08337432 1
< 0.1%
35.08339809 1
< 0.1%
35.08341958 1
< 0.1%
35.08343494 1
< 0.1%
35.0835285 1
< 0.1%
35.08360308 1
< 0.1%
35.08362005 1
< 0.1%
35.08368637 1
< 0.1%
35.08381363 1
< 0.1%
ValueCountFrequency (%)
128.7175554 1
< 0.1%
128.7174453 1
< 0.1%
128.7167035 1
< 0.1%
128.7165952 1
< 0.1%
128.7163835 1
< 0.1%
128.7161774 1
< 0.1%
128.7153479 1
< 0.1%
128.7147244 1
< 0.1%
128.7144168 1
< 0.1%
128.7137012 1
< 0.1%

위도
Real number (ℝ)

HIGH CORRELATION 

Distinct9655
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean80.579385
Minimum35.058617
Maximum128.82732
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T09:19:44.485825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum35.058617
5-th percentile35.140819
Q135.228727
median35.333669
Q3128.69368
95-th percentile128.80335
Maximum128.82732
Range93.768703
Interquartile range (IQR)93.46495

Descriptive statistics

Standard deviation46.723505
Coefficient of variation (CV)0.5798444
Kurtosis-1.9968849
Mean80.579385
Median Absolute Deviation (MAD)0.2187534
Skewness0.059234474
Sum805793.85
Variance2183.0859
MonotonicityNot monotonic
2023-12-11T09:19:44.603797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
35.21343994 3
 
< 0.1%
35.23542023 3
 
< 0.1%
35.16228104 3
 
< 0.1%
35.23070526 3
 
< 0.1%
128.6847022 3
 
< 0.1%
128.6884687 2
 
< 0.1%
35.24841309 2
 
< 0.1%
35.21351242 2
 
< 0.1%
128.7057164 2
 
< 0.1%
35.20286179 2
 
< 0.1%
Other values (9645) 9975
99.8%
ValueCountFrequency (%)
35.05861664 2
< 0.1%
35.059021 1
< 0.1%
35.05907059 1
< 0.1%
35.05936432 1
< 0.1%
35.05971146 1
< 0.1%
35.05980682 1
< 0.1%
35.06006241 1
< 0.1%
35.06020355 1
< 0.1%
35.0605278 1
< 0.1%
35.06106949 1
< 0.1%
ValueCountFrequency (%)
128.8273193 1
< 0.1%
128.8270752 1
< 0.1%
128.826845 1
< 0.1%
128.8268281 1
< 0.1%
128.8261182 1
< 0.1%
128.8258994 1
< 0.1%
128.8258818 1
< 0.1%
128.8256443 1
< 0.1%
128.8249356 1
< 0.1%
128.8249202 1
< 0.1%

Interactions

2023-12-11T09:19:42.259914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:19:42.086871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:19:42.351206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:19:42.174263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:19:44.671946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
경도위도
경도1.0001.000
위도1.0001.000
2023-12-11T09:19:44.757203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
경도위도
경도1.000-0.736
위도-0.7361.000

Missing values

2023-12-11T09:19:42.454894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:19:42.545000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

표찰번호주소경도위도
27385신항로-03 R4-02진해구 용원동 1326 도35.084861128.818417
17459수정1길2L-4경상남도 창원시 마산합포구 구산면 229-1128.59188835.12265
10634공단로473-02 좌-07성산구 성산동 74-5 장35.206668128.674156
13104외동반림로-05 우2-01성산구 두대동 506-1 도35.229785128.67107
3824중동중앙로LP-63 좌07경상남도 창원시 의창구 중동 795-5128.62502635.25565
7582창원대로-18 좌-03성산구 남산동 946도35.203123128.695619
10466공단로474-03 우2-05성산구 상복동 588-2 도35.198765128.663036
4493창이대로-01 좌1-08경상남도 창원시 의창구 명서동 503128.64412135.233184
23298함마대로3R-2경상남도 창원시 마산회원구 내서읍 676128.51345835.251965
1978사화로-09 좌04경상남도 창원시 의창구 팔용동 532-2128.62573535.247917
표찰번호주소경도위도
15149경남대로10R-8경상남도 창원시 마산합포구 예곡동 777128.54449535.167305
9423창이대로739-01 좌-08성산구 사파정동 69-4 답35.215974128.704162
23590호원로1R-5경상남도 창원시 마산회원구 내서읍 산 76-1128.53497335.242805
16017드림베이대로12R-4경상남도 창원시 마산합포구 가포동 산 82128.58941635.15871
3674중동남로LP-58 좌04경상남도 창원시 의창구 중동 795-4128.63012535.252775
2117상남로-01 좌2-03경상남도 창원시 의창구 용동 산 2-31128.69878935.247255
6712남면로-01 우-07성산구 가음정동 390도35.2019128.686652
22520송평로8L-9경상남도 창원시 마산회원구 내서읍 산 121-11128.54470835.2523
8788충혼로-04 우1-05성산구 반림동 501도35.238664128.665968
12993적현로279-02 우-04성산구 신촌동 60-2임35.197513128.603461

Duplicate rows

Most frequently occurring

표찰번호주소경도위도# duplicates
143창원대로-15 우1-05성산구 가음정동 558-2 도35.20976128.6847023
0가양로116-01 좌2-08성산구 대방동 1128-7구35.211469128.7053852
1가음로-03 우-03성산구 남양동 512 도35.204716128.7037642
2가음로-03 좌-03성산구 남양동 45차35.203294128.7050532
3가음로73-01 우-01성산구 남양동 35-10대35.20705128.7025252
4가음정로-03 우-02성산구 가음동 500-1 도35.208586128.6896982
5가음정로-03 우-05성산구 가음동 500-1 도35.207884128.6909382
6가음정로-06 우-04성산구 가음동 511 도35.203752128.6985132
7가음정로-06 좌-03성산구 가음동 511 도35.202887128.7000682
8가음정로-06 좌-04성산구 가음동 511 도35.202746128.700312