Overview

Dataset statistics

Number of variables6
Number of observations444
Missing cells0
Missing cells (%)0.0%
Duplicate rows8
Duplicate rows (%)1.8%
Total size in memory21.8 KiB
Average record size in memory50.3 B

Variable types

Categorical2
Text1
Numeric2
DateTime1

Dataset

Description경기도 광주시 관내 가로화단에 대한 데이터로 지형지물부호, 관리기관, 지번주소, 길이(미터), 면적(제곱미터) 등을 제공합니다.
Author경기도 광주시
URLhttps://www.data.go.kr/data/15042239/fileData.do

Alerts

지형지물부호 has constant value ""Constant
관리기관 has constant value ""Constant
데이터기준일 has constant value ""Constant
Dataset has 8 (1.8%) duplicate rowsDuplicates
길이 is highly overall correlated with 면적High correlation
면적 is highly overall correlated with 길이High correlation

Reproduction

Analysis started2023-12-12 15:06:13.512558
Analysis finished2023-12-12 15:06:14.311822
Duration0.8 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지형지물부호
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
가로화단
444 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가로화단
2nd row가로화단
3rd row가로화단
4th row가로화단
5th row가로화단

Common Values

ValueCountFrequency (%)
가로화단 444
100.0%

Length

2023-12-13T00:06:14.382245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:06:14.495702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
가로화단 444
100.0%

관리기관
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
광주시
444 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row광주시
2nd row광주시
3rd row광주시
4th row광주시
5th row광주시

Common Values

ValueCountFrequency (%)
광주시 444
100.0%

Length

2023-12-13T00:06:14.609104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:06:14.711741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
광주시 444
100.0%

위치
Text

Distinct284
Distinct (%)64.0%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
2023-12-13T00:06:14.949812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length24
Mean length19.707207
Min length14

Characters and Unicode

Total characters8750
Distinct characters125
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique219 ?
Unique (%)49.3%

Sample

1st row경기도 광주시 쌍령동 372도
2nd row경기도 광주시 쌍령동 372도
3rd row경기도 광주시 쌍령동 372도
4th row경기도 광주시 신현동 568-10도
5th row경기도 광주시 송정동 39-4도
ValueCountFrequency (%)
광주시 446
22.7%
경기도 444
22.5%
곤지암읍 67
 
3.4%
송정동 55
 
2.8%
초월읍 48
 
2.4%
도척면 46
 
2.3%
탄벌동 34
 
1.7%
중대동 28
 
1.4%
밀목사거리~광주대로99 26
 
1.3%
문형동 26
 
1.3%
Other values (350) 749
38.0%
2023-12-13T00:06:15.350941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1525
17.4%
747
 
8.5%
482
 
5.5%
473
 
5.4%
447
 
5.1%
446
 
5.1%
444
 
5.1%
- 336
 
3.8%
238
 
2.7%
2 223
 
2.5%
Other values (115) 3389
38.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5317
60.8%
Decimal Number 1546
 
17.7%
Space Separator 1525
 
17.4%
Dash Punctuation 336
 
3.8%
Math Symbol 26
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
747
14.0%
482
 
9.1%
473
 
8.9%
447
 
8.4%
446
 
8.4%
444
 
8.4%
238
 
4.5%
219
 
4.1%
115
 
2.2%
94
 
1.8%
Other values (102) 1612
30.3%
Decimal Number
ValueCountFrequency (%)
2 223
14.4%
1 210
13.6%
6 188
12.2%
5 181
11.7%
4 165
10.7%
3 162
10.5%
9 137
8.9%
7 124
8.0%
0 87
 
5.6%
8 69
 
4.5%
Space Separator
ValueCountFrequency (%)
1525
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 336
100.0%
Math Symbol
ValueCountFrequency (%)
~ 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5317
60.8%
Common 3433
39.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
747
14.0%
482
 
9.1%
473
 
8.9%
447
 
8.4%
446
 
8.4%
444
 
8.4%
238
 
4.5%
219
 
4.1%
115
 
2.2%
94
 
1.8%
Other values (102) 1612
30.3%
Common
ValueCountFrequency (%)
1525
44.4%
- 336
 
9.8%
2 223
 
6.5%
1 210
 
6.1%
6 188
 
5.5%
5 181
 
5.3%
4 165
 
4.8%
3 162
 
4.7%
9 137
 
4.0%
7 124
 
3.6%
Other values (3) 182
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5317
60.8%
ASCII 3433
39.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1525
44.4%
- 336
 
9.8%
2 223
 
6.5%
1 210
 
6.1%
6 188
 
5.5%
5 181
 
5.3%
4 165
 
4.8%
3 162
 
4.7%
9 137
 
4.0%
7 124
 
3.6%
Other values (3) 182
 
5.3%
Hangul
ValueCountFrequency (%)
747
14.0%
482
 
9.1%
473
 
8.9%
447
 
8.4%
446
 
8.4%
444
 
8.4%
238
 
4.5%
219
 
4.1%
115
 
2.2%
94
 
1.8%
Other values (102) 1612
30.3%

길이
Real number (ℝ)

HIGH CORRELATION 

Distinct407
Distinct (%)91.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28.77732
Minimum0
Maximum508.39
Zeros1
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size4.0 KiB
2023-12-13T00:06:15.520223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2.303
Q16.83
median14.05
Q330.385
95-th percentile110.716
Maximum508.39
Range508.39
Interquartile range (IQR)23.555

Descriptive statistics

Standard deviation46.234103
Coefficient of variation (CV)1.606616
Kurtosis35.424219
Mean28.77732
Median Absolute Deviation (MAD)9.13
Skewness4.9252584
Sum12777.13
Variance2137.5923
MonotonicityNot monotonic
2023-12-13T00:06:15.687390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8.94 4
 
0.9%
8.46 4
 
0.9%
2.26 3
 
0.7%
2.3 2
 
0.5%
8.98 2
 
0.5%
43.18 2
 
0.5%
13.8 2
 
0.5%
4.7 2
 
0.5%
3.09 2
 
0.5%
3.49 2
 
0.5%
Other values (397) 419
94.4%
ValueCountFrequency (%)
0.0 1
0.2%
0.95 1
0.2%
1.31 1
0.2%
1.74 1
0.2%
1.79 2
0.5%
1.94 1
0.2%
1.95 1
0.2%
2.04 1
0.2%
2.09 1
0.2%
2.16 2
0.5%
ValueCountFrequency (%)
508.39 1
0.2%
348.5 1
0.2%
281.1 1
0.2%
245.3 1
0.2%
233.87 2
0.5%
191.17 1
0.2%
180.37 1
0.2%
164.88 1
0.2%
155.45 1
0.2%
151.99 1
0.2%

면적
Real number (ℝ)

HIGH CORRELATION 

Distinct307
Distinct (%)69.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean74.217793
Minimum0
Maximum2521.1
Zeros1
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size4.0 KiB
2023-12-13T00:06:15.866972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3.1
Q17.15
median16.6
Q350.675
95-th percentile257.58
Maximum2521.1
Range2521.1
Interquartile range (IQR)43.525

Descriptive statistics

Standard deviation233.65931
Coefficient of variation (CV)3.1482924
Kurtosis69.343105
Mean74.217793
Median Absolute Deviation (MAD)12.4
Skewness7.6529121
Sum32952.7
Variance54596.674
MonotonicityNot monotonic
2023-12-13T00:06:16.055195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3.1 9
 
2.0%
8.9 7
 
1.6%
3.6 5
 
1.1%
3.2 5
 
1.1%
6.0 5
 
1.1%
4.4 4
 
0.9%
8.6 4
 
0.9%
3.8 4
 
0.9%
6.7 4
 
0.9%
2.8 4
 
0.9%
Other values (297) 393
88.5%
ValueCountFrequency (%)
0.0 1
 
0.2%
1.0 1
 
0.2%
1.2 2
0.5%
1.3 1
 
0.2%
1.5 1
 
0.2%
1.8 1
 
0.2%
1.9 1
 
0.2%
2.1 3
0.7%
2.3 1
 
0.2%
2.7 2
0.5%
ValueCountFrequency (%)
2521.1 2
0.5%
2142.9 1
0.2%
1132.0 1
0.2%
1093.0 1
0.2%
923.6 1
0.2%
922.7 1
0.2%
863.1 1
0.2%
764.5 1
0.2%
549.1 1
0.2%
540.3 2
0.5%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
Minimum2023-11-21 00:00:00
Maximum2023-11-21 00:00:00
2023-12-13T00:06:16.163540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:06:16.281018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-13T00:06:13.940120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:06:13.734025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:06:14.023991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:06:13.847756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:06:16.370498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
길이면적
길이1.0000.818
면적0.8181.000
2023-12-13T00:06:16.491106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
길이면적
길이1.0000.887
면적0.8871.000

Missing values

2023-12-13T00:06:14.134347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:06:14.265539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지형지물부호관리기관위치길이면적데이터기준일
0가로화단광주시경기도 광주시 쌍령동 372도21.9191.22023-11-21
1가로화단광주시경기도 광주시 쌍령동 372도25.98113.62023-11-21
2가로화단광주시경기도 광주시 쌍령동 372도23.7212.32023-11-21
3가로화단광주시경기도 광주시 신현동 568-10도4.223.62023-11-21
4가로화단광주시경기도 광주시 송정동 39-4도5.173.22023-11-21
5가로화단광주시경기도 광주시 직동 산7-2임6.025.62023-11-21
6가로화단광주시경기도 광주시 광주시 역동 28-88도0.00.02023-11-21
7가로화단광주시경기도 광주시 문형동 474-8전10.5710.32023-11-21
8가로화단광주시경기도 광주시 문형동 521-5전24.6525.62023-11-21
9가로화단광주시경기도 광주시 문형동 산66-3임6.526.32023-11-21
지형지물부호관리기관위치길이면적데이터기준일
434가로화단광주시경기도 광주시 탄벌동 90-10도4.914.72023-11-21
435가로화단광주시경기도 광주시 탄벌동 39-10전112.4115.02023-11-21
436가로화단광주시경기도 광주시 송정동 44-6도25.251.92023-11-21
437가로화단광주시경기도 광주시 송정동 41-5도24.6951.92023-11-21
438가로화단광주시경기도 광주시 탄벌동 698-9천46.87144.02023-11-21
439가로화단광주시경기도 광주시 송정동 40-3도32.5267.82023-11-21
440가로화단광주시경기도 광주시 송정동 547-2도37.42130.02023-11-21
441가로화단광주시경기도 광주시 송정동 50-2도26.4756.82023-11-21
442가로화단광주시경기도 광주시 송정동 488도24.7978.72023-11-21
443가로화단광주시경기도 광주시 탄벌동 527-56임114.94117.22023-11-21

Duplicate rows

Most frequently occurring

지형지물부호관리기관위치길이면적데이터기준일# duplicates
3가로화단광주시경기도 광주시 신현동 산131-52임8.468.52023-11-213
0가로화단광주시경기도 광주시 곤지암읍 신촌리 산1-6도50.45540.32023-11-212
1가로화단광주시경기도 광주시 송정동 529도233.872521.12023-11-212
2가로화단광주시경기도 광주시 송정동 62-6천2.263.12023-11-212
4가로화단광주시경기도 광주시 중대동 231-6도43.1850.42023-11-212
5가로화단광주시경기도 광주시 초월읍 산아리 77-13도93.81230.82023-11-212
6가로화단광주시경기도 광주시 탄벌동 527-56임114.94117.22023-11-212
7가로화단광주시경기도 광주시 탄벌동 702천8.948.92023-11-212