Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory507.8 KiB
Average record size in memory52.0 B

Variable types

Numeric3
Categorical1
Text1

Dataset

Description경기도 성남시 도로상 조명시설 지오태깅 데이터입니다. NO(일련번호), SER(일련번호), LATITUDE(위도), LONGITUDE(경도) 데이터를 제공하고 있습니다.
Author경기도 성남시
URLhttps://www.data.go.kr/data/15110582/fileData.do

Alerts

순번(NO) is highly overall correlated with 구분(LAMP)High correlation
위도(LATITUDE) is highly overall correlated with 경도(LONGITUDE) and 1 other fieldsHigh correlation
경도(LONGITUDE) is highly overall correlated with 위도(LATITUDE)High correlation
구분(LAMP) is highly overall correlated with 순번(NO) and 1 other fieldsHigh correlation
순번(NO) has unique valuesUnique

Reproduction

Analysis started2023-12-12 15:03:57.049995
Analysis finished2023-12-12 15:03:59.239898
Duration2.19 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번(NO)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16726.018
Minimum2
Maximum33564
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T00:03:59.332420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile1626.95
Q18331.75
median16778.5
Q325036.25
95-th percentile31840.05
Maximum33564
Range33562
Interquartile range (IQR)16704.5

Descriptive statistics

Standard deviation9659.3659
Coefficient of variation (CV)0.57750542
Kurtosis-1.1915137
Mean16726.018
Median Absolute Deviation (MAD)8352.5
Skewness-0.002993532
Sum1.6726018 × 108
Variance93303351
MonotonicityNot monotonic
2023-12-13T00:03:59.490874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20795 1
 
< 0.1%
10146 1
 
< 0.1%
25926 1
 
< 0.1%
8524 1
 
< 0.1%
25569 1
 
< 0.1%
23247 1
 
< 0.1%
3916 1
 
< 0.1%
23836 1
 
< 0.1%
7283 1
 
< 0.1%
8657 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
2 1
< 0.1%
3 1
< 0.1%
10 1
< 0.1%
11 1
< 0.1%
13 1
< 0.1%
14 1
< 0.1%
18 1
< 0.1%
20 1
< 0.1%
35 1
< 0.1%
42 1
< 0.1%
ValueCountFrequency (%)
33564 1
< 0.1%
33563 1
< 0.1%
33561 1
< 0.1%
33558 1
< 0.1%
33557 1
< 0.1%
33554 1
< 0.1%
33551 1
< 0.1%
33548 1
< 0.1%
33547 1
< 0.1%
33545 1
< 0.1%

구분(LAMP)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
7596 
2
2404 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row2

Common Values

ValueCountFrequency (%)
1 7596
76.0%
2 2404
 
24.0%

Length

2023-12-13T00:03:59.620940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:03:59.723927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 7596
76.0%
2 2404
 
24.0%
Distinct9918
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T00:04:00.019426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length6.9941
Min length3

Characters and Unicode

Total characters69941
Distinct characters136
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9836 ?
Unique (%)98.4%

Sample

1st row서현동-993
2nd row서현동-482
3rd row운중동-1051
4th row수진동-370
5th row탄리로-10
ValueCountFrequency (%)
광명로-50 2
 
< 0.1%
자혜로-3 2
 
< 0.1%
수정남로-12 2
 
< 0.1%
금상로-12 2
 
< 0.1%
성남대로-14 2
 
< 0.1%
둔토로-1 2
 
< 0.1%
둔촌대로-37 2
 
< 0.1%
금상로-8 2
 
< 0.1%
서판교로-26 2
 
< 0.1%
탄리로-8 2
 
< 0.1%
Other values (9909) 9981
99.8%
2023-12-13T00:04:00.473722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 10000
 
14.3%
7620
 
10.9%
1 5631
 
8.1%
2 3507
 
5.0%
3 2991
 
4.3%
4 2693
 
3.9%
5 2623
 
3.8%
6 2508
 
3.6%
2404
 
3.4%
7 2340
 
3.3%
Other values (126) 27624
39.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 31030
44.4%
Decimal Number 28910
41.3%
Dash Punctuation 10000
 
14.3%
Space Separator 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7620
24.6%
2404
 
7.7%
1408
 
4.5%
1054
 
3.4%
931
 
3.0%
727
 
2.3%
724
 
2.3%
713
 
2.3%
599
 
1.9%
589
 
1.9%
Other values (114) 14261
46.0%
Decimal Number
ValueCountFrequency (%)
1 5631
19.5%
2 3507
12.1%
3 2991
10.3%
4 2693
9.3%
5 2623
9.1%
6 2508
8.7%
7 2340
8.1%
8 2281
7.9%
0 2180
 
7.5%
9 2156
 
7.5%
Dash Punctuation
ValueCountFrequency (%)
- 10000
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 38911
55.6%
Hangul 31030
44.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7620
24.6%
2404
 
7.7%
1408
 
4.5%
1054
 
3.4%
931
 
3.0%
727
 
2.3%
724
 
2.3%
713
 
2.3%
599
 
1.9%
589
 
1.9%
Other values (114) 14261
46.0%
Common
ValueCountFrequency (%)
- 10000
25.7%
1 5631
14.5%
2 3507
 
9.0%
3 2991
 
7.7%
4 2693
 
6.9%
5 2623
 
6.7%
6 2508
 
6.4%
7 2340
 
6.0%
8 2281
 
5.9%
0 2180
 
5.6%
Other values (2) 2157
 
5.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 38911
55.6%
Hangul 31030
44.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 10000
25.7%
1 5631
14.5%
2 3507
 
9.0%
3 2991
 
7.7%
4 2693
 
6.9%
5 2623
 
6.7%
6 2508
 
6.4%
7 2340
 
6.0%
8 2281
 
5.9%
0 2180
 
5.6%
Other values (2) 2157
 
5.5%
Hangul
ValueCountFrequency (%)
7620
24.6%
2404
 
7.7%
1408
 
4.5%
1054
 
3.4%
931
 
3.0%
727
 
2.3%
724
 
2.3%
713
 
2.3%
599
 
1.9%
589
 
1.9%
Other values (114) 14261
46.0%

위도(LATITUDE)
Real number (ℝ)

HIGH CORRELATION 

Distinct9504
Distinct (%)95.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.408452
Minimum37.334188
Maximum37.474066
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T00:04:00.629688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum37.334188
5-th percentile37.348686
Q137.380715
median37.406767
Q337.441348
95-th percentile37.457631
Maximum37.474066
Range0.139878
Interquartile range (IQR)0.060632928

Descriptive statistics

Standard deviation0.035293352
Coefficient of variation (CV)0.00094345929
Kurtosis-1.1513408
Mean37.408452
Median Absolute Deviation (MAD)0.0321665
Skewness-0.17387962
Sum374084.52
Variance0.0012456207
MonotonicityNot monotonic
2023-12-13T00:04:00.773050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.41408602 13
 
0.1%
37.39210776 12
 
0.1%
37.38511558 10
 
0.1%
37.38485282 7
 
0.1%
37.40755873 7
 
0.1%
37.40619383 5
 
0.1%
37.398161 4
 
< 0.1%
37.391597 4
 
< 0.1%
37.460852 4
 
< 0.1%
37.356462 3
 
< 0.1%
Other values (9494) 9931
99.3%
ValueCountFrequency (%)
37.334188 2
< 0.1%
37.334378 1
< 0.1%
37.334389 1
< 0.1%
37.334417 1
< 0.1%
37.334582 1
< 0.1%
37.334622 1
< 0.1%
37.334625 1
< 0.1%
37.334816 1
< 0.1%
37.334854 1
< 0.1%
37.334909 1
< 0.1%
ValueCountFrequency (%)
37.474066 1
< 0.1%
37.473889 1
< 0.1%
37.473791 1
< 0.1%
37.473571 1
< 0.1%
37.473567 1
< 0.1%
37.473437 1
< 0.1%
37.473426 1
< 0.1%
37.473405 1
< 0.1%
37.47326 1
< 0.1%
37.473231 1
< 0.1%

경도(LONGITUDE)
Real number (ℝ)

HIGH CORRELATION 

Distinct9326
Distinct (%)93.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean127.12608
Minimum127.0572
Maximum127.18154
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T00:04:00.971158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum127.0572
5-th percentile127.08628
Q1127.10961
median127.12645
Q3127.14187
95-th percentile127.1659
Maximum127.18154
Range0.1243345
Interquartile range (IQR)0.032257

Descriptive statistics

Standard deviation0.023516558
Coefficient of variation (CV)0.00018498611
Kurtosis-0.24606025
Mean127.12608
Median Absolute Deviation (MAD)0.0162985
Skewness-0.068282547
Sum1271260.8
Variance0.00055302851
MonotonicityNot monotonic
2023-12-13T00:04:01.157063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
127.1220354 13
 
0.1%
127.0979202 12
 
0.1%
127.1553804 10
 
0.1%
127.1451299 7
 
0.1%
127.1276577 7
 
0.1%
127.1456108 5
 
0.1%
127.106995 4
 
< 0.1%
127.113121 4
 
< 0.1%
127.116934 4
 
< 0.1%
127.106054 4
 
< 0.1%
Other values (9316) 9930
99.3%
ValueCountFrequency (%)
127.057204 1
< 0.1%
127.057235 1
< 0.1%
127.057294 1
< 0.1%
127.057435 1
< 0.1%
127.057628 1
< 0.1%
127.057868 1
< 0.1%
127.057939 1
< 0.1%
127.059236 1
< 0.1%
127.059354 1
< 0.1%
127.05955 1
< 0.1%
ValueCountFrequency (%)
127.1815385 1
< 0.1%
127.1815066 1
< 0.1%
127.1813171 1
< 0.1%
127.1810692 1
< 0.1%
127.1809866 2
< 0.1%
127.180928 1
< 0.1%
127.1808439 1
< 0.1%
127.1807632 1
< 0.1%
127.1807082 1
< 0.1%
127.1806262 1
< 0.1%

Interactions

2023-12-13T00:03:58.690263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:03:57.996672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:03:58.328566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:03:58.819206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:03:58.095865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:03:58.458300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:03:58.939945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:03:58.216027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:03:58.587616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:04:01.276414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번(NO)구분(LAMP)위도(LATITUDE)경도(LONGITUDE)
순번(NO)1.0000.9940.8930.826
구분(LAMP)0.9941.0000.6810.623
위도(LATITUDE)0.8930.6811.0000.712
경도(LONGITUDE)0.8260.6230.7121.000
2023-12-13T00:04:01.424208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번(NO)위도(LATITUDE)경도(LONGITUDE)구분(LAMP)
순번(NO)1.0000.0590.0000.932
위도(LATITUDE)0.0591.0000.5540.530
경도(LONGITUDE)0.0000.5541.0000.483
구분(LAMP)0.9320.5300.4831.000

Missing values

2023-12-13T00:03:59.088375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:03:59.194057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번(NO)구분(LAMP)일련번호(SER)위도(LATITUDE)경도(LONGITUDE)
20794207951서현동-99337.382096127.131069
20264202651서현동-48237.379582127.136118
956595661운중동-105137.392278127.080505
25169251701수진동-37037.43767127.133184
431243132탄리로-1037.439653127.137208
30677306781삼평동-104237.403055127.111955
16045160461정자동-110737.36651127.10707
23291232921복정동-33437.463946127.127143
23878238791복정동-75337.457167127.125849
381538162시민로-12137.446492127.139661
순번(NO)구분(LAMP)일련번호(SER)위도(LATITUDE)경도(LONGITUDE)
25697256981신흥동-33837.447716127.144098
12299123001궁내동-3737.374313127.102003
20789207901서현동-98837.381928127.131321
19783197841서현동-15337.372174127.140612
20200202011서현동-41837.375454127.136178
5955962서판교로-6737.392236127.099375
14331143321구미동-180337.343937127.127966
11587115881삼평동-30837.402689127.10263
10878108791판교동-91137.397243127.099092
28799288001야탑동-35137.407948127.154699