Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory507.8 KiB
Average record size in memory52.0 B

Variable types

Numeric3
Categorical1
Text1

Dataset

Description서울특별시 서대문구 도로상 조명시설 지오태깅 데이터입니다. 일련번호, 위도, 경도 데이터를 제공하고 있습니다.
Author서울특별시 서대문구
URLhttps://www.data.go.kr/data/15109206/fileData.do

Alerts

순번 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 순번High correlation
순번 has unique valuesUnique
일련번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 02:52:20.936047
Analysis finished2023-12-12 02:52:23.065798
Duration2.13 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6125.6932
Minimum1
Maximum12205
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T11:52:23.158994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile623.95
Q13088.75
median6120.5
Q39169.25
95-th percentile11598.05
Maximum12205
Range12204
Interquartile range (IQR)6080.5

Descriptive statistics

Standard deviation3521.7454
Coefficient of variation (CV)0.57491378
Kurtosis-1.1992181
Mean6125.6932
Median Absolute Deviation (MAD)3041
Skewness-0.0070694083
Sum61256932
Variance12402691
MonotonicityNot monotonic
2023-12-12T11:52:23.316465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1947 1
 
< 0.1%
7239 1
 
< 0.1%
9310 1
 
< 0.1%
12175 1
 
< 0.1%
880 1
 
< 0.1%
4528 1
 
< 0.1%
316 1
 
< 0.1%
780 1
 
< 0.1%
8723 1
 
< 0.1%
1539 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
6 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
12 1
< 0.1%
13 1
< 0.1%
14 1
< 0.1%
ValueCountFrequency (%)
12205 1
< 0.1%
12204 1
< 0.1%
12203 1
< 0.1%
12202 1
< 0.1%
12200 1
< 0.1%
12199 1
< 0.1%
12198 1
< 0.1%
12196 1
< 0.1%
12195 1
< 0.1%
12194 1
< 0.1%

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2
7575 
1
2425 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th row2
5th row2

Common Values

ValueCountFrequency (%)
2 7575
75.8%
1 2425
 
24.2%

Length

2023-12-12T11:52:23.444402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:52:23.558324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 7575
75.8%
1 2425
 
24.2%

일련번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T11:52:23.861837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length7.2126
Min length4

Characters and Unicode

Total characters72126
Distinct characters78
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st row북아현로-577
2nd row연희로-1134
3rd row연희로-667
4th row수색로-148
5th row연희로-1260
ValueCountFrequency (%)
북아현로-577 1
 
< 0.1%
북아현로-104 1
 
< 0.1%
북아현로-391 1
 
< 0.1%
증가로-770 1
 
< 0.1%
북가좌동-48 1
 
< 0.1%
남가좌동-451 1
 
< 0.1%
북아현로-142 1
 
< 0.1%
모래내로-66 1
 
< 0.1%
연세로-177 1
 
< 0.1%
증가로-397 1
 
< 0.1%
Other values (9990) 9990
99.9%
2023-12-12T11:52:24.333314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 10000
 
13.9%
7575
 
10.5%
1 4542
 
6.3%
2 3551
 
4.9%
3 3284
 
4.6%
4 2911
 
4.0%
5 2509
 
3.5%
2442
 
3.4%
6 2305
 
3.2%
7 2182
 
3.0%
Other values (68) 30825
42.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 34857
48.3%
Decimal Number 27269
37.8%
Dash Punctuation 10000
 
13.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7575
21.7%
2442
 
7.0%
2005
 
5.8%
1973
 
5.7%
1649
 
4.7%
1564
 
4.5%
1126
 
3.2%
1016
 
2.9%
933
 
2.7%
845
 
2.4%
Other values (57) 13729
39.4%
Decimal Number
ValueCountFrequency (%)
1 4542
16.7%
2 3551
13.0%
3 3284
12.0%
4 2911
10.7%
5 2509
9.2%
6 2305
8.5%
7 2182
8.0%
8 2074
7.6%
9 2002
7.3%
0 1909
7.0%
Dash Punctuation
ValueCountFrequency (%)
- 10000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 37269
51.7%
Hangul 34857
48.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7575
21.7%
2442
 
7.0%
2005
 
5.8%
1973
 
5.7%
1649
 
4.7%
1564
 
4.5%
1126
 
3.2%
1016
 
2.9%
933
 
2.7%
845
 
2.4%
Other values (57) 13729
39.4%
Common
ValueCountFrequency (%)
- 10000
26.8%
1 4542
12.2%
2 3551
 
9.5%
3 3284
 
8.8%
4 2911
 
7.8%
5 2509
 
6.7%
6 2305
 
6.2%
7 2182
 
5.9%
8 2074
 
5.6%
9 2002
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 37269
51.7%
Hangul 34857
48.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 10000
26.8%
1 4542
12.2%
2 3551
 
9.5%
3 3284
 
8.8%
4 2911
 
7.8%
5 2509
 
6.7%
6 2305
 
6.2%
7 2182
 
5.9%
8 2074
 
5.6%
9 2002
 
5.4%
Hangul
ValueCountFrequency (%)
7575
21.7%
2442
 
7.0%
2005
 
5.8%
1973
 
5.7%
1649
 
4.7%
1564
 
4.5%
1126
 
3.2%
1016
 
2.9%
933
 
2.7%
845
 
2.4%
Other values (57) 13729
39.4%

위도
Real number (ℝ)

Distinct6969
Distinct (%)69.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.575172
Minimum37.55534
Maximum37.606088
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T11:52:24.530131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum37.55534
5-th percentile37.55796
Q137.565064
median37.57514
Q337.58317
95-th percentile37.59771
Maximum37.606088
Range0.05074765
Interquartile range (IQR)0.018105838

Descriptive statistics

Standard deviation0.011988397
Coefficient of variation (CV)0.00031905102
Kurtosis-0.69954019
Mean37.575172
Median Absolute Deviation (MAD)0.008990305
Skewness0.3386721
Sum375751.72
Variance0.00014372166
MonotonicityNot monotonic
2023-12-12T11:52:24.691655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.57513727 32
 
0.3%
37.59063 21
 
0.2%
37.58281 16
 
0.2%
37.59332 14
 
0.1%
37.5631822 14
 
0.1%
37.55978863 13
 
0.1%
37.58491135 12
 
0.1%
37.55814305 11
 
0.1%
37.59366855 11
 
0.1%
37.59365 10
 
0.1%
Other values (6959) 9846
98.5%
ValueCountFrequency (%)
37.55534 1
< 0.1%
37.55535 2
< 0.1%
37.55538 1
< 0.1%
37.55541 1
< 0.1%
37.55542 1
< 0.1%
37.55545 2
< 0.1%
37.55547 1
< 0.1%
37.55549 1
< 0.1%
37.5555 1
< 0.1%
37.55551 1
< 0.1%
ValueCountFrequency (%)
37.60608765 2
< 0.1%
37.60586728 1
< 0.1%
37.60578019 1
< 0.1%
37.60571262 2
< 0.1%
37.6057068 1
< 0.1%
37.605706 1
< 0.1%
37.60567451 1
< 0.1%
37.60560891 1
< 0.1%
37.60554783 1
< 0.1%
37.60552394 2
< 0.1%

경도
Real number (ℝ)

Distinct7504
Distinct (%)75.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean126.9366
Minimum126.90302
Maximum126.96946
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T11:52:24.917771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.90302
5-th percentile126.91108
Q1126.92527
median126.93478
Q3126.94914
95-th percentile126.9617
Maximum126.96946
Range0.06644
Interquartile range (IQR)0.0238711

Descriptive statistics

Standard deviation0.015711442
Coefficient of variation (CV)0.00012377393
Kurtosis-0.92336501
Mean126.9366
Median Absolute Deviation (MAD)0.01216075
Skewness0.012539795
Sum1269366
Variance0.00024684939
MonotonicityNot monotonic
2023-12-12T11:52:25.109485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.9275107 32
 
0.3%
126.9526 20
 
0.2%
126.93765 14
 
0.1%
126.936757 13
 
0.1%
126.9596887 13
 
0.1%
126.93309 13
 
0.1%
126.9270902 12
 
0.1%
126.94917 11
 
0.1%
126.9440951 11
 
0.1%
126.9334809 11
 
0.1%
Other values (7494) 9850
98.5%
ValueCountFrequency (%)
126.90302 1
< 0.1%
126.90316 1
< 0.1%
126.90325 1
< 0.1%
126.90329 1
< 0.1%
126.9033 1
< 0.1%
126.90333 1
< 0.1%
126.90363 1
< 0.1%
126.90371 1
< 0.1%
126.90374 2
< 0.1%
126.90379 1
< 0.1%
ValueCountFrequency (%)
126.96946 1
< 0.1%
126.96944 1
< 0.1%
126.96935 1
< 0.1%
126.96928 1
< 0.1%
126.96914 1
< 0.1%
126.96907 1
< 0.1%
126.96883 1
< 0.1%
126.96869 1
< 0.1%
126.96865 1
< 0.1%
126.96843 1
< 0.1%

Interactions

2023-12-12T11:52:22.522759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:52:21.819535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:52:22.179227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:52:22.666447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:52:21.933262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:52:22.307688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:52:22.777086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:52:22.049718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:52:22.411169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T11:52:25.264216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번구분위도경도
순번1.0000.9940.9190.772
구분0.9941.0000.4150.349
위도0.9190.4151.0000.755
경도0.7720.3490.7551.000
2023-12-12T11:52:25.372404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번위도경도구분
순번1.0000.288-0.2650.931
위도0.2881.000-0.1890.318
경도-0.265-0.1891.0000.268
구분0.9310.3180.2681.000

Missing values

2023-12-12T11:52:22.912438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:52:23.017790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번구분일련번호위도경도
194619472북아현로-57737.564828126.958474
552255232연희로-113437.586982126.930687
343034312연희로-66737.571699126.927775
762776282수색로-14837.57567126.90567
608460852연희로-126037.59275126.933472
12034120351홍은동-8937.58399126.92457
560856092통일로-47137.587392126.94462
812081212증가로-73637.5847126.91371
645164522홍은중앙로-10237.598642126.949209
2012022신촌로-7337.557623126.958142
순번구분일련번호위도경도
423742382증가로-19437.578397126.927047
386938702연희로-86137.575351126.939481
865386542연희로-136537.5862126.92696
509350942통일로-21937.584764126.94934
518351842연희로-105737.585284126.934345
7027032성산로-937.559261126.940084
10249102501북아현동-7637.55755126.95615
281128122독립문로-10637.568907126.959911
11848118491남가좌동-26737.57416126.92208
529252932통일로-28737.585998126.940443