Overview

Dataset statistics

Number of variables: 8
Number of observations: 10000
Missing cells: 10002
Missing cells (%): 12.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 732.4 KiB
Average record size in memory: 75.0 B
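
The derived figures above can be checked against the raw counts. The sketch below assumes that the missing-cell percentage is missing cells divided by (variables × observations) and that the average record size is total memory divided by observations; the report does not state these definitions, so treat them as assumptions.

```python
# Figures copied from the Dataset statistics table.
n_variables = 8
n_observations = 10_000
missing_cells = 10_002
total_size_kib = 732.4

# Assumed definitions (not stated in the report itself).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions both values round to the figures reported above (12.5% and 75.0 B).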

Variable types

Unsupported: 2
Numeric: 2
Categorical: 4

Dataset

Description: File download (파일 다운로드)
Author: Seoul Metropolitan Government (서울특별시)
URL: https://data.seoul.go.kr/dataList/OA-15537/S/1/datasetView.do

Alerts

차선 시설물 현황 is highly overall correlated with Unnamed: 2 (High correlation)
Unnamed: 6 is highly overall correlated with Unnamed: 2 (High correlation)
Unnamed: 2 is highly overall correlated with 차선 시설물 현황 and 1 other field (High correlation)
Unnamed: 3 is highly overall correlated with Unnamed: 4 and 1 other field (High correlation)
Unnamed: 4 is highly overall correlated with Unnamed: 3 (High correlation)
Unnamed: 5 is highly overall correlated with Unnamed: 3 (High correlation)
Unnamed: 0 has 10000 (100.0%) missing values (Missing)
Unnamed: 0 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)

Reproduction

Analysis started: 2024-05-04 05:44:04.371862
Analysis finished: 2024-05-04 05:44:07.315569
Duration: 2.94 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

Unnamed: 0
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 10000
Missing (%): 100.0%
Memory size: 166.0 KiB

차선 시설물 현황
Real number (ℝ)

HIGH CORRELATION 

Distinct: 9999
Distinct (%): 100.0%
Missing: 1
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 50334.773
Minimum: 13
Maximum: 99994
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 13
5-th percentile: 4782.2
Q1: 24764
Median: 50572
Q3: 75499.5
95-th percentile: 95105.5
Maximum: 99994
Range: 99981
Interquartile range (IQR): 50735.5
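
The range and the interquartile range follow directly from the other quantiles. A minimal sketch, using only the values listed above:

```python
# Values copied from the quantile statistics above.
minimum, maximum = 13, 99994
q1, q3 = 24764, 75499.5

value_range = maximum - minimum  # spread between the two extremes
iqr = q3 - q1                    # spread of the middle 50% of values

print(value_range)  # 99981
print(iqr)          # 50735.5
```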

Descriptive statistics

Standard deviation: 29066.912
Coefficient of variation (CV): 0.5774718
Kurtosis: -1.216983
Mean: 50334.773
Median Absolute Deviation (MAD): 25331
Skewness: -0.023338136
Sum: 5.032974 × 10^8
Variance: 8.4488536 × 10^8
Monotonicity: Not monotonic
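
Several of these descriptive statistics are linked by simple identities: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of non-missing values. The sketch below recomputes them from the rounded figures above, so small rounding differences are expected; the non-missing count of 9999 is inferred from the 10000 observations and the single missing value reported for this variable.

```python
import math

# Rounded figures copied from the report.
std = 29066.912
mean = 50334.773
n_non_missing = 10_000 - 1  # one missing value in this column

cv = std / mean               # coefficient of variation
variance = std ** 2           # variance is the square of the standard deviation
total = mean * n_non_missing  # sum of all non-missing values

# Compare with the reported values, allowing for rounding in the inputs.
print(math.isclose(cv, 0.5774718, rel_tol=1e-6))
print(math.isclose(variance, 8.4488536e8, rel_tol=1e-6))
print(math.isclose(total, 5.032974e8, rel_tol=1e-6))
```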
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
45334 | 1 | < 0.1%
25958 | 1 | < 0.1%
82007 | 1 | < 0.1%
84136 | 1 | < 0.1%
31496 | 1 | < 0.1%
16724 | 1 | < 0.1%
57321 | 1 | < 0.1%
61730 | 1 | < 0.1%
42383 | 1 | < 0.1%
45126 | 1 | < 0.1%
Other values (9989) | 9989 | 99.9%

Value | Count | Frequency (%)
13 | 1 | < 0.1%
20 | 1 | < 0.1%
28 | 1 | < 0.1%
34 | 1 | < 0.1%
56 | 1 | < 0.1%
64 | 1 | < 0.1%
80 | 1 | < 0.1%
86 | 1 | < 0.1%
91 | 1 | < 0.1%
96 | 1 | < 0.1%

Value | Count | Frequency (%)
99994 | 1 | < 0.1%
99982 | 1 | < 0.1%
99981 | 1 | < 0.1%
99972 | 1 | < 0.1%
99932 | 1 | < 0.1%
99929 | 1 | < 0.1%
99923 | 1 | < 0.1%
99913 | 1 | < 0.1%
99910 | 1 | < 0.1%
99890 | 1 | < 0.1%

Unnamed: 2
Categorical

HIGH CORRELATION 

Distinct: 8
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

강남구: 2804
강서구: 1914
강동구: 1426
관악구: 1335
광진구: 1076
Other values (3): 1445

Length

Max length: 4
Median length: 3
Mean length: 3.0001
Min length: 3

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 강북구
2nd row: 강남구
3rd row: 강동구
4th row: 광진구
5th row: 강남구

Common Values

Value | Count | Frequency (%)
강남구 | 2804 | 28.0%
강서구 | 1914 | 19.1%
강동구 | 1426 | 14.3%
관악구 | 1335 | 13.4%
광진구 | 1076 | 10.8%
강북구 | 1030 | 10.3%
구로구 | 414 | 4.1%
<NA> | 1 | < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Plot of the category counts listed under Common Values.

Unnamed: 3
Categorical

HIGH CORRELATION 

Distinct: 17
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

주차금지: 2479
주정차금지: 1871
일시정지선: 1276
차선: 1254
중앙선: 782
Other values (12): 2338

Length

Max length: 10
Median length: 7
Mean length: 4.4617
Min length: 2

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 주정차금지
2nd row: 주차금지
3rd row: 주행유도선
4th row: 일시정지선
5th row: 주정차금지

Common Values

Value | Count | Frequency (%)
주차금지 | 2479 | 24.8%
주정차금지 | 1871 | 18.7%
일시정지선 | 1276 | 12.8%
차선 | 1254 | 12.5%
중앙선 | 782 | 7.8%
가변주차장 | 769 | 7.7%
진로변경제한선 | 569 | 5.7%
주행유도선 | 270 | 2.7%
가장자리구역선 | 189 | 1.9%
자전거횡단도 | 187 | 1.9%
Other values (7) | 354 | 3.5%

Length

Histogram of lengths of the category

Unnamed: 4
Categorical

HIGH CORRELATION 

Distinct: 13
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

백색실선: 3239
황색점선: 2574
황색실선: 2234
백색점선: 1511
황색복선: 329
Other values (8): 113

Length

Max length: 7
Median length: 4
Mean length: 4.0117
Min length: 4

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 황색실선
2nd row: 황색점선
3rd row: 백색점선
4th row: 백색실선
5th row: 황색실선

Common Values

Value | Count | Frequency (%)
백색실선 | 3239 | 32.4%
황색점선 | 2574 | 25.7%
황색실선 | 2234 | 22.3%
백색점선 | 1511 | 15.1%
황색복선 | 329 | 3.3%
청색점선 | 34 | 0.3%
분홍색실선 | 21 | 0.2%
청색복선실선 | 21 | 0.2%
청색실선 | 15 | 0.1%
청색복선점선 | 9 | 0.1%
Other values (3) | 13 | 0.1%

Length

Histogram of lengths of the category

Unnamed: 5
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

구도: 5501
시도: 4498
<NA>: 1

Length

Max length: 4
Median length: 2
Mean length: 2.0002
Min length: 2

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 구도
2nd row: 시도
3rd row: 구도
4th row: 시도
5th row: 구도

Common Values

Value | Count | Frequency (%)
구도 | 5501 | 55.0%
시도 | 4498 | 45.0%
<NA> | 1 | < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Plot of the category counts listed under Common Values.

Unnamed: 6
Real number (ℝ)

HIGH CORRELATION 

Distinct: 9986
Distinct (%): 99.9%
Missing: 1
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 200189.06
Minimum: 181826.28
Maximum: 216165.71
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 181826.28
5-th percentile: 184137.65
Q1: 192638.98
Median: 202821.29
Q3: 207160.78
95-th percentile: 213330.12
Maximum: 216165.71
Range: 34339.43
Interquartile range (IQR): 14521.798

Descriptive statistics

Standard deviation: 9423.9285
Coefficient of variation (CV): 0.047075142
Kurtosis: -1.0501169
Mean: 200189.06
Median Absolute Deviation (MAD): 6601.1724
Skewness: -0.39774193
Sum: 2.0016904 × 10^9
Variance: 88810429
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
203035.167966423 | 2 | < 0.1%
201710.993014069 | 2 | < 0.1%
202789.03125 | 2 | < 0.1%
215239.969418725 | 2 | < 0.1%
192919.28125 | 2 | < 0.1%
202150.421875 | 2 | < 0.1%
215341.815066342 | 2 | < 0.1%
201198.390625 | 2 | < 0.1%
202695.161672809 | 2 | < 0.1%
193559.04290232 | 2 | < 0.1%
Other values (9976) | 9979 | 99.8%

Value | Count | Frequency (%)
181826.281250001 | 1 | < 0.1%
181979.703125001 | 1 | < 0.1%
182069.859375 | 1 | < 0.1%
182073.046875 | 1 | < 0.1%
182114.84375 | 1 | < 0.1%
182122.140625 | 1 | < 0.1%
182123.56655701 | 1 | < 0.1%
182124.296875 | 1 | < 0.1%
182131.019771648 | 1 | < 0.1%
182143.050582174 | 1 | < 0.1%

Value | Count | Frequency (%)
216165.710934405 | 1 | < 0.1%
216153.223213106 | 1 | < 0.1%
216142.480925081 | 1 | < 0.1%
216125.50764621 | 1 | < 0.1%
216095.095446054 | 1 | < 0.1%
216068.514437031 | 1 | < 0.1%
216063.861175105 | 1 | < 0.1%
216049.8375001 | 1 | < 0.1%
216046.531249999 | 1 | < 0.1%
216043.801526661 | 1 | < 0.1%

Unnamed: 7
Unsupported

REJECTED  UNSUPPORTED 

Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Correlations

Variable | 차선 시설물 현황 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6
차선 시설물 현황 | 1.000 | 0.915 | 0.587 | 0.437 | 0.227 | 0.912
Unnamed: 2 | 0.915 | 1.000 | 0.398 | 0.319 | 0.069 | 0.923
Unnamed: 3 | 0.587 | 0.398 | 1.000 | 0.912 | 0.682 | 0.333
Unnamed: 4 | 0.437 | 0.319 | 0.912 | 1.000 | 0.600 | 0.271
Unnamed: 5 | 0.227 | 0.069 | 0.682 | 0.600 | 1.000 | 0.188
Unnamed: 6 | 0.912 | 0.923 | 0.333 | 0.271 | 0.188 | 1.000

Variable | Unnamed: 5 | Unnamed: 2 | Unnamed: 4 | Unnamed: 3
Unnamed: 5 | 1.000 | 0.074 | 0.469 | 0.545
Unnamed: 2 | 0.074 | 1.000 | 0.161 | 0.194
Unnamed: 4 | 0.469 | 0.161 | 1.000 | 0.631
Unnamed: 3 | 0.545 | 0.194 | 0.631 | 1.000

Variable | 차선 시설물 현황 | Unnamed: 6 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5
차선 시설물 현황 | 1.000 | -0.382 | 0.785 | 0.275 | 0.200 | 0.174
Unnamed: 6 | -0.382 | 1.000 | 0.802 | 0.137 | 0.117 | 0.144
Unnamed: 2 | 0.785 | 0.802 | 1.000 | 0.194 | 0.161 | 0.074
Unnamed: 3 | 0.275 | 0.137 | 0.194 | 1.000 | 0.631 | 0.545
Unnamed: 4 | 0.200 | 0.117 | 0.161 | 0.631 | 1.000 | 0.469
Unnamed: 5 | 0.174 | 0.144 | 0.074 | 0.545 | 0.469 | 1.000
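
The high-correlation alerts near the top of the report appear to line up with the last correlation matrix above. The sketch below lists every variable pair in that matrix whose absolute coefficient is at least 0.5. Both the 0.5 cut-off and the choice of this matrix as the source of the alerts are assumptions inferred from the numbers, not something the report states.

```python
from itertools import combinations

# Last correlation matrix from the report, in its original column order.
names = ["차선 시설물 현황", "Unnamed: 6", "Unnamed: 2",
         "Unnamed: 3", "Unnamed: 4", "Unnamed: 5"]
matrix = [
    [1.000, -0.382, 0.785, 0.275, 0.200, 0.174],
    [-0.382, 1.000, 0.802, 0.137, 0.117, 0.144],
    [0.785, 0.802, 1.000, 0.194, 0.161, 0.074],
    [0.275, 0.137, 0.194, 1.000, 0.631, 0.545],
    [0.200, 0.117, 0.161, 0.631, 1.000, 0.469],
    [0.174, 0.144, 0.074, 0.545, 0.469, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off; not stated in the report

# Visit each unordered pair once and keep those at or above the cut-off.
high_pairs = [
    (names[i], names[j], matrix[i][j])
    for i, j in combinations(range(len(names)), 2)
    if abs(matrix[i][j]) >= THRESHOLD
]

for a, b, r in high_pairs:
    print(f"{a} <-> {b}: {r:.3f}")
```

With this cut-off the four pairs found (차선 시설물 현황 with Unnamed: 2, Unnamed: 6 with Unnamed: 2, Unnamed: 3 with Unnamed: 4, and Unnamed: 3 with Unnamed: 5) are the same pairs named in the alerts.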

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
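
As an illustration of nullity correlation, the sketch below turns each column into a 0/1 "is missing" indicator and computes the Pearson correlation between two indicators. The rows and the column names a, b and c are hypothetical toy data, not taken from this dataset, and the helper is a plain-Python stand-in rather than the routine the report itself uses.

```python
import math

def pearson(xs, ys):
    """Pearson correlation; returns None if either input is constant."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return None  # undefined when a column is always (or never) missing
    return cov / math.sqrt(var_x * var_y)

# Hypothetical toy rows (not from the dataset); None marks a missing cell.
rows = [
    {"a": 1.0, "b": "x", "c": None},
    {"a": None, "b": None, "c": None},
    {"a": 2.0, "b": "y", "c": None},
    {"a": None, "b": None, "c": None},
]

def nullity(column):
    """1 where the cell is missing, 0 where it is present."""
    return [1 if row[column] is None else 0 for row in rows]

ab = pearson(nullity("a"), nullity("b"))  # a and b are always missing together
ac = pearson(nullity("a"), nullity("c"))  # c is missing in every row

print(ab, ac)
```

In this toy data a and b are missing in exactly the same rows, so their nullity correlation is 1. A column that is missing everywhere, as Unnamed: 0 is in this report, has a constant indicator, so its nullity correlation is undefined.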

Sample

Index | Unnamed: 0 | 차선 시설물 현황 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7
45335 | <NA> | 45334 | 강북구 | 주정차금지 | 황색실선 | 구도 | 201603.432 | 560650.668739
18496 | <NA> | 18495 | 강남구 | 주차금지 | 황색점선 | 시도 | 202059.270227 | 546496.563466
37931 | <NA> | 37930 | 강동구 | 주행유도선 | 백색점선 | 구도 | 215407.220861 | 550214.697958
87169 | <NA> | 87168 | 광진구 | 일시정지선 | 백색실선 | 시도 | 208507.390625 | 549081.1875
10224 | <NA> | 10223 | 강남구 | 주정차금지 | 황색실선 | 구도 | 203927.287069 | 542851.945346
85999 | <NA> | 85998 | 광진구 | 가장자리구역선 | 백색실선 | 시도 | 209762.120383 | 549581.550431
69952 | <NA> | 69951 | 강서구 | 차선 | 백색점선 | 구도 | 186537.03125 | 551475.03125
65292 | <NA> | 65291 | 강서구 | 주차금지 | 황색점선 | 구도 | 183299.168221 | 550998.923504
47606 | <NA> | 47605 | 강북구 | 주차금지 | 황색점선 | 구도 | 202267.172858 | 557892.978581
17587 | <NA> | 17586 | 강남구 | 주차금지 | 황색점선 | 구도 | 207072.359142 | 542539.96267

Index | Unnamed: 0 | 차선 시설물 현황 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7
90646 | <NA> | 90645 | 광진구 | 주차금지 | 황색점선 | 구도 | 207273.621993 | 551072.565404
15267 | <NA> | 15266 | 강남구 | 주차금지 | 황색점선 | 구도 | 204592.684527 | 545562.846741
97089 | <NA> | 97088 | 구로구 | 가변주차장 | 백색실선 | 구도 | 187031.604536 | 545276.779049
11131 | <NA> | 11130 | 강남구 | 주정차금지 | 황색실선 | 구도 | 204026.832543 | 547543.667601
53102 | <NA> | 53101 | 강북구 | 차선 | 백색점선 | 시도 | 202823.831193 | 556950.709127
13484 | <NA> | 13483 | 강남구 | 주정차금지 | 황색실선 | 시도 | 203771.324039 | 544768.698355
60410 | <NA> | 60409 | 강서구 | 주정차금지 | 황색실선 | 시도 | 184186.875 | 551448.0625
93746 | <NA> | 93745 | 광진구 | 중앙선 | 황색실선 | 시도 | 207613.021688 | 552048.638775
73290 | <NA> | 73289 | 관악구 | 버스전용차선(가로) | 청색복선점선 | 시도 | 195789.130347 | 542423.896028
5824 | <NA> | 5823 | 강남구 | 일시정지선 | 백색실선 | 구도 | 203295.937357 | 547251.171869