Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 585.9 KiB
Average record size in memory: 60.0 B
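
The two memory figures agree with each other: 10000 records at an average of 60.0 B come to 600000 B, which is about 585.9 KiB when divided by 1024. A minimal check, using only the numbers reported above (the variable names are illustrative):

```python
# Values copied from the dataset statistics above.
n_observations = 10_000
avg_record_bytes = 60.0

total_bytes = n_observations * avg_record_bytes
total_kib = total_bytes / 1024  # 1 KiB = 1024 B

print(f"{total_kib:.1f} KiB")
```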

Variable types

Numeric: 4
Text: 1
Categorical: 1

Dataset

Description: File download
Author: Seoul Metropolitan Government (서울특별시)
URL: https://data.seoul.go.kr/dataList/OA-15067/S/1/datasetView.do

Alerts

NODE_ID is highly overall correlated with ARS_ID and 1 other field (High correlation)
ARS_ID is highly overall correlated with NODE_ID and 1 other field (High correlation)
Y좌표 is highly overall correlated with NODE_ID and 1 other field (High correlation)
NODE_ID has unique values (Unique)
ARS_ID has unique values (Unique)

Reproduction

Analysis started: 2024-04-06 11:26:39.407311
Analysis finished: 2024-04-06 11:26:44.145239
Duration: 4.74 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

NODE_ID
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.132309 × 10^8
Minimum: 1 × 10^8
Maximum: 1.6700064 × 10^8
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1 × 10^8
5-th percentile: 1.0100029 × 10^8
Q1: 1.0790011 × 10^8
Median: 1.1390007 × 10^8
Q3: 1.1990013 × 10^8
95-th percentile: 1.2300052 × 10^8
Maximum: 1.6700064 × 10^8
Range: 67000639
Interquartile range (IQR): 12000015

Descriptive statistics

Standard deviation: 6997297.5
Coefficient of variation (CV): 0.061796713
Kurtosis: -0.80155146
Mean: 1.132309 × 10^8
Median Absolute Deviation (MAD): 6000015
Skewness: -0.11528895
Sum: 1.132309 × 10^12
Variance: 4.8962172 × 10^13
Monotonicity: Not monotonic
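
Several of the figures above are derived from others: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the range is the maximum minus the minimum. The sketch below recomputes them from the rounded values printed in this report, so agreement is only expected up to rounding. The minimum and maximum are the smallest and largest NODE_ID values listed for this variable (100000001 and 167000640).

```python
# Rounded values as printed in the report for NODE_ID.
mean = 1.132309e8
std = 6997297.5
minimum, maximum = 100_000_001, 167_000_640

cv = std / mean              # coefficient of variation
variance = std ** 2          # variance is the squared standard deviation
value_range = maximum - minimum

print(f"CV={cv:.6f}, variance={variance:.4e}, range={value_range}")
```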
Histogram with fixed size bins (bins=50)
Value                Count  Frequency (%)
115000162            1      < 0.1%
115000557            1      < 0.1%
102900020            1      < 0.1%
104000258            1      < 0.1%
103900297            1      < 0.1%
113900157            1      < 0.1%
118900183            1      < 0.1%
122000697            1      < 0.1%
117900070            1      < 0.1%
120900119            1      < 0.1%
Other values (9990)  9990   99.9%

Value                Count  Frequency (%)
100000001            1      < 0.1%
100000002            1      < 0.1%
100000003            1      < 0.1%
100000004            1      < 0.1%
100000006            1      < 0.1%
100000007            1      < 0.1%
100000008            1      < 0.1%
100000009            1      < 0.1%
100000010            1      < 0.1%
100000011            1      < 0.1%

Value                Count  Frequency (%)
167000640            1      < 0.1%
124900141            1      < 0.1%
124900140            1      < 0.1%
124900139            1      < 0.1%
124900138            1      < 0.1%
124900137            1      < 0.1%
124900136            1      < 0.1%
124900135            1      < 0.1%
124900134            1      < 0.1%
124900133            1      < 0.1%

ARS_ID
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 14345.807
Minimum: 1001
Maximum: 25999
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1001
5-th percentile: 2522.95
Q1: 8589.75
Median: 14603.5
Q3: 20795.25
95-th percentile: 24429.05
Maximum: 25999
Range: 24998
Interquartile range (IQR): 12205.5
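
The fractional quantiles of this integer-valued column (for example Q1 = 8589.75) are what linear interpolation between neighbouring order statistics produces: with n = 10000 sorted values, the p-th quantile sits at position (n - 1) * p, e.g. 9999 * 0.25 = 2499.75. The sketch below assumes that interpolation rule, which the report itself does not state, and demonstrates it on a small hypothetical list rather than on the real column. `quantile_linear` is an illustrative helper, not part of any library.

```python
def quantile_linear(sorted_values, p):
    """Quantile by linear interpolation between neighbouring order statistics."""
    position = (len(sorted_values) - 1) * p
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])

# Hypothetical toy data, not taken from the dataset.
toy = [1001, 1002, 1003, 1004, 1006]
toy_q1 = quantile_linear(toy, 0.25)
toy_q3 = quantile_linear(toy, 0.75)

# Fractional position of Q1 in a 10000-row column.
q1_position = (10_000 - 1) * 0.25

# IQR recomputed from the Q1 and Q3 values reported above.
reported_iqr = 20795.25 - 8589.75
```

The recomputed IQR matches the reported 12205.5, and the fractional parts of the reported quantiles (.95, .75, .5, .25, .05) line up with the fractional positions 9999 * p for p = 0.05, 0.25, 0.5, 0.75, 0.95.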

Descriptive statistics

Standard deviation: 6984.6838
Coefficient of variation (CV): 0.48687981
Kurtosis: -1.1233465
Mean: 14345.807
Median Absolute Deviation (MAD): 6071.5
Skewness: -0.1550181
Sum: 1.4345807 × 10^8
Variance: 48785808
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value                Count  Frequency (%)
16259                1      < 0.1%
16851                1      < 0.1%
3551                 1      < 0.1%
5697                 1      < 0.1%
4533                 1      < 0.1%
14889                1      < 0.1%
19881                1      < 0.1%
23179                1      < 0.1%
18565                1      < 0.1%
21785                1      < 0.1%
Other values (9990)  9990   99.9%

Value                Count  Frequency (%)
1001                 1      < 0.1%
1002                 1      < 0.1%
1003                 1      < 0.1%
1004                 1      < 0.1%
1006                 1      < 0.1%
1007                 1      < 0.1%
1008                 1      < 0.1%
1010                 1      < 0.1%
1011                 1      < 0.1%
1012                 1      < 0.1%

Value                Count  Frequency (%)
25999                1      < 0.1%
25998                1      < 0.1%
25997                1      < 0.1%
25996                1      < 0.1%
25995                1      < 0.1%
25994                1      < 0.1%
25990                1      < 0.1%
25784                1      < 0.1%
25782                1      < 0.1%
25781                1      < 0.1%

정류소명 (stop name)
Text

Distinct: 6694
Distinct (%): 66.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 26
Median length: 19
Mean length: 7.7719
Min length: 2

Characters and Unicode

Total characters: 77719
Distinct characters: 662
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
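
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one stop name from this column (가양역9번출구.우성아파트); it is not how the report itself was computed. Note that `unicodedata.category` returns two-letter abbreviations, where `Lo` corresponds to Other Letter, `Nd` to Decimal Number and `Po` to Other Punctuation.

```python
import unicodedata
from collections import Counter

# One stop name taken from this column's sample rows.
sample = "가양역9번출구.우성아파트"

# Count how many characters fall into each Unicode general category.
category_counts = Counter(unicodedata.category(ch) for ch in sample)

for category, count in category_counts.items():
    print(category, count)
```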

Unique

Unique: 4105
Unique (%): 41.0%

Sample

1st row: 가양역9번출구.우성아파트
2nd row: 한광고
3rd row: 언북중학교입구
4th row: 벽산아파트
5th row: 신월초등학교
Value                Count  Frequency (%)
벽산아파트            12     0.1%
새마을금고            11     0.1%
현대아파트            10     0.1%
구로디지털단지역       9      0.1%
북서울꿈의숲          9      0.1%
삼성래미안아파트       8      0.1%
가산디지털단지역       8      0.1%
신대방역              8      0.1%
합정역                8      0.1%
우성아파트            8      0.1%
Other values (6685)  9910   99.1%

Most occurring characters

ValueCountFrequency (%)
2251
 
2.9%
2132
 
2.7%
. 2124
 
2.7%
2116
 
2.7%
2059
 
2.6%
1808
 
2.3%
1567
 
2.0%
1482
 
1.9%
1288
 
1.7%
1235
 
1.6%
Other values (652) 59657
76.8%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       72092  92.8%
Decimal Number     2514   3.2%
Other Punctuation  2150   2.8%
Uppercase Letter   669    0.9%
Close Punctuation  127    0.2%
Open Punctuation   125    0.2%
Lowercase Letter   32     < 0.1%
Dash Punctuation   9      < 0.1%
Space Separator    1      < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2251
 
3.1%
2132
 
3.0%
2116
 
2.9%
2059
 
2.9%
1808
 
2.5%
1567
 
2.2%
1482
 
2.1%
1288
 
1.8%
1235
 
1.7%
1226
 
1.7%
Other values (605) 54928
76.2%

Uppercase Letter
Value              Count  Frequency (%)
T                  86     12.9%
K                  71     10.6%
S                  70     10.5%
C                  66     9.9%
A                  56     8.4%
P                  52     7.8%
G                  42     6.3%
M                  38     5.7%
D                  33     4.9%
B                  29     4.3%
Other values (14)  126    18.8%

Decimal Number
Value  Count  Frequency (%)
1      749    29.8%
2      472    18.8%
3      342    13.6%
4      211    8.4%
5      168    6.7%
0      160    6.4%
7      124    4.9%
6      122    4.9%
9      105    4.2%
8      61     2.4%

Other Punctuation
Value  Count  Frequency (%)
.      2124   98.8%
·      12     0.6%
&      11     0.5%
,      2      0.1%
?      1      < 0.1%

Lowercase Letter
Value  Count  Frequency (%)
e      24     75.0%
k      4      12.5%
t      2      6.2%
s      2      6.2%

Close Punctuation
Value  Count  Frequency (%)
)      127    100.0%

Open Punctuation
Value  Count  Frequency (%)
(      125    100.0%

Dash Punctuation
Value  Count  Frequency (%)
-      9      100.0%

Space Separator
Value    Count  Frequency (%)
(space)  1      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  72092  92.8%
Common  4926   6.3%
Latin   701    0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2251
 
3.1%
2132
 
3.0%
2116
 
2.9%
2059
 
2.9%
1808
 
2.5%
1567
 
2.2%
1482
 
2.1%
1288
 
1.8%
1235
 
1.7%
1226
 
1.7%
Other values (605) 54928
76.2%

Latin
Value              Count  Frequency (%)
T                  86     12.3%
K                  71     10.1%
S                  70     10.0%
C                  66     9.4%
A                  56     8.0%
P                  52     7.4%
G                  42     6.0%
M                  38     5.4%
D                  33     4.7%
B                  29     4.1%
Other values (18)  158    22.5%

Common
Value             Count  Frequency (%)
.                 2124   43.1%
1                 749    15.2%
2                 472    9.6%
3                 342    6.9%
4                 211    4.3%
5                 168    3.4%
0                 160    3.2%
)                 127    2.6%
(                 125    2.5%
7                 124    2.5%
Other values (9)  324    6.6%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  72092  92.8%
ASCII   5615   7.2%
None    12     < 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2251
 
3.1%
2132
 
3.0%
2116
 
2.9%
2059
 
2.9%
1808
 
2.5%
1567
 
2.2%
1482
 
2.1%
1288
 
1.8%
1235
 
1.7%
1226
 
1.7%
Other values (605) 54928
76.2%

ASCII
Value              Count  Frequency (%)
.                  2124   37.8%
1                  749    13.3%
2                  472    8.4%
3                  342    6.1%
4                  211    3.8%
5                  168    3.0%
0                  160    2.8%
)                  127    2.3%
(                  125    2.2%
7                  124    2.2%
Other values (36)  1013   18.0%

None
Value  Count  Frequency (%)
·      12     100.0%

X좌표 (X coordinate)
Real number (ℝ)

Distinct: 9992
Distinct (%): 99.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 126.98632
Minimum: 126.45723
Maximum: 127.18176
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 126.45723
5-th percentile: 126.84227
Q1: 126.91706
Median: 126.99511
Q3: 127.05156
95-th percentile: 127.12836
Maximum: 127.18176
Range: 0.72453
Interquartile range (IQR): 0.13450275

Descriptive statistics

Standard deviation: 0.086448895
Coefficient of variation (CV): 0.00068077329
Kurtosis: -0.73179472
Mean: 126.98632
Median Absolute Deviation (MAD): 0.068365881
Skewness: -0.063892168
Sum: 1269863.2
Variance: 0.0074734115
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value                Count  Frequency (%)
127.18176            2      < 0.1%
127.040517           2      < 0.1%
127.0360924585       2      < 0.1%
127.0520250488       2      < 0.1%
127.1443598929       2      < 0.1%
127.1480886874       2      < 0.1%
127.013138           2      < 0.1%
127.013707           2      < 0.1%
126.8128627805       1      < 0.1%
127.1090124148       1      < 0.1%
Other values (9982)  9982   99.8%

Value                Count  Frequency (%)
126.45723            1      < 0.1%
126.7210313414       1      < 0.1%
126.797811           1      < 0.1%
126.7978638462       1      < 0.1%
126.797978           1      < 0.1%
126.798335           1      < 0.1%
126.7984631135       1      < 0.1%
126.7985207144       1      < 0.1%
126.7985641294       1      < 0.1%
126.7987623811       1      < 0.1%

Value                Count  Frequency (%)
127.18176            2      < 0.1%
127.1817343335       1      < 0.1%
127.1816669472       1      < 0.1%
127.18013794         1      < 0.1%
127.18013            1      < 0.1%
127.1799002887       1      < 0.1%
127.1798392415       1      < 0.1%
127.179726           1      < 0.1%
127.1797196537       1      < 0.1%
127.1794170581       1      < 0.1%

Y좌표 (Y coordinate)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 9991
Distinct (%): 99.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 37.549975
Minimum: 37.43052
Maximum: 37.690177
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 37.43052
5-th percentile: 37.471265
Q1: 37.50218
Median: 37.549361
Q3: 37.589644
95-th percentile: 37.646861
Maximum: 37.690177
Range: 0.25965706
Interquartile range (IQR): 0.087464028

Descriptive statistics

Standard deviation: 0.054737504
Coefficient of variation (CV): 0.0014577241
Kurtosis: -0.76947379
Mean: 37.549975
Median Absolute Deviation (MAD): 0.0442798
Skewness: 0.26675677
Sum: 375499.75
Variance: 0.0029961944
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value                Count  Frequency (%)
37.4763036239        2      < 0.1%
37.450172246         2      < 0.1%
37.4892113897        2      < 0.1%
37.6015704448        2      < 0.1%
37.5625221156        2      < 0.1%
37.5361286235        2      < 0.1%
37.5553932418        2      < 0.1%
37.5661382638        2      < 0.1%
37.553371595         2      < 0.1%
37.5039954584        1      < 0.1%
Other values (9981)  9981   99.8%

Value                Count  Frequency (%)
37.4305199435        1      < 0.1%
37.4309469125        1      < 0.1%
37.4345128931        1      < 0.1%
37.4347964213        1      < 0.1%
37.4348585994        1      < 0.1%
37.4349735461        1      < 0.1%
37.4350042057        1      < 0.1%
37.4355241561        1      < 0.1%
37.4371542291        1      < 0.1%
37.4373210738        1      < 0.1%

Value                Count  Frequency (%)
37.690177            1      < 0.1%
37.6899483575        1      < 0.1%
37.6898762161        1      < 0.1%
37.6893500743        1      < 0.1%
37.689202857         1      < 0.1%
37.6890118581        1      < 0.1%
37.688568            1      < 0.1%
37.6879883235        1      < 0.1%
37.6879397664        1      < 0.1%
37.6874938159        1      < 0.1%

정류소타입 (stop type)
Categorical

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 5
Median length: 4
Mean length: 4.0449
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반차로
2nd row: 마을버스
3rd row: 일반차로
4th row: 마을버스
5th row: 일반차로

Common Values

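Value                                       Count  Frequency (%)
일반차로 (regular lane)                      5494   54.9%
마을버스 (village bus)                       3704   37.0%
중앙차로 (median bus lane)                   353    3.5%
가로변시간 (curbside lane, time-restricted)   236    2.4%
가로변전일 (curbside lane, all day)           134    1.3%
가상정류장 (virtual stop)                     79     0.8%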
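
The frequency percentages and the mean length reported for this variable follow directly from the six counts. A minimal sketch that recomputes them from the table above (the dictionary simply restates the reported counts; string length is counted in Unicode code points):

```python
# Counts copied from the Common Values table.
counts = {
    "일반차로": 5494,
    "마을버스": 3704,
    "중앙차로": 353,
    "가로변시간": 236,
    "가로변전일": 134,
    "가상정류장": 79,
}

total = sum(counts.values())

# Share of each category, rounded to one decimal as in the report.
frequencies = {value: round(100 * n / total, 1) for value, n in counts.items()}

# Count-weighted mean string length.
mean_length = sum(len(value) * n for value, n in counts.items()) / total

print(total, frequencies, mean_length)
```

Three of the categories have five characters and together account for 449 rows, so the mean length is 4 + 449/10000 = 4.0449, matching the Length statistics.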

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

Interactions

[Figures: 16 interaction plots; images not preserved]

Correlations

            NODE_ID  ARS_ID  X좌표    Y좌표    정류소타입
NODE_ID     1.000    0.977   0.822   0.835   0.132
ARS_ID      0.977    1.000   0.792   0.859   0.285
X좌표        0.822    0.792   1.000   0.397   0.207
Y좌표        0.835    0.859   0.397   1.000   0.168
정류소타입    0.132    0.285   0.207   0.168   1.000

            NODE_ID  ARS_ID  X좌표    Y좌표    정류소타입
NODE_ID     1.000    0.998   -0.051  -0.674  0.090
ARS_ID      0.998    1.000   -0.052  -0.674  0.154
X좌표        -0.051   -0.052  1.000   0.217   0.116
Y좌표        -0.674   -0.674  0.217   1.000   0.089
정류소타입    0.090    0.154   0.116   0.089   1.000
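
The High correlation alerts listed in the Overview are consistent with the second matrix above: flagging every off-diagonal pair whose absolute coefficient exceeds a threshold yields the same three variables, each with two correlated partners. The sketch below assumes a threshold of 0.5; that value is an assumption for illustration and is not stated in the report, and `correlated_with` is a hypothetical helper.

```python
columns = ["NODE_ID", "ARS_ID", "X좌표", "Y좌표", "정류소타입"]

# Second correlation matrix from this section, row by row.
matrix = [
    [1.000, 0.998, -0.051, -0.674, 0.090],
    [0.998, 1.000, -0.052, -0.674, 0.154],
    [-0.051, -0.052, 1.000, 0.217, 0.116],
    [-0.674, -0.674, 0.217, 1.000, 0.089],
    [0.090, 0.154, 0.116, 0.089, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off, not taken from the report


def correlated_with(name):
    """Return the other columns whose |coefficient| with `name` exceeds THRESHOLD."""
    i = columns.index(name)
    return [columns[j] for j, v in enumerate(matrix[i]) if j != i and abs(v) > THRESHOLD]


alerts = {name: correlated_with(name) for name in columns if correlated_with(name)}

for name, partners in alerts.items():
    print(f"{name} is highly correlated with {', '.join(partners)}")
```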

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index)  NODE_ID    ARS_ID  정류소명                      X좌표        Y좌표       정류소타입
6382     115000162  16259   가양역9번출구.우성아파트       126.853974  37.561452  일반차로
6512     115900235  16467   한광고                        126.858481  37.537793  마을버스
9821     122000024  23124   언북중학교입구                 127.030821  37.520298  일반차로
3290     108900093  9682    벽산아파트                     127.019933  37.640982  마을버스
6326     115000106  16203   신월초등학교                   126.838969  37.538635  일반차로
3977     110000197  11297   쌍용스윗닷홈아파트             127.045425  37.630398  일반차로
3228     108900138  9528    당진슈퍼                       127.032306  37.625575  마을버스
10254    122000741  23816   봉은사역코엑스인터컨티넨탈     127.057443  37.513613  일반차로
2037     106000431  7011    금란교회                       127.104033  37.60053   중앙차로
10528    123000174  24264   오금동대림아파트               127.127992  37.507916  일반차로

(index)  NODE_ID    ARS_ID  정류소명                      X좌표        Y좌표       정류소타입
3248     108900203  9551    수유1동주민센터.파출소         127.017525  37.630153  마을버스
5174     112900016  13839   금강빌라.인왕중학교           126.951626  37.592211  마을버스
363      100900030  1878    서울대치과대학                 126.997801  37.577408  마을버스
4003     110000222  11323   월계삼호4차아파트              127.065982  37.626663  일반차로
2259     106000228  7324    아남리치카운티아파트           127.085382  37.590406  일반차로
749      102000152  3246    중앙하이츠빌라앞               126.959881  37.537974  일반차로
3392     108900007  9890    번3동주민센터                  127.046675  37.626024  마을버스
5225     112900204  13912   홍제우체국                     126.946333  37.586761  마을버스
8256     119000056  20149   상도초등학교입구               126.936766  37.503306  일반차로
7313     116900126  17954   1호선구일역                    126.872215  37.495217  마을버스