Overview

Dataset statistics

Number of variables5
Number of observations403
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory17.4 KiB
Average record size in memory44.3 B

Variable types

Numeric4
Text1

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-21288/F/1/datasetView.do

Alerts

표준ID is highly overall correlated with ARS_ID and 1 other fieldsHigh correlation
ARS_ID is highly overall correlated with 표준ID and 1 other fieldsHigh correlation
Y좌표 is highly overall correlated with 표준ID and 1 other fieldsHigh correlation
표준ID has unique valuesUnique
ARS_ID has unique valuesUnique
X좌표 has unique valuesUnique

Reproduction

Analysis started2024-05-11 06:09:53.410199
Analysis finished2024-05-11 06:09:56.125418
Duration2.72 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

표준ID
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct403
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.1214662 × 108
Minimum1 × 108
Maximum1.2400045 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.7 KiB
2024-05-11T15:09:56.242000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1 × 108
5-th percentile1.0000039 × 108
Q11.0600043 × 108
median1.120004 × 108
Q31.1800001 × 108
95-th percentile1.2300001 × 108
Maximum1.2400045 × 108
Range24000453
Interquartile range (IQR)11999581

Descriptive statistics

Standard deviation7043318.7
Coefficient of variation (CV)0.062804554
Kurtosis-1.0560534
Mean1.1214662 × 108
Median Absolute Deviation (MAD)5999970
Skewness-0.098186989
Sum4.5195089 × 1010
Variance4.9608339 × 1013
MonotonicityStrictly increasing
2024-05-11T15:09:56.464195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100000001 1
 
0.2%
116000005 1
 
0.2%
116000372 1
 
0.2%
116000014 1
 
0.2%
116000013 1
 
0.2%
116000012 1
 
0.2%
116000011 1
 
0.2%
116000010 1
 
0.2%
116000009 1
 
0.2%
116000008 1
 
0.2%
Other values (393) 393
97.5%
ValueCountFrequency (%)
100000001 1
0.2%
100000002 1
0.2%
100000003 1
0.2%
100000004 1
0.2%
100000005 1
0.2%
100000362 1
0.2%
100000363 1
0.2%
100000365 1
0.2%
100000366 1
0.2%
100000367 1
0.2%
ValueCountFrequency (%)
124000454 1
0.2%
124000453 1
0.2%
124000370 1
0.2%
124000369 1
0.2%
124000366 1
0.2%
124000365 1
0.2%
124000364 1
0.2%
124000363 1
0.2%
124000362 1
0.2%
124000361 1
0.2%

ARS_ID
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct403
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13258.568
Minimum1001
Maximum25014
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.7 KiB
2024-05-11T15:09:56.700357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1001
5-th percentile1021.1
Q17012.5
median13032
Q319010.5
95-th percentile24011.9
Maximum25014
Range24013
Interquartile range (IQR)11998

Descriptive statistics

Standard deviation6951.7908
Coefficient of variation (CV)0.5243244
Kurtosis-1.0183557
Mean13258.568
Median Absolute Deviation (MAD)5981
Skewness-0.1008642
Sum5343203
Variance48327395
MonotonicityNot monotonic
2024-05-11T15:09:56.953888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1001 1
 
0.2%
17005 1
 
0.2%
17015 1
 
0.2%
17014 1
 
0.2%
17013 1
 
0.2%
17012 1
 
0.2%
17011 1
 
0.2%
17010 1
 
0.2%
17009 1
 
0.2%
17008 1
 
0.2%
Other values (393) 393
97.5%
ValueCountFrequency (%)
1001 1
0.2%
1002 1
0.2%
1003 1
0.2%
1004 1
0.2%
1005 1
0.2%
1006 1
0.2%
1007 1
0.2%
1008 1
0.2%
1009 1
0.2%
1010 1
0.2%
ValueCountFrequency (%)
25014 1
0.2%
25013 1
0.2%
25012 1
0.2%
25011 1
0.2%
25010 1
0.2%
25009 1
0.2%
25008 1
0.2%
25007 1
0.2%
25006 1
0.2%
25005 1
0.2%
Distinct230
Distinct (%)57.1%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2024-05-11T15:09:57.244769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length15
Mean length8.057072
Min length3

Characters and Unicode

Total characters3247
Distinct characters287
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique63 ?
Unique (%)15.6%

Sample

1st row종로2가사거리
2nd row창경궁.서울대학교병원
3rd row명륜3가.성대입구
4th row종로2가.삼일교
5th row혜화동로터리.여운형활동터
ValueCountFrequency (%)
서울역버스환승센터 4
 
1.0%
여의도환승센터 4
 
1.0%
청량리역환승센터 3
 
0.7%
구로디지털단지역 3
 
0.7%
강서구청사거리.서울디지털대학교 2
 
0.5%
신도림역 2
 
0.5%
북가좌동삼거리 2
 
0.5%
신촌오거리.현대백화점 2
 
0.5%
웨딩타운 2
 
0.5%
아현역 2
 
0.5%
Other values (220) 377
93.5%
2024-05-11T15:09:57.756219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
174
 
5.4%
. 174
 
5.4%
77
 
2.4%
77
 
2.4%
69
 
2.1%
62
 
1.9%
59
 
1.8%
57
 
1.8%
48
 
1.5%
47
 
1.4%
Other values (277) 2403
74.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2983
91.9%
Other Punctuation 177
 
5.5%
Decimal Number 53
 
1.6%
Uppercase Letter 26
 
0.8%
Open Punctuation 3
 
0.1%
Close Punctuation 3
 
0.1%
Lowercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
174
 
5.8%
77
 
2.6%
77
 
2.6%
69
 
2.3%
62
 
2.1%
59
 
2.0%
57
 
1.9%
48
 
1.6%
47
 
1.6%
45
 
1.5%
Other values (253) 2268
76.0%
Uppercase Letter
ValueCountFrequency (%)
C 4
15.4%
T 4
15.4%
S 4
15.4%
B 2
7.7%
K 2
7.7%
G 2
7.7%
V 2
7.7%
L 2
7.7%
M 2
7.7%
D 2
7.7%
Decimal Number
ValueCountFrequency (%)
2 12
22.6%
1 10
18.9%
3 9
17.0%
0 8
15.1%
4 4
 
7.5%
6 4
 
7.5%
5 4
 
7.5%
9 2
 
3.8%
Other Punctuation
ValueCountFrequency (%)
. 174
98.3%
, 2
 
1.1%
· 1
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2983
91.9%
Common 236
 
7.3%
Latin 28
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
174
 
5.8%
77
 
2.6%
77
 
2.6%
69
 
2.3%
62
 
2.1%
59
 
2.0%
57
 
1.9%
48
 
1.6%
47
 
1.6%
45
 
1.5%
Other values (253) 2268
76.0%
Common
ValueCountFrequency (%)
. 174
73.7%
2 12
 
5.1%
1 10
 
4.2%
3 9
 
3.8%
0 8
 
3.4%
4 4
 
1.7%
6 4
 
1.7%
5 4
 
1.7%
( 3
 
1.3%
) 3
 
1.3%
Other values (3) 5
 
2.1%
Latin
ValueCountFrequency (%)
C 4
14.3%
T 4
14.3%
S 4
14.3%
B 2
7.1%
K 2
7.1%
G 2
7.1%
V 2
7.1%
L 2
7.1%
e 2
7.1%
M 2
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2983
91.9%
ASCII 263
 
8.1%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
174
 
5.8%
77
 
2.6%
77
 
2.6%
69
 
2.3%
62
 
2.1%
59
 
2.0%
57
 
1.9%
48
 
1.6%
47
 
1.6%
45
 
1.5%
Other values (253) 2268
76.0%
ASCII
ValueCountFrequency (%)
. 174
66.2%
2 12
 
4.6%
1 10
 
3.8%
3 9
 
3.4%
0 8
 
3.0%
C 4
 
1.5%
4 4
 
1.5%
6 4
 
1.5%
5 4
 
1.5%
T 4
 
1.5%
Other values (13) 30
 
11.4%
None
ValueCountFrequency (%)
· 1
100.0%

X좌표
Real number (ℝ)

UNIQUE 

Distinct403
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean126.98339
Minimum126.80983
Maximum127.17136
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.7 KiB
2024-05-11T15:09:57.972458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.80983
5-th percentile126.86023
Q1126.92187
median126.97788
Q3127.04025
95-th percentile127.11709
Maximum127.17136
Range0.36153454
Interquartile range (IQR)0.11838386

Descriptive statistics

Standard deviation0.078718721
Coefficient of variation (CV)0.0006199135
Kurtosis-0.65505808
Mean126.98339
Median Absolute Deviation (MAD)0.05869478
Skewness0.13490743
Sum51174.308
Variance0.006196637
MonotonicityNot monotonic
2024-05-11T15:09:58.175798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.9877522923 1
 
0.2%
126.8746997523 1
 
0.2%
126.902058345 1
 
0.2%
126.9020198843 1
 
0.2%
126.9015481288 1
 
0.2%
126.8551315518 1
 
0.2%
126.8554228318 1
 
0.2%
126.8620566339 1
 
0.2%
126.8601789687 1
 
0.2%
126.8670186731 1
 
0.2%
Other values (393) 393
97.5%
ValueCountFrequency (%)
126.8098252947 1
0.2%
126.8109375565 1
0.2%
126.8155482638 1
0.2%
126.8171527623 1
0.2%
126.8221992473 1
0.2%
126.8228597284 1
0.2%
126.8262227042 1
0.2%
126.827513 1
0.2%
126.8324754968 1
0.2%
126.8339921625 1
0.2%
ValueCountFrequency (%)
127.1713598368 1
0.2%
127.1706308587 1
0.2%
127.1615969135 1
0.2%
127.1610838354 1
0.2%
127.1488398094 1
0.2%
127.1477890164 1
0.2%
127.1429328548 1
0.2%
127.142104768 1
0.2%
127.1367996291 1
0.2%
127.1358552825 1
0.2%

Y좌표
Real number (ℝ)

HIGH CORRELATION 

Distinct402
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.552203
Minimum37.434859
Maximum37.689203
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.7 KiB
2024-05-11T15:09:58.613692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum37.434859
5-th percentile37.469094
Q137.509392
median37.557294
Q337.580701
95-th percentile37.643361
Maximum37.689203
Range0.25434426
Interquartile range (IQR)0.071308832

Descriptive statistics

Standard deviation0.051795365
Coefficient of variation (CV)0.0013792896
Kurtosis-0.20101744
Mean37.552203
Median Absolute Deviation (MAD)0.034130617
Skewness0.16161185
Sum15133.538
Variance0.0026827598
MonotonicityNot monotonic
2024-05-11T15:09:58.784246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.5553932418 2
 
0.5%
37.5698055407 1
 
0.2%
37.4998806283 1
 
0.2%
37.4846560288 1
 
0.2%
37.4830994287 1
 
0.2%
37.4971710251 1
 
0.2%
37.4973830961 1
 
0.2%
37.4964773551 1
 
0.2%
37.496906783 1
 
0.2%
37.4993764843 1
 
0.2%
Other values (392) 392
97.3%
ValueCountFrequency (%)
37.4348585994 1
0.2%
37.4401933055 1
0.2%
37.4409634671 1
0.2%
37.4478432162 1
0.2%
37.449067 1
0.2%
37.4522224011 1
0.2%
37.4530892573 1
0.2%
37.4557109244 1
0.2%
37.4557491207 1
0.2%
37.4568017217 1
0.2%
ValueCountFrequency (%)
37.689202857 1
0.2%
37.688568 1
0.2%
37.6839229454 1
0.2%
37.682672 1
0.2%
37.6777606492 1
0.2%
37.677262 1
0.2%
37.673289362 1
0.2%
37.67275 1
0.2%
37.6699992027 1
0.2%
37.669197 1
0.2%

Interactions

2024-05-11T15:09:55.341213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:09:53.725455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:09:54.169614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:09:54.706253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:09:55.470757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:09:53.825935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:09:54.300844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:09:54.871990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:09:55.623769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:09:53.947969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:09:54.426176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:09:55.030214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:09:55.774664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:09:54.060619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:09:54.572588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:09:55.184376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-11T15:09:58.889413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
표준IDARS_IDX좌표Y좌표
표준ID1.0001.0000.9430.900
ARS_ID1.0001.0000.9430.902
X좌표0.9430.9431.0000.746
Y좌표0.9000.9020.7461.000
2024-05-11T15:09:58.992395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
표준IDARS_IDX좌표Y좌표
표준ID1.0000.986-0.104-0.688
ARS_ID0.9861.000-0.110-0.694
X좌표-0.104-0.1101.0000.148
Y좌표-0.688-0.6940.1481.000

Missing values

2024-05-11T15:09:55.940892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-11T15:09:56.069061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

표준IDARS_ID정류소명X좌표Y좌표
01000000011001종로2가사거리126.98775237.569806
11000000021002창경궁.서울대학교병원126.99652137.579433
21000000031003명륜3가.성대입구126.99825137.58258
31000000041004종로2가.삼일교126.98761337.568579
41000000051005혜화동로터리.여운형활동터127.00174437.586243
510000036213037영천시장126.96313537.569528
610000036313036영천시장126.96253337.569939
71000003651037동대문역.흥인지문127.01245237.572105
81000003661041동묘앞127.0186137.573832
91000003671044동묘앞127.01955337.574246
표준IDARS_ID정류소명X좌표Y좌표
39312400036125001상일초교127.1713637.546125
39412400036225002상일초교127.17063137.545794
39512400036325006강동자이·프라자아파트127.14778937.53621
39612400036425008길동주민센터.둔촌2동주민센터127.14210537.533886
39712400036525010길동사거리.강동세무서127.13585537.534733
39812400036625012강동역127.13151637.536108
39912400036925003초이동127.16159737.542139
40012400037025004초이동127.16108437.541846
40112400045325013천호역127.12705237.537653
40212400045425014천호역127.12553937.537948