Overview

Dataset statistics

Number of variables1
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory156.2 KiB
Average record size in memory16.0 B

Variable types

Categorical1

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15249/F/1/datasetView.do

Alerts

Dataset has 1 (< 0.1%) duplicate rowsDuplicates
PK is highly imbalanced (99.2%)Imbalance

Reproduction

Analysis started2024-03-13 09:54:47.657129
Analysis finished2024-03-13 09:54:47.791638
Duration0.13 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

PK
Categorical

IMBALANCE 

Distinct29
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Yƣ+D[D)
9972 
:>ǻ)F&z/kH xGxg`?r^ͤR36/4`Ft)"x -
 
1
ç'៿wW??zbݟkW?W>֕vkq}k?_]vg_㋯N{{Zl1/߾m]|uqbf>3zh>_>hvxwz9]?WF;?]~=»-F#[ ]Kh_|s?vFlrҘ>1ݽ1=טn aWp- =ݽiz4fۣF{:>smkNhLon$4@iN''oӇ<fO~w?}+XQќWV?=tnt?bȸaάYΰgkj#zx*VXx6=X5nE'Xc.[le\gWMJM6{xRxt3?9mM8ʼڰk8{z CxUlL٣tmwƳMj<uŀ/^>GbcsfULcK3t}`CQ`tq6n\kՒ'e gƓ팗q#tی=q8FGSW4bo1!6q) ZX1X:HdzV1{
 
1
y
 
1
=UeX-
 
1
Other values (24)
 
24

Length

Max length336
Median length13
Mean length13.0854
Min length1

Unique

Unique28 ?
Unique (%)0.3%

Sample

1st rowYƣ+D[D)
2nd rowYƣ+D[D)
3rd rowYƣ+D[D)
4th rowYƣ+D[D)
5th rowYƣ+D[D)

Common Values

ValueCountFrequency (%)
Yƣ+D[D) 9972
99.7%
:>ǻ)F&z/kH xGxg`?r^ͤR36/4`Ft)"x - 1
 
< 0.1%
ç'៿wW??zbݟkW?W>֕vkq}k?_]vg_㋯N{{Zl1/߾m]|uqbf>3zh>_>hvxwz9]?WF;?]~=»-F#[ ]Kh_|s?vFlrҘ>1ݽ1=טn aWp- =ݽiz4fۣF{:>smkNhLon$4@iN''oӇ<fO~w?}+XQќWV?=tnt?bȸaάYΰgkj#zx*VXx6=X5nE'Xc.[le\gWMJM6{xRxt3?9mM8ʼڰk8{z CxUlL٣tmwƳMj<uŀ/^>GbcsfULcK3t}`CQ`tq6n\kՒ'e gƓ팗q#tی=q8FGSW4bo1!6q) ZX1X:HdzV1{ 1
 
< 0.1%
y 1
 
< 0.1%
=UeX- 1
 
< 0.1%
)=ga5[0f3 1
 
< 0.1%
j/@$Opf[WNrxz:qr04kաҚjnPh)<뇳dl?/aW߲ss=tZ[n=X3|liʉ `|EN8xI! 1
 
< 0.1%
x|`G㮶u_;ѐUOղwj s4ȥ-ZeN xe|o 1
 
< 0.1%
.XL0xaUk3 1
 
< 0.1%
O3QDGԶ5xZdtCD;' s-K*.|*Ow DIKx;~>/yb 1L5X>rtuPd BA\]lD#%~z:EHd::t!EЊ+&xg֞@C 1
 
< 0.1%
Other values (19) 19
 
0.2%

Length

2024-03-13T18:54:47.869080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
yƣ+d[d) 9972
99.0%
r 2
 
< 0.1%
zo7m3_vgcg$h>vmx"eaq 1
 
< 0.1%
avjrijz 1
 
< 0.1%
a2c}g<hr2w 1
 
< 0.1%
$-huanv 1
 
< 0.1%
gi"d'$<+=cvte3vt)ez#$tm!^f 1
 
< 0.1%
1
 
< 0.1%
1f'-u4!)8 1
 
< 0.1%
7ssz 1
 
< 0.1%
Other values (88) 88
 
0.9%

Missing values

2024-03-13T18:54:47.711502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-13T18:54:47.760569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

PK
8950Yƣ+D[D)
25843Yƣ+D[D)
19873Yƣ+D[D)
631Yƣ+D[D)
13586Yƣ+D[D)
2513Yƣ+D[D)
20380Yƣ+D[D)
21919Yƣ+D[D)
12112Yƣ+D[D)
763Yƣ+D[D)
PK
12437Yƣ+D[D)
28610Yƣ+D[D)
21998Yƣ+D[D)
30386Yƣ+D[D)
12031Yƣ+D[D)
21691Yƣ+D[D)
26688Yƣ+D[D)
17424Yƣ+D[D)
28424Yƣ+D[D)
13652Yƣ+D[D)

Duplicate rows

Most frequently occurring

PK# duplicates
0Yƣ+D[D)9972