Overview

Dataset statistics

Number of variables6
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.9 KiB
Average record size in memory50.3 B

Variable types

Numeric1
Categorical2
Text3

Dataset

Description연번,동명,경로당명,구분,전 화,도로명 주소
Author강북구
URLhttps://data.seoul.go.kr/dataList/OA-11582/S/1/datasetView.do

Alerts

연번 is highly overall correlated with 동명High correlation
동명 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
경로당명 has unique valuesUnique
전 화 has unique valuesUnique

Reproduction

Analysis started2024-04-06 12:07:19.493971
Analysis finished2024-04-06 12:07:20.748822
Duration1.25 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-06T21:07:20.890631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2024-04-06T21:07:21.225176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

동명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)13.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
삼양
12 
번3
12 
송중
송천
번2
Other values (8)
49 

Length

Max length3
Median length2
Mean length2.07
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row삼양
2nd row삼양
3rd row삼양
4th row삼양
5th row삼양

Common Values

ValueCountFrequency (%)
삼양 12
12.0%
번3 12
12.0%
송중 9
9.0%
송천 9
9.0%
번2 9
9.0%
미아 8
8.0%
삼각산 7
7.0%
수2 7
7.0%
인수 7
7.0%
번1 6
6.0%
Other values (3) 14
14.0%

Length

2024-04-06T21:07:21.526114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
삼양 12
12.0%
번3 12
12.0%
송중 9
9.0%
송천 9
9.0%
번2 9
9.0%
미아 8
8.0%
삼각산 7
7.0%
수2 7
7.0%
인수 7
7.0%
번1 6
6.0%
Other values (3) 14
14.0%

경로당명
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2024-04-06T21:07:22.047867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length9
Mean length3.75
Min length2

Characters and Unicode

Total characters375
Distinct characters153
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row경남
2nd row래미안미아1차
3rd row미송
4th row미아벽산
5th row미양할머니
ValueCountFrequency (%)
래미안 2
 
2.0%
경남 1
 
1.0%
래미안미아1차 1
 
1.0%
삼흥 1
 
1.0%
번동3단지 1
 
1.0%
해모로 1
 
1.0%
한진 1
 
1.0%
한양 1
 
1.0%
한솔솔파크 1
 
1.0%
오현 1
 
1.0%
Other values (91) 91
89.2%
2024-04-06T21:07:22.997276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16
 
4.3%
15
 
4.0%
12
 
3.2%
10
 
2.7%
9
 
2.4%
9
 
2.4%
9
 
2.4%
6
 
1.6%
2 6
 
1.6%
6
 
1.6%
Other values (143) 277
73.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 344
91.7%
Decimal Number 18
 
4.8%
Uppercase Letter 7
 
1.9%
Space Separator 2
 
0.5%
Other Punctuation 2
 
0.5%
Close Punctuation 1
 
0.3%
Open Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16
 
4.7%
15
 
4.4%
12
 
3.5%
10
 
2.9%
9
 
2.6%
9
 
2.6%
9
 
2.6%
6
 
1.7%
6
 
1.7%
6
 
1.7%
Other values (126) 246
71.5%
Decimal Number
ValueCountFrequency (%)
2 6
33.3%
1 5
27.8%
3 2
 
11.1%
5 2
 
11.1%
6 1
 
5.6%
0 1
 
5.6%
4 1
 
5.6%
Uppercase Letter
ValueCountFrequency (%)
S 2
28.6%
K 2
28.6%
I 1
14.3%
A 1
14.3%
U 1
14.3%
Other Punctuation
ValueCountFrequency (%)
. 1
50.0%
& 1
50.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 344
91.7%
Common 24
 
6.4%
Latin 7
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16
 
4.7%
15
 
4.4%
12
 
3.5%
10
 
2.9%
9
 
2.6%
9
 
2.6%
9
 
2.6%
6
 
1.7%
6
 
1.7%
6
 
1.7%
Other values (126) 246
71.5%
Common
ValueCountFrequency (%)
2 6
25.0%
1 5
20.8%
3 2
 
8.3%
2
 
8.3%
5 2
 
8.3%
) 1
 
4.2%
. 1
 
4.2%
6 1
 
4.2%
0 1
 
4.2%
4 1
 
4.2%
Other values (2) 2
 
8.3%
Latin
ValueCountFrequency (%)
S 2
28.6%
K 2
28.6%
I 1
14.3%
A 1
14.3%
U 1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 344
91.7%
ASCII 31
 
8.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
16
 
4.7%
15
 
4.4%
12
 
3.5%
10
 
2.9%
9
 
2.6%
9
 
2.6%
9
 
2.6%
6
 
1.7%
6
 
1.7%
6
 
1.7%
Other values (126) 246
71.5%
ASCII
ValueCountFrequency (%)
2 6
19.4%
1 5
16.1%
S 2
 
6.5%
K 2
 
6.5%
3 2
 
6.5%
2
 
6.5%
5 2
 
6.5%
) 1
 
3.2%
. 1
 
3.2%
6 1
 
3.2%
Other values (7) 7
22.6%

구분
Categorical

Distinct8
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
구립
42 
사립아파트
34 
구임차
사립
사립아파트(150세대미만)
Other values (3)

Length

Max length14
Median length13
Mean length4.21
Min length2

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row사립아파트
2nd row사립아파트
3rd row구립
4th row사립아파트
5th row구립

Common Values

ValueCountFrequency (%)
구립 42
42.0%
사립아파트 34
34.0%
구임차 6
 
6.0%
사립 6
 
6.0%
사립아파트(150세대미만) 5
 
5.0%
사립아파트(임대) 4
 
4.0%
사립아파트(영구) 2
 
2.0%
사립아파트(구관리,영구) 1
 
1.0%

Length

2024-04-06T21:07:23.344060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T21:07:23.932290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
구립 42
42.0%
사립아파트 34
34.0%
구임차 6
 
6.0%
사립 6
 
6.0%
사립아파트(150세대미만 5
 
5.0%
사립아파트(임대 4
 
4.0%
사립아파트(영구 2
 
2.0%
사립아파트(구관리,영구 1
 
1.0%

전 화
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2024-04-06T21:07:24.423685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length8
Mean length8.42
Min length8

Characters and Unicode

Total characters842
Distinct characters18
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row070-4248-6800
2nd row980-1295
3rd row985-2476
4th row945-6365
5th row984-5212
ValueCountFrequency (%)
070-4248-6800 1
 
1.0%
945-6483 1
 
1.0%
988-2122 1
 
1.0%
945-5888 1
 
1.0%
984-6152 1
 
1.0%
984-8723 1
 
1.0%
987-7175 1
 
1.0%
988-1117 1
 
1.0%
980-7036 1
 
1.0%
989-5927 1
 
1.0%
Other values (92) 92
90.2%
2024-04-06T21:07:25.294104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9 148
17.6%
8 121
14.4%
- 103
12.2%
0 75
8.9%
5 65
7.7%
2 62
7.4%
7 56
 
6.7%
4 53
 
6.3%
1 52
 
6.2%
6 51
 
6.1%
Other values (8) 56
 
6.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 726
86.2%
Dash Punctuation 103
 
12.2%
Open Punctuation 3
 
0.4%
Close Punctuation 3
 
0.4%
Other Letter 3
 
0.4%
Other Punctuation 2
 
0.2%
Space Separator 2
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
9 148
20.4%
8 121
16.7%
0 75
10.3%
5 65
9.0%
2 62
8.5%
7 56
 
7.7%
4 53
 
7.3%
1 52
 
7.2%
6 51
 
7.0%
3 43
 
5.9%
Other Letter
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 103
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 839
99.6%
Hangul 3
 
0.4%

Most frequent character per script

Common
ValueCountFrequency (%)
9 148
17.6%
8 121
14.4%
- 103
12.3%
0 75
8.9%
5 65
7.7%
2 62
7.4%
7 56
 
6.7%
4 53
 
6.3%
1 52
 
6.2%
6 51
 
6.1%
Other values (5) 53
 
6.3%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 839
99.6%
Hangul 3
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9 148
17.6%
8 121
14.4%
- 103
12.3%
0 75
8.9%
5 65
7.7%
2 62
7.4%
7 56
 
6.7%
4 53
 
6.3%
1 52
 
6.2%
6 51
 
6.1%
Other values (5) 53
 
6.3%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Distinct97
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2024-04-06T21:07:25.943604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length31
Mean length24.64
Min length14

Characters and Unicode

Total characters2464
Distinct characters143
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)94.0%

Sample

1st row강북구 삼양로74길 8 (미아동, 경남아너스빌)
2nd row강북구 삼양로 256 (미아동, 삼성래미안)
3rd row강북구 인수봉로 42 (미아동)
4th row강북구 솔샘로 159 (미아동, 벽산라이브파크)
5th row강북구 삼양로35가길 7-4 (미아동)
ValueCountFrequency (%)
강북구 98
 
20.9%
미아동 42
 
9.0%
번동 25
 
5.3%
수유동 22
 
4.7%
오현로 7
 
1.5%
우이동 6
 
1.3%
삼양로 5
 
1.1%
솔샘로 5
 
1.1%
8 3
 
0.6%
23 3
 
0.6%
Other values (211) 253
53.9%
2024-04-06T21:07:26.730016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
387
 
15.7%
114
 
4.6%
100
 
4.1%
100
 
4.1%
98
 
4.0%
( 98
 
4.0%
98
 
4.0%
) 98
 
4.0%
1 95
 
3.9%
87
 
3.5%
Other values (133) 1189
48.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1369
55.6%
Decimal Number 434
 
17.6%
Space Separator 387
 
15.7%
Open Punctuation 98
 
4.0%
Close Punctuation 98
 
4.0%
Other Punctuation 51
 
2.1%
Dash Punctuation 26
 
1.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
114
 
8.3%
100
 
7.3%
100
 
7.3%
98
 
7.2%
98
 
7.2%
87
 
6.4%
77
 
5.6%
52
 
3.8%
40
 
2.9%
38
 
2.8%
Other values (116) 565
41.3%
Decimal Number
ValueCountFrequency (%)
1 95
21.9%
2 60
13.8%
5 43
9.9%
3 42
9.7%
4 41
9.4%
7 36
 
8.3%
9 35
 
8.1%
6 29
 
6.7%
0 29
 
6.7%
8 24
 
5.5%
Other Punctuation
ValueCountFrequency (%)
, 50
98.0%
? 1
 
2.0%
Space Separator
ValueCountFrequency (%)
387
100.0%
Open Punctuation
ValueCountFrequency (%)
( 98
100.0%
Close Punctuation
ValueCountFrequency (%)
) 98
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1369
55.6%
Common 1094
44.4%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
114
 
8.3%
100
 
7.3%
100
 
7.3%
98
 
7.2%
98
 
7.2%
87
 
6.4%
77
 
5.6%
52
 
3.8%
40
 
2.9%
38
 
2.8%
Other values (116) 565
41.3%
Common
ValueCountFrequency (%)
387
35.4%
( 98
 
9.0%
) 98
 
9.0%
1 95
 
8.7%
2 60
 
5.5%
, 50
 
4.6%
5 43
 
3.9%
3 42
 
3.8%
4 41
 
3.7%
7 36
 
3.3%
Other values (6) 144
 
13.2%
Latin
ValueCountFrequency (%)
A 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1369
55.6%
ASCII 1095
44.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
387
35.3%
( 98
 
8.9%
) 98
 
8.9%
1 95
 
8.7%
2 60
 
5.5%
, 50
 
4.6%
5 43
 
3.9%
3 42
 
3.8%
4 41
 
3.7%
7 36
 
3.3%
Other values (7) 145
 
13.2%
Hangul
ValueCountFrequency (%)
114
 
8.3%
100
 
7.3%
100
 
7.3%
98
 
7.2%
98
 
7.2%
87
 
6.4%
77
 
5.6%
52
 
3.8%
40
 
2.9%
38
 
2.8%
Other values (116) 565
41.3%

Interactions

2024-04-06T21:07:20.201271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-06T21:07:26.917362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번동명경로당명구분전 화도로명 주소
연번1.0000.9671.0000.2151.0001.000
동명0.9671.0001.0000.3701.0001.000
경로당명1.0001.0001.0001.0001.0001.000
구분0.2150.3701.0001.0001.0000.000
전 화1.0001.0001.0001.0001.0001.000
도로명 주소1.0001.0001.0000.0001.0001.000
2024-04-06T21:07:27.136669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분동명
구분1.0000.168
동명0.1681.000
2024-04-06T21:07:27.305795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번동명구분
연번1.0000.8540.098
동명0.8541.0000.168
구분0.0980.1681.000

Missing values

2024-04-06T21:07:20.454560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T21:07:20.668391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번동명경로당명구분전 화도로명 주소
01삼양경남사립아파트070-4248-6800강북구 삼양로74길 8 (미아동, 경남아너스빌)
12삼양래미안미아1차사립아파트980-1295강북구 삼양로 256 (미아동, 삼성래미안)
23삼양미송구립985-2476강북구 인수봉로 42 (미아동)
34삼양미아벽산사립아파트945-6365강북구 솔샘로 159 (미아동, 벽산라이브파크)
45삼양미양할머니구립984-5212강북구 삼양로35가길 7-4 (미아동)
56삼양삼일구임차988-2328강북구 삼양로41길 61 (미아동)
67삼양솔샘사립아파트(임대)986-1977강북구 솔샘로 159 (미아동, 벽산라이브파크)
78삼양실개울구립988-0235강북구 삼양로43길 41-1 (미아동)
89삼양은혜구임차989-9924삼양로63가길 7-26 (미아동) 스카이팰리스11A 101호
910삼양청암구립980-2905강북구 솔매로22길 11 (미아동)
연번동명경로당명구분전 화도로명 주소
9091우이소귓골구임차905-0637강북구 삼양로173가길 55 (우이동) 메이트하우스 102동 201호
9192우이우이구립903-1007강북구 삼양로170길 23 (우이동)
9293우이우이성원사립아파트999-2139, 9085441(할)강북구 삼양로 658 (우이동, 성원아파트)
9394인수가오사립905-4967강북구 삼양로111길 56 (수유동)
9495인수극동사립아파트990-5283강북구 인수봉로72길 4 (수유동, 극동아파트)
9596인수무너미구립900-4046강북구 인수봉로55가길 21 (수유동)
9697인수삼산구립902-5374강북구 인수봉로79길 62-4 (수유동)
9798인수인수구립908-2263강북구 인수봉로72길 15-12 (수유동)
9899인수제일구립996-4132강북구 삼양로87길 3 (수유동)
99100인수한신구립994-0090강북구 인수봉로48길 5-6 (수유동)