Overview

Dataset statistics

Number of variables8
Number of observations300
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory19.8 KiB
Average record size in memory67.4 B

Variable types

Numeric3
Text2
DateTime1
Categorical2

Dataset

Description울산광역시 동구에 소재한 아파트들의 현황 자료로 아파트명, 세대수, 동수, 준공연도, 주소 등의 데이터를 포함하고 있습니다.
URLhttps://www.data.go.kr/data/15105488/fileData.do

Alerts

담당기관 has constant value ""Constant
기관 연락처 has constant value ""Constant
연번 is highly overall correlated with 세대수High correlation
세대수 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
동수 is highly overall correlated with 세대수High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:42:52.559855
Analysis finished2023-12-12 08:42:54.377287
Duration1.82 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct300
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean150.5
Minimum1
Maximum300
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.8 KiB
2023-12-12T17:42:54.478120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile15.95
Q175.75
median150.5
Q3225.25
95-th percentile285.05
Maximum300
Range299
Interquartile range (IQR)149.5

Descriptive statistics

Standard deviation86.746758
Coefficient of variation (CV)0.57639042
Kurtosis-1.2
Mean150.5
Median Absolute Deviation (MAD)75
Skewness0
Sum45150
Variance7525
MonotonicityStrictly increasing
2023-12-12T17:42:54.673268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
208 1
 
0.3%
206 1
 
0.3%
205 1
 
0.3%
204 1
 
0.3%
203 1
 
0.3%
202 1
 
0.3%
201 1
 
0.3%
200 1
 
0.3%
199 1
 
0.3%
Other values (290) 290
96.7%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
300 1
0.3%
299 1
0.3%
298 1
0.3%
297 1
0.3%
296 1
0.3%
295 1
0.3%
294 1
0.3%
293 1
0.3%
292 1
0.3%
291 1
0.3%
Distinct289
Distinct (%)96.3%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2023-12-12T17:42:55.057616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length6.4
Min length1

Characters and Unicode

Total characters1920
Distinct characters266
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique282 ?
Unique (%)94.0%

Sample

1st row일진아파트
2nd row남목아파트
3rd row건남아파트
4th row대성아파트2차
5th row반석아파트
ValueCountFrequency (%)
5
 
1.5%
2차 4
 
1.2%
7차 3
 
0.9%
대호하이빌 3
 
0.9%
처용빌라트 3
 
0.9%
전하 3
 
0.9%
에스코빌 3
 
0.9%
전하국민아파트 2
 
0.6%
명성블루빌 2
 
0.6%
조양빌라트 2
 
0.6%
Other values (300) 307
91.1%
2023-12-12T17:42:55.593077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
111
 
5.8%
92
 
4.8%
81
 
4.2%
76
 
4.0%
62
 
3.2%
49
 
2.6%
45
 
2.3%
42
 
2.2%
39
 
2.0%
37
 
1.9%
Other values (256) 1286
67.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1703
88.7%
Decimal Number 80
 
4.2%
Space Separator 37
 
1.9%
Uppercase Letter 29
 
1.5%
Close Punctuation 28
 
1.5%
Open Punctuation 28
 
1.5%
Lowercase Letter 8
 
0.4%
Dash Punctuation 5
 
0.3%
Other Punctuation 1
 
0.1%
Letter Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
111
 
6.5%
92
 
5.4%
81
 
4.8%
76
 
4.5%
62
 
3.6%
49
 
2.9%
45
 
2.6%
42
 
2.5%
39
 
2.3%
36
 
2.1%
Other values (220) 1070
62.8%
Uppercase Letter
ValueCountFrequency (%)
S 4
13.8%
A 4
13.8%
I 4
13.8%
V 3
10.3%
E 2
6.9%
K 2
6.9%
B 2
6.9%
C 2
6.9%
P 1
 
3.4%
N 1
 
3.4%
Other values (4) 4
13.8%
Decimal Number
ValueCountFrequency (%)
2 22
27.5%
1 19
23.8%
3 10
12.5%
6 6
 
7.5%
4 5
 
6.2%
7 4
 
5.0%
8 4
 
5.0%
5 4
 
5.0%
0 3
 
3.8%
9 3
 
3.8%
Lowercase Letter
ValueCountFrequency (%)
e 3
37.5%
r 1
 
12.5%
t 1
 
12.5%
a 1
 
12.5%
u 1
 
12.5%
l 1
 
12.5%
Space Separator
ValueCountFrequency (%)
37
100.0%
Close Punctuation
ValueCountFrequency (%)
) 28
100.0%
Open Punctuation
ValueCountFrequency (%)
( 28
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1703
88.7%
Common 179
 
9.3%
Latin 38
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
111
 
6.5%
92
 
5.4%
81
 
4.8%
76
 
4.5%
62
 
3.6%
49
 
2.9%
45
 
2.6%
42
 
2.5%
39
 
2.3%
36
 
2.1%
Other values (220) 1070
62.8%
Latin
ValueCountFrequency (%)
S 4
 
10.5%
A 4
 
10.5%
I 4
 
10.5%
V 3
 
7.9%
e 3
 
7.9%
E 2
 
5.3%
K 2
 
5.3%
B 2
 
5.3%
C 2
 
5.3%
P 1
 
2.6%
Other values (11) 11
28.9%
Common
ValueCountFrequency (%)
37
20.7%
) 28
15.6%
( 28
15.6%
2 22
12.3%
1 19
10.6%
3 10
 
5.6%
6 6
 
3.4%
4 5
 
2.8%
- 5
 
2.8%
7 4
 
2.2%
Other values (5) 15
8.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1703
88.7%
ASCII 216
 
11.2%
Number Forms 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
111
 
6.5%
92
 
5.4%
81
 
4.8%
76
 
4.5%
62
 
3.6%
49
 
2.9%
45
 
2.6%
42
 
2.5%
39
 
2.3%
36
 
2.1%
Other values (220) 1070
62.8%
ASCII
ValueCountFrequency (%)
37
17.1%
) 28
13.0%
( 28
13.0%
2 22
10.2%
1 19
 
8.8%
3 10
 
4.6%
6 6
 
2.8%
4 5
 
2.3%
- 5
 
2.3%
S 4
 
1.9%
Other values (25) 52
24.1%
Number Forms
ValueCountFrequency (%)
1
100.0%

세대수
Real number (ℝ)

HIGH CORRELATION 

Distinct128
Distinct (%)42.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean143.30333
Minimum3
Maximum3027
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.8 KiB
2023-12-12T17:42:55.771416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile10
Q117
median30
Q398.25
95-th percentile724.35
Maximum3027
Range3024
Interquartile range (IQR)81.25

Descriptive statistics

Standard deviation337.28312
Coefficient of variation (CV)2.3536307
Kurtosis27.175207
Mean143.30333
Median Absolute Deviation (MAD)16
Skewness4.702847
Sum42991
Variance113759.9
MonotonicityNot monotonic
2023-12-12T17:42:55.952023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
19 33
 
11.0%
16 18
 
6.0%
18 15
 
5.0%
12 11
 
3.7%
14 11
 
3.7%
13 8
 
2.7%
15 7
 
2.3%
36 7
 
2.3%
40 6
 
2.0%
30 6
 
2.0%
Other values (118) 178
59.3%
ValueCountFrequency (%)
3 1
 
0.3%
5 1
 
0.3%
6 2
 
0.7%
7 2
 
0.7%
8 5
1.7%
9 1
 
0.3%
10 4
 
1.3%
11 1
 
0.3%
12 11
3.7%
13 8
2.7%
ValueCountFrequency (%)
3027 1
0.3%
2110 1
0.3%
1897 1
0.3%
1810 1
0.3%
1475 1
0.3%
1371 1
0.3%
1345 1
0.3%
1316 1
0.3%
1206 1
0.3%
991 1
0.3%

동수
Real number (ℝ)

HIGH CORRELATION 

Distinct16
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.89
Minimum1
Maximum26
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.8 KiB
2023-12-12T17:42:56.113325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q31
95-th percentile6.05
Maximum26
Range25
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2.9314179
Coefficient of variation (CV)1.5510147
Kurtosis26.677754
Mean1.89
Median Absolute Deviation (MAD)0
Skewness4.8260912
Sum567
Variance8.5932107
MonotonicityNot monotonic
2023-12-12T17:42:56.263930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
1 237
79.0%
2 31
 
10.3%
4 8
 
2.7%
3 4
 
1.3%
5 3
 
1.0%
9 3
 
1.0%
8 2
 
0.7%
13 2
 
0.7%
6 2
 
0.7%
15 2
 
0.7%
Other values (6) 6
 
2.0%
ValueCountFrequency (%)
1 237
79.0%
2 31
 
10.3%
3 4
 
1.3%
4 8
 
2.7%
5 3
 
1.0%
6 2
 
0.7%
7 1
 
0.3%
8 2
 
0.7%
9 3
 
1.0%
12 1
 
0.3%
ValueCountFrequency (%)
26 1
 
0.3%
19 1
 
0.3%
18 1
 
0.3%
16 1
 
0.3%
15 2
0.7%
13 2
0.7%
12 1
 
0.3%
9 3
1.0%
8 2
0.7%
7 1
 
0.3%
Distinct279
Distinct (%)93.0%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
Minimum1978-12-23 00:00:00
Maximum2023-07-20 00:00:00
2023-12-12T17:42:56.437667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:42:56.619703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

주소
Text

Distinct295
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2023-12-12T17:42:57.032701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length19
Mean length15.503333
Min length13

Characters and Unicode

Total characters4651
Distinct characters76
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique290 ?
Unique (%)96.7%

Sample

1st row울산광역시 동구남목20길 40
2nd row울산광역시 동구남목8길 5
3rd row울산광역시 동구진성8길 143
4th row울산광역시 동구진성10길 146
5th row울산광역시 동구진성10길 69
ValueCountFrequency (%)
울산광역시 300
33.4%
동구방어진순환도로 29
 
3.2%
동구꽃바위로 15
 
1.7%
동구봉수로 11
 
1.2%
5 10
 
1.1%
동구북진1길 9
 
1.0%
7 7
 
0.8%
31 7
 
0.8%
15 7
 
0.8%
동구문재로 6
 
0.7%
Other values (266) 498
55.4%
2023-12-12T17:42:57.665586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
600
 
12.9%
306
 
6.6%
302
 
6.5%
300
 
6.5%
300
 
6.5%
300
 
6.5%
300
 
6.5%
300
 
6.5%
1 201
 
4.3%
201
 
4.3%
Other values (66) 1541
33.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3166
68.1%
Decimal Number 872
 
18.7%
Space Separator 600
 
12.9%
Dash Punctuation 13
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
306
9.7%
302
9.5%
300
9.5%
300
9.5%
300
9.5%
300
9.5%
300
9.5%
201
 
6.3%
108
 
3.4%
99
 
3.1%
Other values (54) 650
20.5%
Decimal Number
ValueCountFrequency (%)
1 201
23.1%
2 114
13.1%
5 105
12.0%
3 88
10.1%
0 80
 
9.2%
4 73
 
8.4%
7 62
 
7.1%
6 54
 
6.2%
9 51
 
5.8%
8 44
 
5.0%
Space Separator
ValueCountFrequency (%)
600
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3166
68.1%
Common 1485
31.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
306
9.7%
302
9.5%
300
9.5%
300
9.5%
300
9.5%
300
9.5%
300
9.5%
201
 
6.3%
108
 
3.4%
99
 
3.1%
Other values (54) 650
20.5%
Common
ValueCountFrequency (%)
600
40.4%
1 201
 
13.5%
2 114
 
7.7%
5 105
 
7.1%
3 88
 
5.9%
0 80
 
5.4%
4 73
 
4.9%
7 62
 
4.2%
6 54
 
3.6%
9 51
 
3.4%
Other values (2) 57
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3166
68.1%
ASCII 1485
31.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
600
40.4%
1 201
 
13.5%
2 114
 
7.7%
5 105
 
7.1%
3 88
 
5.9%
0 80
 
5.4%
4 73
 
4.9%
7 62
 
4.2%
6 54
 
3.6%
9 51
 
3.4%
Other values (2) 57
 
3.8%
Hangul
ValueCountFrequency (%)
306
9.7%
302
9.5%
300
9.5%
300
9.5%
300
9.5%
300
9.5%
300
9.5%
201
 
6.3%
108
 
3.4%
99
 
3.1%
Other values (54) 650
20.5%

담당기관
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
울산광역시 동구 건축주택과
300 

Length

Max length14
Median length14
Mean length14
Min length14

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row울산광역시 동구 건축주택과
2nd row울산광역시 동구 건축주택과
3rd row울산광역시 동구 건축주택과
4th row울산광역시 동구 건축주택과
5th row울산광역시 동구 건축주택과

Common Values

ValueCountFrequency (%)
울산광역시 동구 건축주택과 300
100.0%

Length

2023-12-12T17:42:57.842700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:42:57.961000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
울산광역시 300
33.3%
동구 300
33.3%
건축주택과 300
33.3%

기관 연락처
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
052-209-3794
300 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row052-209-3794
2nd row052-209-3794
3rd row052-209-3794
4th row052-209-3794
5th row052-209-3794

Common Values

ValueCountFrequency (%)
052-209-3794 300
100.0%

Length

2023-12-12T17:42:58.092962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:42:58.254245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
052-209-3794 300
100.0%

Interactions

2023-12-12T17:42:53.629942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:42:52.897394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:42:53.181827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:42:53.769932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:42:52.985819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:42:53.312610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:42:53.926455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:42:53.089100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:42:53.495709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:42:58.353546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번세대수동수
연번1.0000.1920.194
세대수0.1921.0000.912
동수0.1940.9121.000
2023-12-12T17:42:58.488603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번세대수동수
연번1.000-0.596-0.304
세대수-0.5961.0000.670
동수-0.3040.6701.000

Missing values

2023-12-12T17:42:54.138320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:42:54.312458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번아파트명세대수동수준공연도주소담당기관기관 연락처
01일진아파트8021978-12-23울산광역시 동구남목20길 40울산광역시 동구 건축주택과052-209-3794
12남목아파트2211980-01-22울산광역시 동구남목8길 5울산광역시 동구 건축주택과052-209-3794
23건남아파트4511980-09-17울산광역시 동구진성8길 143울산광역시 동구 건축주택과052-209-3794
34대성아파트2차7221982-02-19울산광역시 동구진성10길 146울산광역시 동구 건축주택과052-209-3794
45반석아파트4021982-08-02울산광역시 동구진성10길 69울산광역시 동구 건축주택과052-209-3794
56대한아파트4011982-11-02울산광역시 동구진성7길 54울산광역시 동구 건축주택과052-209-3794
67전하국민아파트2011983-01-04울산광역시 동구대송로 21울산광역시 동구 건축주택과052-209-3794
78유한국민아파트3811983-01-04울산광역시 동구진성6길 40울산광역시 동구 건축주택과052-209-3794
89전하국민아파트8321983-01-04울산광역시 동구진성10길 77울산광역시 동구 건축주택과052-209-3794
910삼창파크맨션11021983-03-17울산광역시 동구진성15가길 5울산광역시 동구 건축주택과052-209-3794
연번아파트명세대수동수준공연도주소담당기관기관 연락처
290291센트럴오션파크11212018-02-14울산광역시 동구내진4길 11울산광역시 동구 건축주택과052-209-3794
291292베이프런트2612019-03-22울산광역시 동구북진2길 20울산광역시 동구 건축주택과052-209-3794
292293삼성힐스파크9212019-08-29울산광역시 동구진성9길 2울산광역시 동구 건축주택과052-209-3794
293294신성 미소지움16212019-12-31울산광역시 동구방어진순환도로 470울산광역시 동구 건축주택과052-209-3794
294295전하 KCC스위첸20922020-02-06울산광역시 동구바드래1길 46울산광역시 동구 건축주택과052-209-3794
295296스웨첸월츠타워1차25522022-02-25울산광역시 동구바드래1길 46울산광역시 동구 건축주택과052-209-3794
296297지웰시티자이1단지137192023-05-26울산광역시 동구서부로50울산광역시 동구 건축주택과052-209-3794
297298지웰시티자이2단지131692023-05-26울산광역시 동구서부로51울산광역시 동구 건축주택과052-209-3794
298299로얄맨션(주상복합)9912023-06-19울산광역시 동구진성9길 60울산광역시 동구 건축주택과052-209-3794
299300스위첸웰츠타워2차38032023-07-20울산광역시 동구방어진순환도로 865울산광역시 동구 건축주택과052-209-3794