Overview

Dataset statistics

Number of variables9
Number of observations148
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.1 KiB
Average record size in memory76.9 B

Variable types

Numeric2
Text2
DateTime1
Categorical4

Dataset

Description울산광역시 동구 내 연립주택 현황에 대한 데이터로 주택명, 도로명 주소, 준공연도, 층수, 동수 등의 정보를 포함하고 있습니다
Author울산광역시 동구
URLhttps://www.data.go.kr/data/3075817/fileData.do

Alerts

담당기관 has constant value ""Constant
기관 연락처 has constant value ""Constant
세대수 is highly overall correlated with 동수 High correlation
동수 is highly overall correlated with 세대수 High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-15 02:31:08.030611
Analysis finished2024-03-15 02:31:09.764402
Duration1.73 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct148
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean74.5
Minimum1
Maximum148
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2024-03-15T11:31:09.963197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile8.35
Q137.75
median74.5
Q3111.25
95-th percentile140.65
Maximum148
Range147
Interquartile range (IQR)73.5

Descriptive statistics

Standard deviation42.868014
Coefficient of variation (CV)0.57540959
Kurtosis-1.2
Mean74.5
Median Absolute Deviation (MAD)37
Skewness0
Sum11026
Variance1837.6667
MonotonicityStrictly increasing
2024-03-15T11:31:10.335513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.7%
95 1
 
0.7%
97 1
 
0.7%
98 1
 
0.7%
99 1
 
0.7%
100 1
 
0.7%
101 1
 
0.7%
102 1
 
0.7%
103 1
 
0.7%
104 1
 
0.7%
Other values (138) 138
93.2%
ValueCountFrequency (%)
1 1
0.7%
2 1
0.7%
3 1
0.7%
4 1
0.7%
5 1
0.7%
6 1
0.7%
7 1
0.7%
8 1
0.7%
9 1
0.7%
10 1
0.7%
ValueCountFrequency (%)
148 1
0.7%
147 1
0.7%
146 1
0.7%
145 1
0.7%
144 1
0.7%
143 1
0.7%
142 1
0.7%
141 1
0.7%
140 1
0.7%
139 1
0.7%
Distinct129
Distinct (%)87.2%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2024-03-15T11:31:11.471883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length14
Mean length5.6891892
Min length4

Characters and Unicode

Total characters842
Distinct characters164
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique115 ?
Unique (%)77.7%

Sample

1st row대흥연립주택
2nd row삼연연립주택아파트
3rd row삼흥연립주택
4th row산성연립주택
5th row산성연립주택
ValueCountFrequency (%)
다운빌라 4
 
2.5%
신화빌라 3
 
1.9%
은빛에베로힐 3
 
1.9%
빌라드씨 3
 
1.9%
산성연립주택 3
 
1.9%
명진빌라 2
 
1.3%
우경아트빌라 2
 
1.3%
로얄빌라 2
 
1.3%
한듬빌라 2
 
1.3%
광운빌라 2
 
1.3%
Other values (126) 131
83.4%
2024-03-15T11:31:12.926865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
106
 
12.6%
89
 
10.6%
20
 
2.4%
19
 
2.3%
19
 
2.3%
19
 
2.3%
17
 
2.0%
16
 
1.9%
15
 
1.8%
15
 
1.8%
Other values (154) 507
60.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 758
90.0%
Decimal Number 44
 
5.2%
Lowercase Letter 16
 
1.9%
Uppercase Letter 13
 
1.5%
Space Separator 9
 
1.1%
Other Punctuation 1
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
106
 
14.0%
89
 
11.7%
20
 
2.6%
19
 
2.5%
19
 
2.5%
19
 
2.5%
17
 
2.2%
16
 
2.1%
15
 
2.0%
15
 
2.0%
Other values (120) 423
55.8%
Lowercase Letter
ValueCountFrequency (%)
e 2
12.5%
l 2
12.5%
r 2
12.5%
o 2
12.5%
a 1
6.2%
i 1
6.2%
s 1
6.2%
y 1
6.2%
h 1
6.2%
t 1
6.2%
Other values (2) 2
12.5%
Uppercase Letter
ValueCountFrequency (%)
V 2
15.4%
P 2
15.4%
J 2
15.4%
A 1
7.7%
K 1
7.7%
Y 1
7.7%
R 1
7.7%
T 1
7.7%
H 1
7.7%
I 1
7.7%
Decimal Number
ValueCountFrequency (%)
1 14
31.8%
0 8
18.2%
2 7
15.9%
7 4
 
9.1%
5 4
 
9.1%
4 3
 
6.8%
9 2
 
4.5%
8 1
 
2.3%
3 1
 
2.3%
Space Separator
ValueCountFrequency (%)
9
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 758
90.0%
Common 55
 
6.5%
Latin 29
 
3.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
106
 
14.0%
89
 
11.7%
20
 
2.6%
19
 
2.5%
19
 
2.5%
19
 
2.5%
17
 
2.2%
16
 
2.1%
15
 
2.0%
15
 
2.0%
Other values (120) 423
55.8%
Latin
ValueCountFrequency (%)
V 2
 
6.9%
P 2
 
6.9%
e 2
 
6.9%
J 2
 
6.9%
l 2
 
6.9%
r 2
 
6.9%
o 2
 
6.9%
A 1
 
3.4%
K 1
 
3.4%
Y 1
 
3.4%
Other values (12) 12
41.4%
Common
ValueCountFrequency (%)
1 14
25.5%
9
16.4%
0 8
14.5%
2 7
12.7%
7 4
 
7.3%
5 4
 
7.3%
4 3
 
5.5%
9 2
 
3.6%
, 1
 
1.8%
- 1
 
1.8%
Other values (2) 2
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 758
90.0%
ASCII 84
 
10.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
106
 
14.0%
89
 
11.7%
20
 
2.6%
19
 
2.5%
19
 
2.5%
19
 
2.5%
17
 
2.2%
16
 
2.1%
15
 
2.0%
15
 
2.0%
Other values (120) 423
55.8%
ASCII
ValueCountFrequency (%)
1 14
16.7%
9
 
10.7%
0 8
 
9.5%
2 7
 
8.3%
7 4
 
4.8%
5 4
 
4.8%
4 3
 
3.6%
9 2
 
2.4%
V 2
 
2.4%
P 2
 
2.4%
Other values (24) 29
34.5%
Distinct147
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2024-03-15T11:31:14.331457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length16.412162
Min length14

Characters and Unicode

Total characters2429
Distinct characters63
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique146 ?
Unique (%)98.6%

Sample

1st row울산광역시 동구 화정4가길 7
2nd row울산광역시 동구 진성3길 39
3rd row울산광역시 동구 진성8길 35
4th row울산광역시 동구 진성11길 106
5th row울산광역시 동구 진성11길 106
ValueCountFrequency (%)
울산광역시 148
25.0%
동구 147
24.8%
26 6
 
1.0%
양지3길 5
 
0.8%
월봉11길 5
 
0.8%
9 5
 
0.8%
53 4
 
0.7%
21 4
 
0.7%
7 4
 
0.7%
10 4
 
0.7%
Other values (178) 261
44.0%
2024-03-15T11:31:16.268901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
446
18.4%
154
 
6.3%
148
 
6.1%
148
 
6.1%
148
 
6.1%
148
 
6.1%
148
 
6.1%
148
 
6.1%
130
 
5.4%
1 114
 
4.7%
Other values (53) 697
28.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1506
62.0%
Decimal Number 458
 
18.9%
Space Separator 446
 
18.4%
Dash Punctuation 18
 
0.7%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
154
10.2%
148
9.8%
148
9.8%
148
9.8%
148
9.8%
148
9.8%
148
9.8%
130
8.6%
34
 
2.3%
22
 
1.5%
Other values (40) 278
18.5%
Decimal Number
ValueCountFrequency (%)
1 114
24.9%
2 60
13.1%
3 52
11.4%
4 46
10.0%
6 41
 
9.0%
5 37
 
8.1%
7 37
 
8.1%
0 28
 
6.1%
8 25
 
5.5%
9 18
 
3.9%
Space Separator
ValueCountFrequency (%)
446
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%
Uppercase Letter
ValueCountFrequency (%)
C 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1506
62.0%
Common 922
38.0%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
154
10.2%
148
9.8%
148
9.8%
148
9.8%
148
9.8%
148
9.8%
148
9.8%
130
8.6%
34
 
2.3%
22
 
1.5%
Other values (40) 278
18.5%
Common
ValueCountFrequency (%)
446
48.4%
1 114
 
12.4%
2 60
 
6.5%
3 52
 
5.6%
4 46
 
5.0%
6 41
 
4.4%
5 37
 
4.0%
7 37
 
4.0%
0 28
 
3.0%
8 25
 
2.7%
Other values (2) 36
 
3.9%
Latin
ValueCountFrequency (%)
C 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1506
62.0%
ASCII 923
38.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
446
48.3%
1 114
 
12.4%
2 60
 
6.5%
3 52
 
5.6%
4 46
 
5.0%
6 41
 
4.4%
5 37
 
4.0%
7 37
 
4.0%
0 28
 
3.0%
8 25
 
2.7%
Other values (3) 37
 
4.0%
Hangul
ValueCountFrequency (%)
154
10.2%
148
9.8%
148
9.8%
148
9.8%
148
9.8%
148
9.8%
148
9.8%
130
8.6%
34
 
2.3%
22
 
1.5%
Other values (40) 278
18.5%
Distinct139
Distinct (%)93.9%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
Minimum1978-09-06 00:00:00
Maximum2016-05-18 00:00:00
2024-03-15T11:31:16.575475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T11:31:16.990841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

층수
Categorical

Distinct5
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
4
97 
5
35 
3
 
9
2
 
6
6
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)0.7%

Sample

1st row2
2nd row2
3rd row2
4th row2
5th row2

Common Values

ValueCountFrequency (%)
4 97
65.5%
5 35
 
23.6%
3 9
 
6.1%
2 6
 
4.1%
6 1
 
0.7%

Length

2024-03-15T11:31:17.406940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T11:31:17.755437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
4 97
65.5%
5 35
 
23.6%
3 9
 
6.1%
2 6
 
4.1%
6 1
 
0.7%

동수
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
1
103 
2
43 
3
 
2

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row2
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 103
69.6%
2 43
29.1%
3 2
 
1.4%

Length

2024-03-15T11:31:18.146390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T11:31:18.473506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 103
69.6%
2 43
29.1%
3 2
 
1.4%

세대수
Real number (ℝ)

HIGH CORRELATION 

Distinct21
Distinct (%)14.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.885135
Minimum6
Maximum52
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2024-03-15T11:31:18.689364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6
5-th percentile7
Q19.75
median12.5
Q316
95-th percentile19
Maximum52
Range46
Interquartile range (IQR)6.25

Descriptive statistics

Standard deviation6.7698046
Coefficient of variation (CV)0.4875577
Kurtosis12.329944
Mean13.885135
Median Absolute Deviation (MAD)3.5
Skewness2.8830527
Sum2055
Variance45.830254
MonotonicityNot monotonic
2024-03-15T11:31:18.997144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
16 29
19.6%
12 25
16.9%
8 25
16.9%
19 13
8.8%
15 11
 
7.4%
11 7
 
4.7%
7 7
 
4.7%
14 5
 
3.4%
10 5
 
3.4%
6 4
 
2.7%
Other values (11) 17
11.5%
ValueCountFrequency (%)
6 4
 
2.7%
7 7
 
4.7%
8 25
16.9%
9 1
 
0.7%
10 5
 
3.4%
11 7
 
4.7%
12 25
16.9%
13 4
 
2.7%
14 5
 
3.4%
15 11
7.4%
ValueCountFrequency (%)
52 1
 
0.7%
48 1
 
0.7%
42 1
 
0.7%
36 1
 
0.7%
33 1
 
0.7%
25 1
 
0.7%
24 1
 
0.7%
19 13
8.8%
18 3
 
2.0%
17 2
 
1.4%

담당기관
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
울산광역시 동구 건축주택과
148 

Length

Max length14
Median length14
Mean length14
Min length14

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row울산광역시 동구 건축주택과
2nd row울산광역시 동구 건축주택과
3rd row울산광역시 동구 건축주택과
4th row울산광역시 동구 건축주택과
5th row울산광역시 동구 건축주택과

Common Values

ValueCountFrequency (%)
울산광역시 동구 건축주택과 148
100.0%

Length

2024-03-15T11:31:19.505694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T11:31:19.912701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
울산광역시 148
33.3%
동구 148
33.3%
건축주택과 148
33.3%

기관 연락처
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
052-209-3794
148 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row052-209-3794
2nd row052-209-3794
3rd row052-209-3794
4th row052-209-3794
5th row052-209-3794

Common Values

ValueCountFrequency (%)
052-209-3794 148
100.0%

Length

2024-03-15T11:31:20.406436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T11:31:20.824702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
052-209-3794 148
100.0%

Interactions

2024-03-15T11:31:08.859864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T11:31:08.493039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T11:31:09.013775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T11:31:08.713329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T11:31:21.072152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번층수동수세대수
연번1.0000.7760.4720.265
층수0.7761.0000.4620.413
동수0.4720.4621.0000.950
세대수0.2650.4130.9501.000
2024-03-15T11:31:21.500622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
동수층수
동수1.0000.391
층수0.3911.000
2024-03-15T11:31:21.850637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번세대수층수동수
연번1.000-0.3160.4240.314
세대수-0.3161.0000.2490.720
층수0.4240.2491.0000.391
동수0.3140.7200.3911.000

Missing values

2024-03-15T11:31:09.201978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T11:31:09.559885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번주택명도로명 주소준공연도층수동수세대수담당기관기관 연락처
01대흥연립주택울산광역시 동구 화정4가길 71978-09-062342울산광역시 동구 건축주택과052-209-3794
12삼연연립주택아파트울산광역시 동구 진성3길 391980-02-272215울산광역시 동구 건축주택과052-209-3794
23삼흥연립주택울산광역시 동구 진성8길 351980-04-042112울산광역시 동구 건축주택과052-209-3794
34산성연립주택울산광역시 동구 진성11길 1061981-09-302112울산광역시 동구 건축주택과052-209-3794
45산성연립주택울산광역시 동구 진성11길 1061981-09-302112울산광역시 동구 건축주택과052-209-3794
56산성연립주택울산광역시 동구 진성 12길 1041981-10-023112울산광역시 동구 건축주택과052-209-3794
67새운아파트울산광역시 동구 명덕7길 451981-12-144248울산광역시 동구 건축주택과052-209-3794
78동해연립울산광역시 동구 진성8길 531981-12-172325울산광역시 동구 건축주택과052-209-3794
89홍일아파트울산광역시 동구 진성13길 321982-05-064116울산광역시 동구 건축주택과052-209-3794
910홍일아파트울산광역시 동구 진성12길 531982-06-235119울산광역시 동구 건축주택과052-209-3794
연번주택명도로명 주소준공연도층수동수세대수담당기관기관 연락처
138139로얄팰리스울산광역시 동구 문재1길 222012-12-05517울산광역시 동구 건축주택과052-209-3794
139140Horbor Village울산광역시 동구 서진3길 312013-05-02517울산광역시 동구 건축주택과052-209-3794
140141Tethys울산광역시 동구 상진길 842013-09-125112울산광역시 동구 건축주택과052-209-3794
141142빌라드씨 104동울산광역시 동구 동해안로 6172014-01-16416울산광역시 동구 건축주택과052-209-3794
142143빌라드씨 105동울산광역시 동구 동해안로 6152014-01-16416울산광역시 동구 건축주택과052-209-3794
143144빌라드씨 101동, 102동울산광역시 동구 동해안로 617-22014-01-164211울산광역시 동구 건축주택과052-209-3794
144145썬빌리지울산광역시 동구 명덕3길 212014-01-295112울산광역시 동구 건축주택과052-209-3794
145146로하스2차오피스텔울산광역시 동구 내진3길 11-52014-02-28516울산광역시 동구 건축주택과052-209-3794
146147J-PARK울산광역시 동구 물목길 8-32015-01-204119울산광역시 동구 건축주택과052-209-3794
147148서희빌라울산광역시 동구 꽃바위로 321-12016-05-18618울산광역시 동구 건축주택과052-209-3794