Overview

Dataset statistics

Number of variables6
Number of observations37
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 KiB
Average record size in memory53.6 B

Variable types

Numeric2
Text2
DateTime1
Categorical1

Dataset

Description부산광역시 기장군 정관에 있는 공동주택 현황에 대한 데이터로 아파트명, 주소, 준공일자, 세대수 등의 항목을 제공합니다.입니다.(아파트명, 주소, 준공일자, 세대수 등)
Author부산광역시 기장군
URLhttps://www.data.go.kr/data/15069116/fileData.do

Alerts

비고 is highly imbalanced (69.7%)Imbalance
연번 has unique valuesUnique
아파트명 has unique valuesUnique
주소 has unique valuesUnique
세대수 has unique valuesUnique

Reproduction

Analysis started2024-03-23 05:41:39.911513
Analysis finished2024-03-23 05:41:41.498518
Duration1.59 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct37
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19
Minimum1
Maximum37
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size465.0 B
2024-03-23T14:41:41.609567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.8
Q110
median19
Q328
95-th percentile35.2
Maximum37
Range36
Interquartile range (IQR)18

Descriptive statistics

Standard deviation10.824355
Coefficient of variation (CV)0.56970291
Kurtosis-1.2
Mean19
Median Absolute Deviation (MAD)9
Skewness0
Sum703
Variance117.16667
MonotonicityStrictly increasing
2024-03-23T14:41:41.866985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
1 1
 
2.7%
29 1
 
2.7%
22 1
 
2.7%
23 1
 
2.7%
24 1
 
2.7%
25 1
 
2.7%
26 1
 
2.7%
27 1
 
2.7%
28 1
 
2.7%
30 1
 
2.7%
Other values (27) 27
73.0%
ValueCountFrequency (%)
1 1
2.7%
2 1
2.7%
3 1
2.7%
4 1
2.7%
5 1
2.7%
6 1
2.7%
7 1
2.7%
8 1
2.7%
9 1
2.7%
10 1
2.7%
ValueCountFrequency (%)
37 1
2.7%
36 1
2.7%
35 1
2.7%
34 1
2.7%
33 1
2.7%
32 1
2.7%
31 1
2.7%
30 1
2.7%
29 1
2.7%
28 1
2.7%

아파트명
Text

UNIQUE 

Distinct37
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size428.0 B
2024-03-23T14:41:42.337729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length9
Mean length7.3783784
Min length4

Characters and Unicode

Total characters273
Distinct characters96
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)100.0%

Sample

1st row성림아파트
2nd row재흥아파트
3rd row형진강변아파트
4th row롯데캐슬
5th row계룡리슈빌
ValueCountFrequency (%)
가화테라스 2
 
5.0%
정관이진캐스빌2차 1
 
2.5%
동일스위트3차 1
 
2.5%
이진캐스빌 1
 
2.5%
가화만사성정관타운 1
 
2.5%
이지더원3차 1
 
2.5%
이지더원5차 1
 
2.5%
양우내안애 1
 
2.5%
정관lh7단지 1
 
2.5%
성림아파트 1
 
2.5%
Other values (29) 29
72.5%
2024-03-23T14:41:42.987937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13
 
4.8%
13
 
4.8%
11
 
4.0%
10
 
3.7%
10
 
3.7%
8
 
2.9%
7
 
2.6%
2 7
 
2.6%
6
 
2.2%
6
 
2.2%
Other values (86) 182
66.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 240
87.9%
Decimal Number 20
 
7.3%
Uppercase Letter 8
 
2.9%
Space Separator 3
 
1.1%
Dash Punctuation 2
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
5.4%
13
 
5.4%
11
 
4.6%
10
 
4.2%
10
 
4.2%
8
 
3.3%
7
 
2.9%
6
 
2.5%
6
 
2.5%
6
 
2.5%
Other values (75) 150
62.5%
Decimal Number
ValueCountFrequency (%)
2 7
35.0%
1 6
30.0%
5 3
15.0%
3 2
 
10.0%
7 1
 
5.0%
4 1
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
B 4
50.0%
L 3
37.5%
H 1
 
12.5%
Space Separator
ValueCountFrequency (%)
3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 240
87.9%
Common 25
 
9.2%
Latin 8
 
2.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13
 
5.4%
13
 
5.4%
11
 
4.6%
10
 
4.2%
10
 
4.2%
8
 
3.3%
7
 
2.9%
6
 
2.5%
6
 
2.5%
6
 
2.5%
Other values (75) 150
62.5%
Common
ValueCountFrequency (%)
2 7
28.0%
1 6
24.0%
3
12.0%
5 3
12.0%
3 2
 
8.0%
- 2
 
8.0%
7 1
 
4.0%
4 1
 
4.0%
Latin
ValueCountFrequency (%)
B 4
50.0%
L 3
37.5%
H 1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 240
87.9%
ASCII 33
 
12.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
13
 
5.4%
13
 
5.4%
11
 
4.6%
10
 
4.2%
10
 
4.2%
8
 
3.3%
7
 
2.9%
6
 
2.5%
6
 
2.5%
6
 
2.5%
Other values (75) 150
62.5%
ASCII
ValueCountFrequency (%)
2 7
21.2%
1 6
18.2%
B 4
12.1%
3
9.1%
5 3
9.1%
L 3
9.1%
3 2
 
6.1%
- 2
 
6.1%
H 1
 
3.0%
7 1
 
3.0%

주소
Text

UNIQUE 

Distinct37
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size428.0 B
2024-03-23T14:41:43.370410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.27027
Min length8

Characters and Unicode

Total characters417
Distinct characters29
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)100.0%

Sample

1st row정관읍 달산리 141-5
2nd row정관읍 달산리 139
3rd row정관읍 매학리 41
4th row정관읍 용수리 1364
5th row정관읍 용수리 1330
ValueCountFrequency (%)
정관읍 35
32.1%
용수리 10
 
9.2%
모전리 10
 
9.2%
달산리 7
 
6.4%
방곡리 6
 
5.5%
정관택지 2
 
1.8%
723 2
 
1.8%
매학리 2
 
1.8%
1275 1
 
0.9%
728 1
 
0.9%
Other values (33) 33
30.3%
2024-03-23T14:41:44.013033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
72
17.3%
37
 
8.9%
37
 
8.9%
35
 
8.4%
35
 
8.4%
1 25
 
6.0%
3 20
 
4.8%
7 19
 
4.6%
2 12
 
2.9%
10
 
2.4%
Other values (19) 115
27.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 218
52.3%
Decimal Number 122
29.3%
Space Separator 72
 
17.3%
Dash Punctuation 3
 
0.7%
Uppercase Letter 2
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
37
17.0%
37
17.0%
35
16.1%
35
16.1%
10
 
4.6%
10
 
4.6%
10
 
4.6%
10
 
4.6%
7
 
3.2%
7
 
3.2%
Other values (6) 20
9.2%
Decimal Number
ValueCountFrequency (%)
1 25
20.5%
3 20
16.4%
7 19
15.6%
2 12
9.8%
5 10
 
8.2%
4 9
 
7.4%
8 8
 
6.6%
0 8
 
6.6%
9 6
 
4.9%
6 5
 
4.1%
Space Separator
ValueCountFrequency (%)
72
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 218
52.3%
Common 197
47.2%
Latin 2
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
37
17.0%
37
17.0%
35
16.1%
35
16.1%
10
 
4.6%
10
 
4.6%
10
 
4.6%
10
 
4.6%
7
 
3.2%
7
 
3.2%
Other values (6) 20
9.2%
Common
ValueCountFrequency (%)
72
36.5%
1 25
 
12.7%
3 20
 
10.2%
7 19
 
9.6%
2 12
 
6.1%
5 10
 
5.1%
4 9
 
4.6%
8 8
 
4.1%
0 8
 
4.1%
9 6
 
3.0%
Other values (2) 8
 
4.1%
Latin
ValueCountFrequency (%)
B 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 218
52.3%
ASCII 199
47.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
72
36.2%
1 25
 
12.6%
3 20
 
10.1%
7 19
 
9.5%
2 12
 
6.0%
5 10
 
5.0%
4 9
 
4.5%
8 8
 
4.0%
0 8
 
4.0%
9 6
 
3.0%
Other values (3) 10
 
5.0%
Hangul
ValueCountFrequency (%)
37
17.0%
37
17.0%
35
16.1%
35
16.1%
10
 
4.6%
10
 
4.6%
10
 
4.6%
10
 
4.6%
7
 
3.2%
7
 
3.2%
Other values (6) 20
9.2%
Distinct35
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Memory size428.0 B
Minimum1989-12-12 00:00:00
Maximum2019-06-18 00:00:00
2024-03-23T14:41:44.798796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T14:41:45.026735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)

세대수
Real number (ℝ)

UNIQUE 

Distinct37
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean756.86486
Minimum66
Maximum1934
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size465.0 B
2024-03-23T14:41:45.252621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum66
5-th percentile128.4
Q1426
median690
Q31028
95-th percentile1662
Maximum1934
Range1868
Interquartile range (IQR)602

Descriptive statistics

Standard deviation492.89976
Coefficient of variation (CV)0.65123879
Kurtosis-0.24307798
Mean756.86486
Median Absolute Deviation (MAD)294
Skewness0.67200518
Sum28004
Variance242950.18
MonotonicityNot monotonic
2024-03-23T14:41:45.484580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
70 1
 
2.7%
1934 1
 
2.7%
464 1
 
2.7%
1500 1
 
2.7%
539 1
 
2.7%
560 1
 
2.7%
1035 1
 
2.7%
426 1
 
2.7%
830 1
 
2.7%
258 1
 
2.7%
Other values (27) 27
73.0%
ValueCountFrequency (%)
66 1
2.7%
70 1
2.7%
143 1
2.7%
170 1
2.7%
185 1
2.7%
258 1
2.7%
269 1
2.7%
272 1
2.7%
396 1
2.7%
426 1
2.7%
ValueCountFrequency (%)
1934 1
2.7%
1758 1
2.7%
1638 1
2.7%
1533 1
2.7%
1500 1
2.7%
1301 1
2.7%
1249 1
2.7%
1128 1
2.7%
1035 1
2.7%
1028 1
2.7%

비고
Categorical

IMBALANCE 

Distinct2
Distinct (%)5.4%
Missing0
Missing (%)0.0%
Memory size428.0 B
입주사용중
35 
미착공
 
2

Length

Max length5
Median length5
Mean length4.8918919
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row입주사용중
2nd row입주사용중
3rd row입주사용중
4th row입주사용중
5th row입주사용중

Common Values

ValueCountFrequency (%)
입주사용중 35
94.6%
미착공 2
 
5.4%

Length

2024-03-23T14:41:45.889090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T14:41:46.150664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
입주사용중 35
94.6%
미착공 2
 
5.4%

Interactions

2024-03-23T14:41:40.741302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T14:41:40.435967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T14:41:40.945124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T14:41:40.579453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-23T14:41:46.266290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번아파트명주소준공일자세대수비고
연번1.0001.0001.0001.0000.6650.703
아파트명1.0001.0001.0001.0001.0001.000
주소1.0001.0001.0001.0001.0001.000
준공일자1.0001.0001.0001.0000.9151.000
세대수0.6651.0001.0000.9151.0000.000
비고0.7031.0001.0001.0000.0001.000
2024-03-23T14:41:46.530409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번세대수비고
연번1.000-0.0970.477
세대수-0.0971.0000.000
비고0.4770.0001.000

Missing values

2024-03-23T14:41:41.210734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T14:41:41.432508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번아파트명주소준공일자세대수비고
01성림아파트정관읍 달산리 141-51989-12-1270입주사용중
12재흥아파트정관읍 달산리 1391990-05-31170입주사용중
23형진강변아파트정관읍 매학리 411992-09-17143입주사용중
34롯데캐슬정관읍 용수리 13642008-11-27761입주사용중
45계룡리슈빌정관읍 용수리 13302008-12-16455입주사용중
56신동아파밀리에정관읍 용수리 13222008-12-19655입주사용중
67한진해모로정관읍 용수리 14032008-12-24763입주사용중
78정관휴먼시아1단지정관읍 용수리 13592008-12-291533입주사용중
89현진에버빌정관읍 모전리 7552009-01-23690입주사용중
910센트럴파크정관읍 달산리 11992009-12-11588입주사용중
연번아파트명주소준공일자세대수비고
2728양우내안애정관읍 용수리 12752016-06-24830입주사용중
2829정관LH7단지정관읍 모전리 6812017-07-061934입주사용중
2930정관이진캐스빌2차정관읍 용수리 12792017-08-18258입주사용중
3031가화테라스 1차정관읍 방곡리 4072017-09-29431입주사용중
3132파스텔라타운하우스정관읍 방곡리 4372018-02-2866입주사용중
3233두산위브더테라스정관읍 달산리 12442018-05-31272입주사용중
3334가화테라스 2차정관읍 방곡리 4052018-06-01396입주사용중
3435정관 행복주택정관읍 모전리 7172018-12-18856입주사용중
3536부산정관지구B-1BL연립주택정관택지 B-12019-06-18269미착공
3637부산정관지구B-5BL연립주택정관택지 B-52019-06-18185미착공