Overview

Dataset statistics

Number of variables6
Number of observations37
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 KiB
Average record size in memory53.6 B

Variable types

Numeric2
Text2
DateTime1
Categorical1

Dataset

Description부산광역시기장군_정관신도시주택현황_20180222
Author부산광역시 기장군
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15069116

Alerts

연번 is highly overall correlated with 비고High correlation
비고 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
아파트명 has unique valuesUnique
주소 has unique valuesUnique
세대수 has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:42:20.980712
Analysis finished2023-12-10 16:42:22.205494
Duration1.22 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct37
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19
Minimum1
Maximum37
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size465.0 B
2023-12-11T01:42:22.301998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.8
Q110
median19
Q328
95-th percentile35.2
Maximum37
Range36
Interquartile range (IQR)18

Descriptive statistics

Standard deviation10.824355
Coefficient of variation (CV)0.56970291
Kurtosis-1.2
Mean19
Median Absolute Deviation (MAD)9
Skewness0
Sum703
Variance117.16667
MonotonicityStrictly increasing
2023-12-11T01:42:22.485442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
1 1
 
2.7%
29 1
 
2.7%
22 1
 
2.7%
23 1
 
2.7%
24 1
 
2.7%
25 1
 
2.7%
26 1
 
2.7%
27 1
 
2.7%
28 1
 
2.7%
30 1
 
2.7%
Other values (27) 27
73.0%
ValueCountFrequency (%)
1 1
2.7%
2 1
2.7%
3 1
2.7%
4 1
2.7%
5 1
2.7%
6 1
2.7%
7 1
2.7%
8 1
2.7%
9 1
2.7%
10 1
2.7%
ValueCountFrequency (%)
37 1
2.7%
36 1
2.7%
35 1
2.7%
34 1
2.7%
33 1
2.7%
32 1
2.7%
31 1
2.7%
30 1
2.7%
29 1
2.7%
28 1
2.7%

아파트명
Text

UNIQUE 

Distinct37
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size428.0 B
2023-12-11T01:42:22.834289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length11
Mean length7.8648649
Min length4

Characters and Unicode

Total characters291
Distinct characters98
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)100.0%

Sample

1st row성림아파트
2nd row재흥아파트
3rd row형진강변아파트
4th row롯데캐슬
5th row계룡리슈빌
ValueCountFrequency (%)
성림아파트 1
 
2.7%
동일스위트2차 1
 
2.7%
협성르네상스타운 1
 
2.7%
동일스위트3차 1
 
2.7%
이진캐스빌 1
 
2.7%
가화만사성정관타운 1
 
2.7%
이지더원3차 1
 
2.7%
이지더원5차 1
 
2.7%
가화정관더테라스1차 1
 
2.7%
정관이진캐스빌2차 1
 
2.7%
Other values (27) 27
73.0%
2023-12-11T01:42:23.319338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13
 
4.5%
13
 
4.5%
13
 
4.5%
13
 
4.5%
10
 
3.4%
8
 
2.7%
2 7
 
2.4%
7
 
2.4%
7
 
2.4%
6
 
2.1%
Other values (88) 194
66.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 258
88.7%
Decimal Number 19
 
6.5%
Uppercase Letter 10
 
3.4%
Dash Punctuation 2
 
0.7%
Close Punctuation 1
 
0.3%
Open Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
5.0%
13
 
5.0%
13
 
5.0%
13
 
5.0%
10
 
3.9%
8
 
3.1%
7
 
2.7%
7
 
2.7%
6
 
2.3%
6
 
2.3%
Other values (77) 162
62.8%
Decimal Number
ValueCountFrequency (%)
2 7
36.8%
1 6
31.6%
5 3
15.8%
3 2
 
10.5%
4 1
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
L 4
40.0%
B 4
40.0%
H 2
20.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 258
88.7%
Common 23
 
7.9%
Latin 10
 
3.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13
 
5.0%
13
 
5.0%
13
 
5.0%
13
 
5.0%
10
 
3.9%
8
 
3.1%
7
 
2.7%
7
 
2.7%
6
 
2.3%
6
 
2.3%
Other values (77) 162
62.8%
Common
ValueCountFrequency (%)
2 7
30.4%
1 6
26.1%
5 3
13.0%
3 2
 
8.7%
- 2
 
8.7%
) 1
 
4.3%
( 1
 
4.3%
4 1
 
4.3%
Latin
ValueCountFrequency (%)
L 4
40.0%
B 4
40.0%
H 2
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 258
88.7%
ASCII 33
 
11.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
13
 
5.0%
13
 
5.0%
13
 
5.0%
13
 
5.0%
10
 
3.9%
8
 
3.1%
7
 
2.7%
7
 
2.7%
6
 
2.3%
6
 
2.3%
Other values (77) 162
62.8%
ASCII
ValueCountFrequency (%)
2 7
21.2%
1 6
18.2%
L 4
12.1%
B 4
12.1%
5 3
9.1%
H 2
 
6.1%
3 2
 
6.1%
- 2
 
6.1%
) 1
 
3.0%
( 1
 
3.0%

주소
Text

UNIQUE 

Distinct37
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size428.0 B
2023-12-11T01:42:23.534380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.27027
Min length8

Characters and Unicode

Total characters417
Distinct characters29
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)100.0%

Sample

1st row정관읍 달산리 141-5
2nd row정관읍 달산리 139
3rd row정관읍 매학리 41
4th row정관읍 용수리 1364
5th row정관읍 용수리 1330
ValueCountFrequency (%)
정관읍 35
32.1%
용수리 10
 
9.2%
모전리 10
 
9.2%
달산리 7
 
6.4%
방곡리 6
 
5.5%
정관택지 2
 
1.8%
723 2
 
1.8%
매학리 2
 
1.8%
407 1
 
0.9%
728 1
 
0.9%
Other values (33) 33
30.3%
2023-12-11T01:42:23.994102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
72
17.3%
37
 
8.9%
37
 
8.9%
35
 
8.4%
35
 
8.4%
1 25
 
6.0%
3 20
 
4.8%
7 19
 
4.6%
2 12
 
2.9%
10
 
2.4%
Other values (19) 115
27.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 218
52.3%
Decimal Number 122
29.3%
Space Separator 72
 
17.3%
Dash Punctuation 3
 
0.7%
Uppercase Letter 2
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
37
17.0%
37
17.0%
35
16.1%
35
16.1%
10
 
4.6%
10
 
4.6%
10
 
4.6%
10
 
4.6%
7
 
3.2%
7
 
3.2%
Other values (6) 20
9.2%
Decimal Number
ValueCountFrequency (%)
1 25
20.5%
3 20
16.4%
7 19
15.6%
2 12
9.8%
5 10
 
8.2%
4 9
 
7.4%
8 8
 
6.6%
0 8
 
6.6%
9 6
 
4.9%
6 5
 
4.1%
Space Separator
ValueCountFrequency (%)
72
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 218
52.3%
Common 197
47.2%
Latin 2
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
37
17.0%
37
17.0%
35
16.1%
35
16.1%
10
 
4.6%
10
 
4.6%
10
 
4.6%
10
 
4.6%
7
 
3.2%
7
 
3.2%
Other values (6) 20
9.2%
Common
ValueCountFrequency (%)
72
36.5%
1 25
 
12.7%
3 20
 
10.2%
7 19
 
9.6%
2 12
 
6.1%
5 10
 
5.1%
4 9
 
4.6%
8 8
 
4.1%
0 8
 
4.1%
9 6
 
3.0%
Other values (2) 8
 
4.1%
Latin
ValueCountFrequency (%)
B 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 218
52.3%
ASCII 199
47.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
72
36.2%
1 25
 
12.6%
3 20
 
10.1%
7 19
 
9.5%
2 12
 
6.0%
5 10
 
5.0%
4 9
 
4.5%
8 8
 
4.0%
0 8
 
4.0%
9 6
 
3.0%
Other values (3) 10
 
5.0%
Hangul
ValueCountFrequency (%)
37
17.0%
37
17.0%
35
16.1%
35
16.1%
10
 
4.6%
10
 
4.6%
10
 
4.6%
10
 
4.6%
7
 
3.2%
7
 
3.2%
Other values (6) 20
9.2%
Distinct35
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Memory size428.0 B
Minimum1989-12-12 00:00:00
Maximum2019-06-18 00:00:00
2023-12-11T01:42:24.177915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:42:24.355367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)

세대수
Real number (ℝ)

UNIQUE 

Distinct37
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean760.54054
Minimum66
Maximum1934
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size465.0 B
2023-12-11T01:42:24.538485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum66
5-th percentile128.4
Q1426
median690
Q31028
95-th percentile1662
Maximum1934
Range1868
Interquartile range (IQR)602

Descriptive statistics

Standard deviation494.09826
Coefficient of variation (CV)0.64966722
Kurtosis-0.29284052
Mean760.54054
Median Absolute Deviation (MAD)300
Skewness0.64657042
Sum28140
Variance244133.09
MonotonicityNot monotonic
2023-12-11T01:42:24.762795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
70 1
 
2.7%
258 1
 
2.7%
464 1
 
2.7%
1500 1
 
2.7%
539 1
 
2.7%
560 1
 
2.7%
1035 1
 
2.7%
426 1
 
2.7%
431 1
 
2.7%
398 1
 
2.7%
Other values (27) 27
73.0%
ValueCountFrequency (%)
66 1
2.7%
70 1
2.7%
143 1
2.7%
170 1
2.7%
185 1
2.7%
258 1
2.7%
269 1
2.7%
272 1
2.7%
398 1
2.7%
426 1
2.7%
ValueCountFrequency (%)
1934 1
2.7%
1758 1
2.7%
1638 1
2.7%
1533 1
2.7%
1500 1
2.7%
1301 1
2.7%
1249 1
2.7%
1128 1
2.7%
1035 1
2.7%
1028 1
2.7%

비고
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)8.1%
Missing0
Missing (%)0.0%
Memory size428.0 B
입주사용중
30 
시공중
미착공
 
2

Length

Max length5
Median length5
Mean length4.6216216
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row입주사용중
2nd row입주사용중
3rd row입주사용중
4th row입주사용중
5th row입주사용중

Common Values

ValueCountFrequency (%)
입주사용중 30
81.1%
시공중 5
 
13.5%
미착공 2
 
5.4%

Length

2023-12-11T01:42:24.970940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:42:25.142818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
입주사용중 30
81.1%
시공중 5
 
13.5%
미착공 2
 
5.4%

Interactions

2023-12-11T01:42:21.401068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:42:21.225014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:42:21.492215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:42:21.313195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:42:25.244038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번아파트명주소준공일자세대수비고
연번1.0001.0001.0001.0000.7540.771
아파트명1.0001.0001.0001.0001.0001.000
주소1.0001.0001.0001.0001.0001.000
준공일자1.0001.0001.0001.0000.9151.000
세대수0.7541.0001.0000.9151.0000.000
비고0.7711.0001.0001.0000.0001.000
2023-12-11T01:42:25.372945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번세대수비고
연번1.000-0.0640.575
세대수-0.0641.0000.000
비고0.5750.0001.000

Missing values

2023-12-11T01:42:21.982080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:42:22.138680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번아파트명주소준공일자세대수비고
01성림아파트정관읍 달산리 141-51989-12-1270입주사용중
12재흥아파트정관읍 달산리 1391990-05-31170입주사용중
23형진강변아파트정관읍 매학리 411992-09-17143입주사용중
34롯데캐슬정관읍 용수리 13642008-11-27761입주사용중
45계룡리슈빌정관읍 용수리 13302008-12-16455입주사용중
56신동아파밀리에정관읍 용수리 13222008-12-19655입주사용중
67한진해모로정관읍 용수리 14032008-12-24763입주사용중
78정관휴먼시아1단지정관읍 용수리 13592008-12-291533입주사용중
89현진에버빌정관읍 모전리 7552009-01-23690입주사용중
910센트럴파크정관읍 달산리 11992009-12-11588입주사용중
연번아파트명주소준공일자세대수비고
2728가화정관더테라스1차정관읍 방곡리 4072017-04-24431입주사용중
2829정관이진캐스빌2차정관읍 용수리 12792017-07-30258입주사용중
2930가화정관더테라스2차정관읍 방곡리 4052018-09-15398시공중
3031LH국민임대정관읍 모전리 6812017-02-28830입주사용중
3132LH공공임대(행복주택)정관읍 모전리 7172018-08-31990시공중
3233정관읍도시공사공공임대정관읍 용수리 12752016-05-161934시공중
3334정관파스텔라타운하우스정관읍 방곡리 4372017-12-3166시공중
3435정관두산위브더테라스정관읍 달산리 12442018-03-30272시공중
3536부산정관지구B-1BL연립주택정관택지 B-12019-06-18269미착공
3637부산정관지구B-5BL연립주택정관택지 B-52019-06-18185미착공