Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 74 |
Missing cells | 74 |
Missing cells (%) | 14.3% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 4.4 KiB |
Average record size in memory | 60.8 B |
Variable types
Numeric | 3 |
---|---|
Text | 2 |
Categorical | 1 |
DateTime | 1 |
Dataset
Description | 광주광역시 동구 기계설비성능점검대상건축물 현황 데이터 입니다.데이터는 건물명, 주소, 연면적, 세대수 등으로 구성되어있습니다.건물이 공동주택일 경우 세대수로, 공동주택이 아닌경우 연면적 데이터를 제공하고 있습니다. |
---|---|
Author | 광주광역시 동구 |
URL | https://www.data.go.kr/data/15125535/fileData.do |
데이터기준일자 has constant value "" | Constant |
순번 is highly overall correlated with 연면적_제곱미터 and 2 other fields | High correlation |
연면적_제곱미터 is highly overall correlated with 순번 | High correlation |
세대수 is highly overall correlated with 순번 and 1 other fields | High correlation |
비고 is highly overall correlated with 순번 and 1 other fields | High correlation |
연면적_제곱미터 has 19 (25.7%) missing values | Missing |
세대수 has 55 (74.3%) missing values | Missing |
순번 has unique values | Unique |
건물명 has unique values | Unique |
Reproduction
Analysis started | 2023-12-16 16:02:14.232740 |
---|---|
Analysis finished | 2023-12-16 16:02:19.638993 |
Duration | 5.41 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
순번
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 74 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37.5 |
Minimum | 1 |
---|---|
Maximum | 74 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 798.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 4.65 |
Q1 | 19.25 |
median | 37.5 |
Q3 | 55.75 |
95-th percentile | 70.35 |
Maximum | 74 |
Range | 73 |
Interquartile range (IQR) | 36.5 |
Descriptive statistics
Standard deviation | 21.505813 |
---|---|
Coefficient of variation (CV) | 0.57348835 |
Kurtosis | -1.2 |
Mean | 37.5 |
Median Absolute Deviation (MAD) | 18.5 |
Skewness | 0 |
Sum | 2775 |
Variance | 462.5 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 1.4% |
57 | 1 | 1.4% |
55 | 1 | 1.4% |
54 | 1 | 1.4% |
53 | 1 | 1.4% |
52 | 1 | 1.4% |
51 | 1 | 1.4% |
50 | 1 | 1.4% |
49 | 1 | 1.4% |
48 | 1 | 1.4% |
Other values (64) | 64 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
74 | 1 | |
73 | 1 | |
72 | 1 | |
71 | 1 | |
70 | 1 | |
69 | 1 | |
68 | 1 | |
67 | 1 | |
66 | 1 | |
65 | 1 |
건물명
Text
UNIQUE
 
Distinct | 74 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 724.0 B |
Value | Count | Frequency (%) |
롯데백화점 | 2 | 2.0% |
무등산 | 2 | 2.0% |
금남로 | 2 | 2.0% |
조선대학교 | 1 | 1.0% |
광주 | 1 | 1.0% |
갤러리존 | 1 | 1.0% |
사옥 | 1 | 1.0% |
광주지역사업부 | 1 | 1.0% |
주)아모레퍼시픽 | 1 | 1.0% |
하나은행 | 1 | 1.0% |
Other values (89) | 89 |
Most occurring characters
Value | Count | Frequency (%) |
28 | 4.9% | |
주 | 19 | 3.3% |
광 | 13 | 2.3% |
아 | 12 | 2.1% |
크 | 11 | 1.9% |
파 | 11 | 1.9% |
남 | 11 | 1.9% |
산 | 11 | 1.9% |
등 | 10 | 1.8% |
교 | 10 | 1.8% |
Other values (179) | 433 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 501 | |
Space Separator | 28 | 4.9% |
Uppercase Letter | 20 | 3.5% |
Decimal Number | 12 | 2.1% |
Open Punctuation | 2 | 0.4% |
Close Punctuation | 2 | 0.4% |
Other Punctuation | 2 | 0.4% |
Dash Punctuation | 1 | 0.2% |
Other Symbol | 1 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
주 | 19 | 3.8% |
광 | 13 | 2.6% |
아 | 12 | 2.4% |
크 | 11 | 2.2% |
파 | 11 | 2.2% |
남 | 11 | 2.2% |
산 | 11 | 2.2% |
등 | 10 | 2.0% |
교 | 10 | 2.0% |
학 | 9 | 1.8% |
Other values (156) | 384 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 5 | |
C | 4 | |
K | 2 | 10.0% |
I | 2 | 10.0% |
T | 1 | 5.0% |
X | 1 | 5.0% |
E | 1 | 5.0% |
P | 1 | 5.0% |
L | 1 | 5.0% |
A | 1 | 5.0% |
Decimal Number
Value | Count | Frequency (%) |
1 | 4 | |
2 | 4 | |
4 | 2 | |
7 | 1 | 8.3% |
5 | 1 | 8.3% |
Other Punctuation
Value | Count | Frequency (%) |
, | 1 | |
· | 1 |
Space Separator
Value | Count | Frequency (%) |
28 |
Open Punctuation
Value | Count | Frequency (%) |
( | 2 |
Close Punctuation
Value | Count | Frequency (%) |
) | 2 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 502 | |
Common | 47 | 8.3% |
Latin | 20 | 3.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
주 | 19 | 3.8% |
광 | 13 | 2.6% |
아 | 12 | 2.4% |
크 | 11 | 2.2% |
파 | 11 | 2.2% |
남 | 11 | 2.2% |
산 | 11 | 2.2% |
등 | 10 | 2.0% |
교 | 10 | 2.0% |
학 | 9 | 1.8% |
Other values (157) | 385 |
Common
Value | Count | Frequency (%) |
28 | ||
1 | 4 | 8.5% |
2 | 4 | 8.5% |
4 | 2 | 4.3% |
( | 2 | 4.3% |
) | 2 | 4.3% |
7 | 1 | 2.1% |
- | 1 | 2.1% |
, | 1 | 2.1% |
5 | 1 | 2.1% |
Latin
Value | Count | Frequency (%) |
S | 5 | |
C | 4 | |
K | 2 | 10.0% |
I | 2 | 10.0% |
T | 1 | 5.0% |
X | 1 | 5.0% |
E | 1 | 5.0% |
P | 1 | 5.0% |
L | 1 | 5.0% |
A | 1 | 5.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 501 | |
ASCII | 66 | 11.6% |
None | 2 | 0.4% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
28 | ||
S | 5 | 7.6% |
C | 4 | 6.1% |
1 | 4 | 6.1% |
2 | 4 | 6.1% |
4 | 2 | 3.0% |
K | 2 | 3.0% |
I | 2 | 3.0% |
( | 2 | 3.0% |
) | 2 | 3.0% |
Other values (11) | 11 | 16.7% |
Hangul
Value | Count | Frequency (%) |
주 | 19 | 3.8% |
광 | 13 | 2.6% |
아 | 12 | 2.4% |
크 | 11 | 2.2% |
파 | 11 | 2.2% |
남 | 11 | 2.2% |
산 | 11 | 2.2% |
등 | 10 | 2.0% |
교 | 10 | 2.0% |
학 | 9 | 1.8% |
Other values (156) | 384 |
None
Value | Count | Frequency (%) |
㈜ | 1 | |
· | 1 |
주소
Text
Distinct | 73 |
---|---|
Distinct (%) | 98.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 724.0 B |
Value | Count | Frequency (%) |
계림동 | 10 | 6.8% |
학동 | 5 | 3.4% |
금남로5가 | 5 | 3.4% |
동명동 | 5 | 3.4% |
대인동 | 5 | 3.4% |
용산동 | 5 | 3.4% |
서석동 | 5 | 3.4% |
소태동 | 4 | 2.7% |
수기동 | 3 | 2.0% |
지산동 | 3 | 2.0% |
Other values (88) | 98 |
Most occurring characters
Value | Count | Frequency (%) |
74 | 12.8% | |
동 | 64 | 11.1% |
1 | 54 | 9.4% |
2 | 39 | 6.8% |
- | 34 | 5.9% |
3 | 29 | 5.0% |
5 | 25 | 4.3% |
0 | 21 | 3.6% |
9 | 18 | 3.1% |
남 | 17 | 2.9% |
Other values (31) | 202 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 242 | |
Other Letter | 227 | |
Space Separator | 74 | 12.8% |
Dash Punctuation | 34 | 5.9% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 64 | |
남 | 17 | 7.5% |
로 | 16 | 7.0% |
가 | 14 | 6.2% |
금 | 13 | 5.7% |
산 | 11 | 4.8% |
림 | 11 | 4.8% |
계 | 10 | 4.4% |
석 | 5 | 2.2% |
서 | 5 | 2.2% |
Other values (19) | 61 |
Decimal Number
Value | Count | Frequency (%) |
1 | 54 | |
2 | 39 | |
3 | 29 | |
5 | 25 | |
0 | 21 | 8.7% |
9 | 18 | 7.4% |
6 | 17 | 7.0% |
4 | 15 | 6.2% |
8 | 14 | 5.8% |
7 | 10 | 4.1% |
Space Separator
Value | Count | Frequency (%) |
74 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 34 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 350 | |
Hangul | 227 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 64 | |
남 | 17 | 7.5% |
로 | 16 | 7.0% |
가 | 14 | 6.2% |
금 | 13 | 5.7% |
산 | 11 | 4.8% |
림 | 11 | 4.8% |
계 | 10 | 4.4% |
석 | 5 | 2.2% |
서 | 5 | 2.2% |
Other values (19) | 61 |
Common
Value | Count | Frequency (%) |
74 | ||
1 | 54 | |
2 | 39 | |
- | 34 | |
3 | 29 | 8.3% |
5 | 25 | 7.1% |
0 | 21 | 6.0% |
9 | 18 | 5.1% |
6 | 17 | 4.9% |
4 | 15 | 4.3% |
Other values (2) | 24 | 6.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 350 | |
Hangul | 227 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
74 | ||
1 | 54 | |
2 | 39 | |
- | 34 | |
3 | 29 | 8.3% |
5 | 25 | 7.1% |
0 | 21 | 6.0% |
9 | 18 | 5.1% |
6 | 17 | 4.9% |
4 | 15 | 4.3% |
Other values (2) | 24 | 6.9% |
Hangul
Value | Count | Frequency (%) |
동 | 64 | |
남 | 17 | 7.5% |
로 | 16 | 7.0% |
가 | 14 | 6.2% |
금 | 13 | 5.7% |
산 | 11 | 4.8% |
림 | 11 | 4.8% |
계 | 10 | 4.4% |
석 | 5 | 2.2% |
서 | 5 | 2.2% |
Other values (19) | 61 |
연면적_제곱미터
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 55 |
---|---|
Distinct (%) | 100.0% |
Missing | 19 |
Missing (%) | 25.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 222258.77 |
Minimum | 10200 |
---|---|
Maximum | 10148363 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 798.0 B |
Quantile statistics
Minimum | 10200 |
---|---|
5-th percentile | 10578.72 |
Q1 | 12603.925 |
median | 18756.623 |
Q3 | 30338.198 |
95-th percentile | 118614.71 |
Maximum | 10148363 |
Range | 10138163 |
Interquartile range (IQR) | 17734.273 |
Descriptive statistics
Standard deviation | 1365994.4 |
---|---|
Coefficient of variation (CV) | 6.145964 |
Kurtosis | 54.529383 |
Mean | 222258.77 |
Median Absolute Deviation (MAD) | 7765.5825 |
Skewness | 7.371137 |
Sum | 12224233 |
Variance | 1.8659408 × 1012 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
131868.27 | 1 | 1.4% |
17381.7 | 1 | 1.4% |
17239.605 | 1 | 1.4% |
16879.0 | 1 | 1.4% |
16575.15 | 1 | 1.4% |
16931.9 | 1 | 1.4% |
14957.46 | 1 | 1.4% |
14937.03 | 1 | 1.4% |
13309.01 | 1 | 1.4% |
13251.36 | 1 | 1.4% |
Other values (45) | 45 | |
(Missing) | 19 |
Value | Count | Frequency (%) |
10200.0 | 1 | |
10378.15 | 1 | |
10506.83 | 1 | |
10609.53 | 1 | |
10651.17 | 1 | |
10797.02 | 1 | |
10797.47 | 1 | |
10891.76 | 1 | |
10991.04 | 1 | |
11249.92 | 1 |
Value | Count | Frequency (%) |
10148363.0 | 1 | |
647593.25 | 1 | |
131868.27 | 1 | |
112934.62 | 1 | |
85264.46 | 1 | |
60559.82 | 1 | |
49632.54 | 1 | |
49209.36 | 1 | |
46990.35 | 1 | |
45603.702 | 1 |
세대수
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 19 |
---|---|
Distinct (%) | 100.0% |
Missing | 55 |
Missing (%) | 74.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 915.52632 |
Minimum | 528 |
---|---|
Maximum | 2336 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 798.0 B |
Quantile statistics
Minimum | 528 |
---|---|
5-th percentile | 542.4 |
Q1 | 651 |
median | 784 |
Q3 | 955 |
95-th percentile | 1777.1 |
Maximum | 2336 |
Range | 1808 |
Interquartile range (IQR) | 304 |
Descriptive statistics
Standard deviation | 457.57991 |
---|---|
Coefficient of variation (CV) | 0.49979984 |
Kurtosis | 4.6174275 |
Mean | 915.52632 |
Median Absolute Deviation (MAD) | 168 |
Skewness | 2.0998979 |
Sum | 17395 |
Variance | 209379.37 |
Monotonicity | Strictly decreasing |
Value | Count | Frequency (%) |
1715 | 1 | 1.4% |
528 | 1 | 1.4% |
544 | 1 | 1.4% |
570 | 1 | 1.4% |
580 | 1 | 1.4% |
648 | 1 | 1.4% |
654 | 1 | 1.4% |
658 | 1 | 1.4% |
690 | 1 | 1.4% |
2336 | 1 | 1.4% |
Other values (9) | 9 | 12.2% |
(Missing) | 55 |
Value | Count | Frequency (%) |
528 | 1 | |
544 | 1 | |
570 | 1 | |
580 | 1 | |
648 | 1 | |
654 | 1 | |
658 | 1 | |
690 | 1 | |
772 | 1 | |
784 | 1 |
Value | Count | Frequency (%) |
2336 | 1 | |
1715 | 1 | |
1410 | 1 | |
1074 | 1 | |
958 | 1 | |
952 | 1 | |
908 | 1 | |
820 | 1 | |
794 | 1 | |
784 | 1 |
비고
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 2.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 724.0 B |
<NA> | |
---|---|
공동주택 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 55 | |
공동주택 | 19 | 25.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 55 | |
공동주택 | 19 | 25.7% |
데이터기준일자
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 724.0 B |
Minimum | 2023-12-11 00:00:00 |
---|---|
Maximum | 2023-12-11 00:00:00 |
순번 | 건물명 | 주소 | 연면적_제곱미터 | 세대수 | |
---|---|---|---|---|---|
순번 | 1.000 | 1.000 | 0.929 | 0.208 | 1.000 |
건물명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
주소 | 0.929 | 1.000 | 1.000 | 1.000 | 1.000 |
연면적_제곱미터 | 0.208 | 1.000 | 1.000 | 1.000 | NaN |
세대수 | 1.000 | 1.000 | 1.000 | NaN | 1.000 |
순번 | 연면적_제곱미터 | 세대수 | 비고 | |
---|---|---|---|---|
순번 | 1.000 | -0.893 | -1.000 | 1.000 |
연면적_제곱미터 | -0.893 | 1.000 | NaN | 0.000 |
세대수 | -1.000 | NaN | 1.000 | 1.000 |
비고 | 1.000 | 0.000 | 1.000 | 1.000 |
순번 | 건물명 | 주소 | 연면적_제곱미터 | 세대수 | 비고 | 데이터기준일자 | |
---|---|---|---|---|---|---|---|
0 | 1 | 조선대학교 | 서석동 375 | 647593.25 | <NA> | <NA> | 2023-12-11 |
1 | 2 | 국립아시아문화전당 | 광산동 113 | 131868.27 | <NA> | <NA> | 2023-12-11 |
2 | 3 | 전남대학교병원 | 학동 8 | 112934.62 | <NA> | <NA> | 2023-12-11 |
3 | 4 | 롯데백화점 | 대인동 7-1 | 85264.46 | <NA> | <NA> | 2023-12-11 |
4 | 5 | 조선대학교병원 | 학동 539 | 60559.82 | <NA> | <NA> | 2023-12-11 |
5 | 6 | 조선이공대학교 | 서석동 290 | 49632.54 | <NA> | <NA> | 2023-12-11 |
6 | 7 | (주)케이티건물 주1동 | 서석동 31-9 | 49209.36 | <NA> | <NA> | 2023-12-11 |
7 | 8 | 광주은행 본점 | 대인동 7-12 | 46990.35 | <NA> | <NA> | 2023-12-11 |
8 | 9 | 유탑유블레스 원시티 | 수기동 68-1 | 45603.702 | <NA> | <NA> | 2023-12-11 |
9 | 10 | 용산차량기지 | 용산동 1-1 | 41158.27 | <NA> | <NA> | 2023-12-11 |
순번 | 건물명 | 주소 | 연면적_제곱미터 | 세대수 | 비고 | 데이터기준일자 | |
---|---|---|---|---|---|---|---|
64 | 65 | 월남호반베르디움 2차 | 월남동 638 | <NA> | 784 | 공동주택 | 2023-12-11 |
65 | 66 | 무등산골드클래스2차 | 용산동 284 | <NA> | 772 | 공동주택 | 2023-12-11 |
66 | 67 | 무등산골드클래스 | 소태동 1103 | <NA> | 690 | 공동주택 | 2023-12-11 |
67 | 68 | 계림두산위브 | 계림동 52 | <NA> | 658 | 공동주택 | 2023-12-11 |
68 | 69 | 월남호반베르디움1차 | 월남동 620 | <NA> | 654 | 공동주택 | 2023-12-11 |
69 | 70 | 푸른길 두산위브 | 계림동 1815 | <NA> | 648 | 공동주택 | 2023-12-11 |
70 | 71 | 모아미래도아파트 | 소태동 501 | <NA> | 580 | 공동주택 | 2023-12-11 |
71 | 72 | 용산지구 모아엘가 에듀파크 | 용산동 663 | <NA> | 570 | 공동주택 | 2023-12-11 |
72 | 73 | 내남지구2차진아리채 | 내남동 904 | <NA> | 544 | 공동주택 | 2023-12-11 |
73 | 74 | 광주용산엘에이치1단지 | 용산동 669 | <NA> | 528 | 공동주택 | 2023-12-11 |