Dataset statistics
Number of variables | 19 |
---|---|
Number of observations | 108 |
Missing cells | 360 |
Missing cells (%) | 17.5% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 16.8 KiB |
Average record size in memory | 159.2 B |
Variable types
Categorical | 3 |
---|---|
Text | 9 |
Numeric | 5 |
DateTime | 2 |
Dataset
Description | 서울특별시 동대문구 건설현장시공정보에 대한 데이터로건축구분,대지위치,허가일자,착공일자 등 등의 항목을 제공합니다. |
---|---|
Author | 서울특별시 동대문구 |
URL | https://www.data.go.kr/data/15004949/fileData.do |
데이터기준일자 has constant value "" | Constant |
연면적(m2)_증축연면적(m2) is highly overall correlated with 최대지상층수 and 4 other fields | High correlation |
최대지상층수 is highly overall correlated with 연면적(m2)_증축연면적(m2) and 3 other fields | High correlation |
최대지하층수 is highly overall correlated with 연면적(m2)_증축연면적(m2) and 3 other fields | High correlation |
세대수 is highly overall correlated with 연면적(m2)_증축연면적(m2) and 4 other fields | High correlation |
호수 is highly overall correlated with 연면적(m2)_증축연면적(m2) and 2 other fields | High correlation |
건축구분 is highly overall correlated with 세대수 | High correlation |
가구수 is highly overall correlated with 연면적(m2)_증축연면적(m2) and 1 other fields | High correlation |
가구수 is highly imbalanced (70.0%) | Imbalance |
착공일자 has 3 (2.8%) missing values | Missing |
부속용도 has 31 (28.7%) missing values | Missing |
세대수 has 67 (62.0%) missing values | Missing |
호수 has 68 (63.0%) missing values | Missing |
시공업체전화번호 has 41 (38.0%) missing values | Missing |
시공업체명 has 5 (4.6%) missing values | Missing |
설계사무소전화번호 has 52 (48.1%) missing values | Missing |
설계사무소명 has 21 (19.4%) missing values | Missing |
감리사무소전화번호 has 50 (46.3%) missing values | Missing |
감리사무소명 has 21 (19.4%) missing values | Missing |
대지위치 has unique values | Unique |
최대지하층수 has 38 (35.2%) zeros | Zeros |
Reproduction
Analysis started | 2024-04-21 00:59:06.515536 |
---|---|
Analysis finished | 2024-04-21 00:59:12.387207 |
Duration | 5.87 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
건축구분
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 4.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 996.0 B |
신축 | |
---|---|
해체신고 | |
해체허가 | 7 |
주택사업승인(신축) | 4 |
증축 | 2 |
Length
Max length | 10 |
---|---|
Median length | 2 |
Mean length | 2.6666667 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 신축 |
---|---|
2nd row | 신축 |
3rd row | 증축 |
4th row | 신축 |
5th row | 신축 |
Common Values
Value | Count | Frequency (%) |
신축 | 82 | |
해체신고 | 13 | 12.0% |
해체허가 | 7 | 6.5% |
주택사업승인(신축) | 4 | 3.7% |
증축 | 2 | 1.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
신축 | 82 | |
해체신고 | 13 | 12.0% |
해체허가 | 7 | 6.5% |
주택사업승인(신축 | 4 | 3.7% |
증축 | 2 | 1.9% |
대지위치
Text
UNIQUE
 
Distinct | 108 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 996.0 B |
Length
Max length | 27 |
---|---|
Median length | 26 |
Mean length | 22.203704 |
Min length | 19 |
Characters and Unicode
Total characters | 2398 |
---|---|
Distinct characters | 43 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 108 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 서울특별시 동대문구 답십리동 21-78 |
---|---|
2nd row | 서울특별시 동대문구 답십리동 22-3 외2필지 |
3rd row | 서울특별시 동대문구 답십리동 252-13 |
4th row | 서울특별시 동대문구 답십리동 266-1 |
5th row | 서울특별시 동대문구 답십리동 467-12 외1필지 |
Value | Count | Frequency (%) |
서울특별시 | 108 | |
동대문구 | 108 | |
장안동 | 22 | 4.8% |
외1필지 | 20 | 4.3% |
답십리동 | 19 | 4.1% |
전농동 | 16 | 3.5% |
용두동 | 13 | 2.8% |
휘경동 | 11 | 2.4% |
신설동 | 8 | 1.7% |
외2필지 | 6 | 1.3% |
Other values (115) | 132 |
Most occurring characters
Value | Count | Frequency (%) |
355 | 14.8% | |
동 | 216 | 9.0% |
문 | 114 | 4.8% |
서 | 108 | 4.5% |
특 | 108 | 4.5% |
별 | 108 | 4.5% |
시 | 108 | 4.5% |
대 | 108 | 4.5% |
구 | 108 | 4.5% |
울 | 108 | 4.5% |
Other values (33) | 957 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1413 | |
Decimal Number | 523 | 21.8% |
Space Separator | 355 | 14.8% |
Dash Punctuation | 107 | 4.5% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 216 | |
문 | 114 | 8.1% |
서 | 108 | 7.6% |
특 | 108 | 7.6% |
별 | 108 | 7.6% |
시 | 108 | 7.6% |
대 | 108 | 7.6% |
구 | 108 | 7.6% |
울 | 108 | 7.6% |
외 | 32 | 2.3% |
Other values (21) | 295 |
Decimal Number
Value | Count | Frequency (%) |
1 | 93 | |
2 | 84 | |
3 | 65 | |
4 | 59 | |
6 | 49 | |
9 | 43 | |
8 | 39 | |
5 | 38 | |
7 | 31 | 5.9% |
0 | 22 | 4.2% |
Space Separator
Value | Count | Frequency (%) |
355 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 107 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1413 | |
Common | 985 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 216 | |
문 | 114 | 8.1% |
서 | 108 | 7.6% |
특 | 108 | 7.6% |
별 | 108 | 7.6% |
시 | 108 | 7.6% |
대 | 108 | 7.6% |
구 | 108 | 7.6% |
울 | 108 | 7.6% |
외 | 32 | 2.3% |
Other values (21) | 295 |
Common
Value | Count | Frequency (%) |
355 | ||
- | 107 | 10.9% |
1 | 93 | 9.4% |
2 | 84 | 8.5% |
3 | 65 | 6.6% |
4 | 59 | 6.0% |
6 | 49 | 5.0% |
9 | 43 | 4.4% |
8 | 39 | 4.0% |
5 | 38 | 3.9% |
Other values (2) | 53 | 5.4% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1413 | |
ASCII | 985 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
355 | ||
- | 107 | 10.9% |
1 | 93 | 9.4% |
2 | 84 | 8.5% |
3 | 65 | 6.6% |
4 | 59 | 6.0% |
6 | 49 | 5.0% |
9 | 43 | 4.4% |
8 | 39 | 4.0% |
5 | 38 | 3.9% |
Other values (2) | 53 | 5.4% |
Hangul
Value | Count | Frequency (%) |
동 | 216 | |
문 | 114 | 8.1% |
서 | 108 | 7.6% |
특 | 108 | 7.6% |
별 | 108 | 7.6% |
시 | 108 | 7.6% |
대 | 108 | 7.6% |
구 | 108 | 7.6% |
울 | 108 | 7.6% |
외 | 32 | 2.3% |
Other values (21) | 295 |
연면적(m2)_증축연면적(m2)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 106 |
---|---|
Distinct (%) | 98.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4977.8035 |
Minimum | 31.07 |
---|---|
Maximum | 64523.52 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.1 KiB |
Quantile statistics
Minimum | 31.07 |
---|---|
5-th percentile | 64.1895 |
Q1 | 361.88 |
median | 674.645 |
Q3 | 3290.3225 |
95-th percentile | 22619.579 |
Maximum | 64523.52 |
Range | 64492.45 |
Interquartile range (IQR) | 2928.4425 |
Descriptive statistics
Standard deviation | 10951.467 |
---|---|
Coefficient of variation (CV) | 2.20006 |
Kurtosis | 15.718345 |
Mean | 4977.8035 |
Median Absolute Deviation (MAD) | 609.89 |
Skewness | 3.7576161 |
Sum | 537602.78 |
Variance | 1.1993462 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2352.37 | 2 | 1.9% |
221.94 | 2 | 1.9% |
517.82 | 1 | 0.9% |
372.0 | 1 | 0.9% |
21985.79 | 1 | 0.9% |
4137.9 | 1 | 0.9% |
487.88 | 1 | 0.9% |
106.0 | 1 | 0.9% |
7620.12 | 1 | 0.9% |
10789.76 | 1 | 0.9% |
Other values (96) | 96 |
Value | Count | Frequency (%) |
31.07 | 1 | |
31.44 | 1 | |
32.5 | 1 | |
44.07 | 1 | |
58.74 | 1 | |
62.87 | 1 | |
66.64 | 1 | |
106.0 | 1 | |
121.36 | 1 | |
131.1 | 1 |
Value | Count | Frequency (%) |
64523.52 | 1 | |
60759.62 | 1 | |
50203.24 | 1 | |
29680.5 | 1 | |
24419.2 | 1 | |
22785.35 | 1 | |
22311.72 | 1 | |
21985.79 | 1 | |
18500.55 | 1 | |
17617.22 | 1 |
허가(신고)일자
Date
Distinct | 91 |
---|---|
Distinct (%) | 85.0% |
Missing | 1 |
Missing (%) | 0.9% |
Memory size | 996.0 B |
Minimum | 2018-09-21 00:00:00 |
---|---|
Maximum | 2023-03-09 00:00:00 |
착공일자
Text
MISSING
 
Distinct | 89 |
---|---|
Distinct (%) | 84.8% |
Missing | 3 |
Missing (%) | 2.8% |
Memory size | 996.0 B |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 9.9238095 |
Min length | 2 |
Characters and Unicode
Total characters | 1042 |
---|---|
Distinct characters | 13 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 75 ? |
---|---|
Unique (%) | 71.4% |
Sample
1st row | 2022-10-25 |
---|---|
2nd row | 2023-02-28 |
3rd row | 2020-11-20 |
4th row | 2022-08-03 |
5th row | 2022-06-24 |
Value | Count | Frequency (%) |
2023-03-09 | 3 | 2.9% |
2023-02-20 | 3 | 2.9% |
2020-05-11 | 2 | 1.9% |
2022-10-25 | 2 | 1.9% |
2022-10-01 | 2 | 1.9% |
2022-08-16 | 2 | 1.9% |
2022-05-31 | 2 | 1.9% |
2022-09-05 | 2 | 1.9% |
2023-02-28 | 2 | 1.9% |
2023-02-13 | 2 | 1.9% |
Other values (79) | 83 |
Most occurring characters
Value | Count | Frequency (%) |
2 | 320 | |
0 | 255 | |
- | 208 | |
1 | 106 | 10.2% |
3 | 49 | 4.7% |
5 | 24 | 2.3% |
6 | 20 | 1.9% |
9 | 16 | 1.5% |
8 | 16 | 1.5% |
4 | 15 | 1.4% |
Other values (3) | 13 | 1.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 832 | |
Dash Punctuation | 208 | 20.0% |
Other Letter | 2 | 0.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 320 | |
0 | 255 | |
1 | 106 | 12.7% |
3 | 49 | 5.9% |
5 | 24 | 2.9% |
6 | 20 | 2.4% |
9 | 16 | 1.9% |
8 | 16 | 1.9% |
4 | 15 | 1.8% |
7 | 11 | 1.3% |
Other Letter
Value | Count | Frequency (%) |
중 | 1 | |
지 | 1 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 208 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1040 | |
Hangul | 2 | 0.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
2 | 320 | |
0 | 255 | |
- | 208 | |
1 | 106 | 10.2% |
3 | 49 | 4.7% |
5 | 24 | 2.3% |
6 | 20 | 1.9% |
9 | 16 | 1.5% |
8 | 16 | 1.5% |
4 | 15 | 1.4% |
Hangul
Value | Count | Frequency (%) |
중 | 1 | |
지 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1040 | |
Hangul | 2 | 0.2% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
2 | 320 | |
0 | 255 | |
- | 208 | |
1 | 106 | 10.2% |
3 | 49 | 4.7% |
5 | 24 | 2.3% |
6 | 20 | 1.9% |
9 | 16 | 1.5% |
8 | 16 | 1.5% |
4 | 15 | 1.4% |
Hangul
Value | Count | Frequency (%) |
중 | 1 | |
지 | 1 |
최대지상층수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 21 |
---|---|
Distinct (%) | 19.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8.462963 |
Minimum | 1 |
---|---|
Maximum | 43 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 4 |
median | 5.5 |
Q3 | 13.25 |
95-th percentile | 20 |
Maximum | 43 |
Range | 42 |
Interquartile range (IQR) | 9.25 |
Descriptive statistics
Standard deviation | 7.0623735 |
---|---|
Coefficient of variation (CV) | 0.83450366 |
Kurtosis | 4.1013782 |
Mean | 8.462963 |
Median Absolute Deviation (MAD) | 2.5 |
Skewness | 1.6680242 |
Sum | 914 |
Variance | 49.87712 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 21 | |
6 | 12 | |
4 | 11 | |
1 | 8 | 7.4% |
3 | 8 | 7.4% |
20 | 7 | 6.5% |
7 | 6 | 5.6% |
17 | 6 | 5.6% |
2 | 6 | 5.6% |
10 | 3 | 2.8% |
Other values (11) | 20 |
Value | Count | Frequency (%) |
1 | 8 | 7.4% |
2 | 6 | 5.6% |
3 | 8 | 7.4% |
4 | 11 | |
5 | 21 | |
6 | 12 | |
7 | 6 | 5.6% |
8 | 2 | 1.9% |
9 | 1 | 0.9% |
10 | 3 | 2.8% |
Value | Count | Frequency (%) |
43 | 1 | 0.9% |
25 | 1 | 0.9% |
23 | 1 | 0.9% |
20 | 7 | |
19 | 2 | 1.9% |
18 | 3 | |
17 | 6 | |
16 | 1 | 0.9% |
15 | 2 | 1.9% |
14 | 3 |
최대지하층수
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 8 |
---|---|
Distinct (%) | 7.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.3796296 |
Minimum | 0 |
---|---|
Maximum | 7 |
Zeros | 38 |
Zeros (%) | 35.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 1 |
Q3 | 2 |
95-th percentile | 5 |
Maximum | 7 |
Range | 7 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.6784371 |
---|---|
Coefficient of variation (CV) | 1.2165853 |
Kurtosis | 1.9330669 |
Mean | 1.3796296 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 1.6025711 |
Sum | 149 |
Variance | 2.8171513 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 38 | |
0 | 38 | |
2 | 15 | 13.9% |
5 | 5 | 4.6% |
6 | 4 | 3.7% |
4 | 4 | 3.7% |
3 | 3 | 2.8% |
7 | 1 | 0.9% |
Value | Count | Frequency (%) |
0 | 38 | |
1 | 38 | |
2 | 15 | 13.9% |
3 | 3 | 2.8% |
4 | 4 | 3.7% |
5 | 5 | 4.6% |
6 | 4 | 3.7% |
7 | 1 | 0.9% |
Value | Count | Frequency (%) |
7 | 1 | 0.9% |
6 | 4 | 3.7% |
5 | 5 | 4.6% |
4 | 4 | 3.7% |
3 | 3 | 2.8% |
2 | 15 | 13.9% |
1 | 38 | |
0 | 38 |
주용도
Categorical
Distinct | 12 |
---|---|
Distinct (%) | 11.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 996.0 B |
공동주택 | |
---|---|
업무시설 | |
제2종근린생활시설 | |
단독주택 | |
제1종근린생활시설 | |
Other values (7) |
Length
Max length | 16 |
---|---|
Median length | 4 |
Mean length | 5.5 |
Min length | 4 |
Unique
Unique | 6 ? |
---|---|
Unique (%) | 5.6% |
Sample
1st row | 제2종근린생활시설 |
---|---|
2nd row | 공동주택 |
3rd row | 제1종근린생활시설 |
4th row | 제1종근린생활시설 |
5th row | 공동주택 |
Common Values
Value | Count | Frequency (%) |
공동주택 | 33 | |
업무시설 | 23 | |
제2종근린생활시설 | 18 | |
단독주택 | 17 | |
제1종근린생활시설 | 8 | 7.4% |
근린생활시설 | 3 | 2.8% |
공동주택(부속 도시형생활주택) | 1 | 0.9% |
근린생활시설(다가구주택) | 1 | 0.9% |
판매시설 | 1 | 0.9% |
숙박시설 | 1 | 0.9% |
Other values (2) | 2 | 1.9% |
Length
Value | Count | Frequency (%) |
공동주택 | 33 | |
업무시설 | 23 | |
제2종근린생활시설 | 18 | |
단독주택 | 17 | |
제1종근린생활시설 | 8 | 7.3% |
근린생활시설 | 3 | 2.8% |
공동주택(부속 | 1 | 0.9% |
도시형생활주택 | 1 | 0.9% |
근린생활시설(다가구주택 | 1 | 0.9% |
판매시설 | 1 | 0.9% |
Other values (3) | 3 | 2.8% |
부속용도
Text
MISSING
 
Distinct | 57 |
---|---|
Distinct (%) | 74.0% |
Missing | 31 |
Missing (%) | 28.7% |
Memory size | 996.0 B |
Length
Max length | 26 |
---|---|
Median length | 16 |
Mean length | 10.753247 |
Min length | 2 |
Characters and Unicode
Total characters | 828 |
---|---|
Distinct characters | 67 |
Distinct categories | 7 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 47 ? |
---|---|
Unique (%) | 61.0% |
Sample
1st row | 사무소 |
---|---|
2nd row | 도시형생활주택(단지형다세대주택) |
3rd row | 소매점,의원,사무소 |
4th row | 제2종근린생활시설 |
5th row | 도시형생활주택(단지형다세대주택) |
Value | Count | Frequency (%) |
오피스텔 | 14 | 12.7% |
및 | 9 | 8.2% |
근린생활시설 | 7 | 6.4% |
다세대주택 | 7 | 6.4% |
도시형생활주택 | 6 | 5.5% |
생활주택 | 4 | 3.6% |
사무소 | 3 | 2.7% |
도시형 | 3 | 2.7% |
다중주택 | 3 | 2.7% |
다가구주택 | 2 | 1.8% |
Other values (46) | 52 |
Most occurring characters
Value | Count | Frequency (%) |
주 | 57 | 6.9% |
택 | 54 | 6.5% |
시 | 48 | 5.8% |
생 | 44 | 5.3% |
활 | 43 | 5.2% |
33 | 4.0% | |
형 | 31 | 3.7% |
다 | 30 | 3.6% |
설 | 28 | 3.4% |
근 | 24 | 2.9% |
Other values (57) | 436 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 699 | |
Space Separator | 33 | 4.0% |
Decimal Number | 24 | 2.9% |
Other Punctuation | 23 | 2.8% |
Open Punctuation | 22 | 2.7% |
Close Punctuation | 22 | 2.7% |
Dash Punctuation | 5 | 0.6% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
주 | 57 | 8.2% |
택 | 54 | 7.7% |
시 | 48 | 6.9% |
생 | 44 | 6.3% |
활 | 43 | 6.2% |
형 | 31 | 4.4% |
다 | 30 | 4.3% |
설 | 28 | 4.0% |
근 | 24 | 3.4% |
린 | 23 | 3.3% |
Other values (45) | 317 |
Decimal Number
Value | Count | Frequency (%) |
2 | 15 | |
1 | 6 | 25.0% |
0 | 1 | 4.2% |
4 | 1 | 4.2% |
8 | 1 | 4.2% |
Other Punctuation
Value | Count | Frequency (%) |
, | 17 | |
/ | 5 | 21.7% |
. | 1 | 4.3% |
Space Separator
Value | Count | Frequency (%) |
33 |
Open Punctuation
Value | Count | Frequency (%) |
( | 22 |
Close Punctuation
Value | Count | Frequency (%) |
) | 22 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 5 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 699 | |
Common | 129 | 15.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
주 | 57 | 8.2% |
택 | 54 | 7.7% |
시 | 48 | 6.9% |
생 | 44 | 6.3% |
활 | 43 | 6.2% |
형 | 31 | 4.4% |
다 | 30 | 4.3% |
설 | 28 | 4.0% |
근 | 24 | 3.4% |
린 | 23 | 3.3% |
Other values (45) | 317 |
Common
Value | Count | Frequency (%) |
33 | ||
( | 22 | |
) | 22 | |
, | 17 | |
2 | 15 | |
1 | 6 | 4.7% |
/ | 5 | 3.9% |
- | 5 | 3.9% |
0 | 1 | 0.8% |
. | 1 | 0.8% |
Other values (2) | 2 | 1.6% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 699 | |
ASCII | 129 | 15.6% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
주 | 57 | 8.2% |
택 | 54 | 7.7% |
시 | 48 | 6.9% |
생 | 44 | 6.3% |
활 | 43 | 6.2% |
형 | 31 | 4.4% |
다 | 30 | 4.3% |
설 | 28 | 4.0% |
근 | 24 | 3.4% |
린 | 23 | 3.3% |
Other values (45) | 317 |
ASCII
Value | Count | Frequency (%) |
33 | ||
( | 22 | |
) | 22 | |
, | 17 | |
2 | 15 | |
1 | 6 | 4.7% |
/ | 5 | 3.9% |
- | 5 | 3.9% |
0 | 1 | 0.8% |
. | 1 | 0.8% |
Other values (2) | 2 | 1.6% |
세대수
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 27 |
---|---|
Distinct (%) | 65.9% |
Missing | 67 |
Missing (%) | 62.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 48.682927 |
Minimum | 1 |
---|---|
Maximum | 349 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 8 |
Q1 | 12 |
median | 19 |
Q3 | 40 |
95-th percentile | 284 |
Maximum | 349 |
Range | 348 |
Interquartile range (IQR) | 28 |
Descriptive statistics
Standard deviation | 79.502654 |
---|---|
Coefficient of variation (CV) | 1.6330705 |
Kurtosis | 7.8537962 |
Mean | 48.682927 |
Median Absolute Deviation (MAD) | 9 |
Skewness | 2.9082194 |
Sum | 1996 |
Variance | 6320.672 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10 | 5 | 4.6% |
12 | 3 | 2.8% |
8 | 3 | 2.8% |
65 | 2 | 1.9% |
38 | 2 | 1.9% |
15 | 2 | 1.9% |
42 | 2 | 1.9% |
16 | 2 | 1.9% |
19 | 2 | 1.9% |
40 | 1 | 0.9% |
Other values (17) | 17 | 15.7% |
(Missing) | 67 |
Value | Count | Frequency (%) |
1 | 1 | 0.9% |
8 | 3 | |
9 | 1 | 0.9% |
10 | 5 | |
12 | 3 | |
14 | 1 | 0.9% |
15 | 2 | 1.9% |
16 | 2 | 1.9% |
17 | 1 | 0.9% |
19 | 2 | 1.9% |
Value | Count | Frequency (%) |
349 | 1 | |
299 | 1 | |
284 | 1 | |
143 | 1 | |
99 | 1 | |
65 | 2 | |
53 | 1 | |
42 | 2 | |
40 | 1 | |
38 | 2 |
호수
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 27 |
---|---|
Distinct (%) | 67.5% |
Missing | 68 |
Missing (%) | 63.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 72.675 |
Minimum | 1 |
---|---|
Maximum | 409 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3.5 |
median | 42 |
Q3 | 95.75 |
95-th percentile | 300.75 |
Maximum | 409 |
Range | 408 |
Interquartile range (IQR) | 92.25 |
Descriptive statistics
Standard deviation | 97.621033 |
---|---|
Coefficient of variation (CV) | 1.3432547 |
Kurtosis | 3.4616225 |
Mean | 72.675 |
Median Absolute Deviation (MAD) | 40 |
Skewness | 1.9117412 |
Sum | 2907 |
Variance | 9529.866 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 5 | 4.6% |
2 | 5 | 4.6% |
6 | 3 | 2.8% |
4 | 2 | 1.9% |
48 | 2 | 1.9% |
42 | 2 | 1.9% |
180 | 1 | 0.9% |
105 | 1 | 0.9% |
65 | 1 | 0.9% |
98 | 1 | 0.9% |
Other values (17) | 17 | 15.7% |
(Missing) | 68 |
Value | Count | Frequency (%) |
1 | 5 | |
2 | 5 | |
4 | 2 | 1.9% |
6 | 3 | |
9 | 1 | 0.9% |
18 | 1 | 0.9% |
35 | 1 | 0.9% |
40 | 1 | 0.9% |
42 | 2 | 1.9% |
48 | 2 | 1.9% |
Value | Count | Frequency (%) |
409 | 1 | |
315 | 1 | |
300 | 1 | |
240 | 1 | |
180 | 1 | |
171 | 1 | |
144 | 1 | |
142 | 1 | |
105 | 1 | |
98 | 1 |
가구수
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | 5.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 996.0 B |
<NA> | |
---|---|
1 | 6 |
2 | 2 |
42 | 2 |
7 | 2 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.6574074 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.9% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 95 | |
1 | 6 | 5.6% |
2 | 2 | 1.9% |
42 | 2 | 1.9% |
7 | 2 | 1.9% |
3 | 1 | 0.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 95 | |
1 | 6 | 5.6% |
2 | 2 | 1.9% |
42 | 2 | 1.9% |
7 | 2 | 1.9% |
3 | 1 | 0.9% |
시공업체전화번호
Text
MISSING
 
Distinct | 60 |
---|---|
Distinct (%) | 89.6% |
Missing | 41 |
Missing (%) | 38.0% |
Memory size | 996.0 B |
Length
Max length | 13 |
---|---|
Median length | 12 |
Mean length | 11.686567 |
Min length | 11 |
Characters and Unicode
Total characters | 783 |
---|---|
Distinct characters | 12 |
Distinct categories | 3 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 53 ? |
---|---|
Unique (%) | 79.1% |
Sample
1st row | 031-5177-7300 |
---|---|
2nd row | 02-471-7708 |
3rd row | 032-425-7679 |
4th row | 02-980-8000 |
5th row | 02-355-4458 |
Value | Count | Frequency (%) |
02-2213-0691 | 3 | 4.5% |
02-2134-1956 | 2 | 3.0% |
031-406-4354 | 2 | 3.0% |
02-746-4401 | 2 | 3.0% |
02-831-3680 | 2 | 3.0% |
02-967-3001 | 2 | 3.0% |
02-888-8885 | 2 | 3.0% |
031-942-1665 | 1 | 1.5% |
062-380-2693 | 1 | 1.5% |
070-7436-7931 | 1 | 1.5% |
Other values (49) | 49 |
Most occurring characters
Value | Count | Frequency (%) |
- | 134 | |
0 | 130 | |
2 | 94 | |
1 | 86 | |
4 | 59 | |
3 | 56 | |
6 | 54 | |
5 | 48 | 6.1% |
9 | 41 | 5.2% |
8 | 40 | 5.1% |
Other values (2) | 41 | 5.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 647 | |
Dash Punctuation | 134 | 17.1% |
Space Separator | 2 | 0.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 130 | |
2 | 94 | |
1 | 86 | |
4 | 59 | |
3 | 56 | |
6 | 54 | |
5 | 48 | 7.4% |
9 | 41 | 6.3% |
8 | 40 | 6.2% |
7 | 39 | 6.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 134 |
Space Separator
Value | Count | Frequency (%) |
2 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 783 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 134 | |
0 | 130 | |
2 | 94 | |
1 | 86 | |
4 | 59 | |
3 | 56 | |
6 | 54 | |
5 | 48 | 6.1% |
9 | 41 | 5.2% |
8 | 40 | 5.1% |
Other values (2) | 41 | 5.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 783 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 134 | |
0 | 130 | |
2 | 94 | |
1 | 86 | |
4 | 59 | |
3 | 56 | |
6 | 54 | |
5 | 48 | 6.1% |
9 | 41 | 5.2% |
8 | 40 | 5.1% |
Other values (2) | 41 | 5.2% |
시공업체명
Text
MISSING
 
Distinct | 81 |
---|---|
Distinct (%) | 78.6% |
Missing | 5 |
Missing (%) | 4.6% |
Memory size | 996.0 B |
Length
Max length | 13 |
---|---|
Median length | 11 |
Mean length | 8.5728155 |
Min length | 5 |
Characters and Unicode
Total characters | 883 |
---|---|
Distinct characters | 135 |
Distinct categories | 5 ? |
Distinct scripts | 2 ? |
Distinct blocks | 3 ? |
Unique
Unique | 65 ? |
---|---|
Unique (%) | 63.1% |
Sample
1st row | 에이치와이종합건설(주) |
---|---|
2nd row | 주식회사큰대종합건설 |
3rd row | (주)세호종합건설 |
4th row | (주)담을건설 |
5th row | (주)큰대종합건설 |
Value | Count | Frequency (%) |
주식회사 | 8 | 7.1% |
주)이도인건설 | 5 | 4.5% |
미래종합중기 | 3 | 2.7% |
현대건설(주 | 3 | 2.7% |
민산건설중기 | 3 | 2.7% |
주식회사큰대종합건설 | 2 | 1.8% |
주)큰대종합건설 | 2 | 1.8% |
신림종합건설(주 | 2 | 1.8% |
주)건희건설 | 2 | 1.8% |
주)블루버드건설 | 2 | 1.8% |
Other values (73) | 80 |
Most occurring characters
Value | Count | Frequency (%) |
주 | 94 | 10.6% |
( | 79 | 8.9% |
) | 79 | 8.9% |
건 | 78 | 8.8% |
설 | 73 | 8.3% |
종 | 34 | 3.9% |
합 | 34 | 3.9% |
이 | 20 | 2.3% |
사 | 15 | 1.7% |
회 | 15 | 1.7% |
Other values (125) | 362 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 714 | |
Open Punctuation | 79 | 8.9% |
Close Punctuation | 79 | 8.9% |
Space Separator | 9 | 1.0% |
Other Symbol | 2 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
주 | 94 | 13.2% |
건 | 78 | 10.9% |
설 | 73 | 10.2% |
종 | 34 | 4.8% |
합 | 34 | 4.8% |
이 | 20 | 2.8% |
사 | 15 | 2.1% |
회 | 15 | 2.1% |
식 | 15 | 2.1% |
에 | 13 | 1.8% |
Other values (121) | 323 |
Open Punctuation
Value | Count | Frequency (%) |
( | 79 |
Close Punctuation
Value | Count | Frequency (%) |
) | 79 |
Space Separator
Value | Count | Frequency (%) |
9 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 716 | |
Common | 167 | 18.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
주 | 94 | 13.1% |
건 | 78 | 10.9% |
설 | 73 | 10.2% |
종 | 34 | 4.7% |
합 | 34 | 4.7% |
이 | 20 | 2.8% |
사 | 15 | 2.1% |
회 | 15 | 2.1% |
식 | 15 | 2.1% |
에 | 13 | 1.8% |
Other values (122) | 325 |
Common
Value | Count | Frequency (%) |
( | 79 | |
) | 79 | |
9 | 5.4% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 714 | |
ASCII | 167 | 18.9% |
None | 2 | 0.2% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
주 | 94 | 13.2% |
건 | 78 | 10.9% |
설 | 73 | 10.2% |
종 | 34 | 4.8% |
합 | 34 | 4.8% |
이 | 20 | 2.8% |
사 | 15 | 2.1% |
회 | 15 | 2.1% |
식 | 15 | 2.1% |
에 | 13 | 1.8% |
Other values (121) | 323 |
ASCII
Value | Count | Frequency (%) |
( | 79 | |
) | 79 | |
9 | 5.4% |
None
Value | Count | Frequency (%) |
㈜ | 2 |
설계사무소전화번호
Text
MISSING
 
Distinct | 45 |
---|---|
Distinct (%) | 80.4% |
Missing | 52 |
Missing (%) | 48.1% |
Memory size | 996.0 B |
Length
Max length | 13 |
---|---|
Median length | 12 |
Mean length | 11.642857 |
Min length | 11 |
Characters and Unicode
Total characters | 652 |
---|---|
Distinct characters | 12 |
Distinct categories | 3 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 36 ? |
---|---|
Unique (%) | 64.3% |
Sample
1st row | 02-3481-5222 |
---|---|
2nd row | 031-284-3361 |
3rd row | 053-813-7920 |
4th row | 02-3143-7716 |
5th row | 02-971-7087 |
Value | Count | Frequency (%) |
02-953-2225 | 3 | 5.4% |
02-3443-4050 | 3 | 5.4% |
02-953-2226 | 2 | 3.6% |
02-928-4484 | 2 | 3.6% |
031-284-3361 | 2 | 3.6% |
02-3295-4600 | 2 | 3.6% |
02-2273-5842 | 2 | 3.6% |
02-6959-4783 | 2 | 3.6% |
02-3461-2595 | 2 | 3.6% |
02-3494-3323 | 2 | 3.6% |
Other values (34) | 34 |
Most occurring characters
Value | Count | Frequency (%) |
- | 112 | |
2 | 103 | |
0 | 90 | |
3 | 73 | |
4 | 59 | |
5 | 48 | |
6 | 39 | 6.0% |
1 | 38 | 5.8% |
7 | 32 | 4.9% |
9 | 31 | 4.8% |
Other values (2) | 27 | 4.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 538 | |
Dash Punctuation | 112 | 17.2% |
Space Separator | 2 | 0.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 103 | |
0 | 90 | |
3 | 73 | |
4 | 59 | |
5 | 48 | |
6 | 39 | 7.2% |
1 | 38 | 7.1% |
7 | 32 | 5.9% |
9 | 31 | 5.8% |
8 | 25 | 4.6% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 112 |
Space Separator
Value | Count | Frequency (%) |
2 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 652 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 112 | |
2 | 103 | |
0 | 90 | |
3 | 73 | |
4 | 59 | |
5 | 48 | |
6 | 39 | 6.0% |
1 | 38 | 5.8% |
7 | 32 | 4.9% |
9 | 31 | 4.8% |
Other values (2) | 27 | 4.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 652 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 112 | |
2 | 103 | |
0 | 90 | |
3 | 73 | |
4 | 59 | |
5 | 48 | |
6 | 39 | 6.0% |
1 | 38 | 5.8% |
7 | 32 | 4.9% |
9 | 31 | 4.8% |
Other values (2) | 27 | 4.1% |
설계사무소명
Text
MISSING
 
Distinct | 72 |
---|---|
Distinct (%) | 82.8% |
Missing | 21 |
Missing (%) | 19.4% |
Memory size | 996.0 B |
Length
Max length | 18 |
---|---|
Median length | 16 |
Mean length | 11.436782 |
Min length | 8 |
Characters and Unicode
Total characters | 995 |
---|---|
Distinct characters | 121 |
Distinct categories | 7 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 60 ? |
---|---|
Unique (%) | 69.0% |
Sample
1st row | 건축사사무소 루연 |
---|---|
2nd row | 아름다운건축사사무소 |
3rd row | 미성건축사사무소 |
4th row | 핍스알엔디 건축사사무소 |
5th row | (주)영화건축사사무소 |
Value | Count | Frequency (%) |
건축사사무소 | 18 | 15.1% |
주식회사 | 10 | 8.4% |
주)기하건축사사무소 | 4 | 3.4% |
주)국전건축사사무소 | 3 | 2.5% |
주)건축사사무소 | 2 | 1.7% |
상진엔지니어링건축사사무소 | 2 | 1.7% |
주)희성건축사사무소 | 2 | 1.7% |
주)기안건축사사무소 | 2 | 1.7% |
수플러스 | 2 | 1.7% |
건축사사무소한다스 | 2 | 1.7% |
Other values (65) | 72 |
Most occurring characters
Value | Count | Frequency (%) |
사 | 185 | |
소 | 90 | 9.0% |
축 | 89 | 8.9% |
건 | 89 | 8.9% |
무 | 87 | 8.7% |
주 | 50 | 5.0% |
( | 39 | 3.9% |
) | 39 | 3.9% |
34 | 3.4% | |
이 | 13 | 1.3% |
Other values (111) | 280 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 877 | |
Open Punctuation | 39 | 3.9% |
Close Punctuation | 39 | 3.9% |
Space Separator | 34 | 3.4% |
Uppercase Letter | 4 | 0.4% |
Other Punctuation | 1 | 0.1% |
Lowercase Letter | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
사 | 185 | |
소 | 90 | 10.3% |
축 | 89 | 10.1% |
건 | 89 | 10.1% |
무 | 87 | 9.9% |
주 | 50 | 5.7% |
이 | 13 | 1.5% |
회 | 11 | 1.3% |
식 | 11 | 1.3% |
종 | 10 | 1.1% |
Other values (102) | 242 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 1 | |
G | 1 | |
C | 1 | |
J | 1 |
Open Punctuation
Value | Count | Frequency (%) |
( | 39 |
Close Punctuation
Value | Count | Frequency (%) |
) | 39 |
Space Separator
Value | Count | Frequency (%) |
34 |
Other Punctuation
Value | Count | Frequency (%) |
& | 1 |
Lowercase Letter
Value | Count | Frequency (%) |
m | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 877 | |
Common | 113 | 11.4% |
Latin | 5 | 0.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
사 | 185 | |
소 | 90 | 10.3% |
축 | 89 | 10.1% |
건 | 89 | 10.1% |
무 | 87 | 9.9% |
주 | 50 | 5.7% |
이 | 13 | 1.5% |
회 | 11 | 1.3% |
식 | 11 | 1.3% |
종 | 10 | 1.1% |
Other values (102) | 242 |
Latin
Value | Count | Frequency (%) |
A | 1 | |
G | 1 | |
C | 1 | |
m | 1 | |
J | 1 |
Common
Value | Count | Frequency (%) |
( | 39 | |
) | 39 | |
34 | ||
& | 1 | 0.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 877 | |
ASCII | 118 | 11.9% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
사 | 185 | |
소 | 90 | 10.3% |
축 | 89 | 10.1% |
건 | 89 | 10.1% |
무 | 87 | 9.9% |
주 | 50 | 5.7% |
이 | 13 | 1.5% |
회 | 11 | 1.3% |
식 | 11 | 1.3% |
종 | 10 | 1.1% |
Other values (102) | 242 |
ASCII
Value | Count | Frequency (%) |
( | 39 | |
) | 39 | |
34 | ||
A | 1 | 0.8% |
& | 1 | 0.8% |
G | 1 | 0.8% |
C | 1 | 0.8% |
m | 1 | 0.8% |
J | 1 | 0.8% |
감리사무소전화번호
Text
MISSING
 
Distinct | 51 |
---|---|
Distinct (%) | 87.9% |
Missing | 50 |
Missing (%) | 46.3% |
Memory size | 996.0 B |
Length
Max length | 13 |
---|---|
Median length | 11 |
Mean length | 11.465517 |
Min length | 11 |
Characters and Unicode
Total characters | 665 |
---|---|
Distinct characters | 12 |
Distinct categories | 3 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 46 ? |
---|---|
Unique (%) | 79.3% |
Sample
1st row | 02-3481-5222 |
---|---|
2nd row | 02-945-9564 |
3rd row | 02-3436-3404 |
4th row | 02-953-2226 |
5th row | 02-549-6693 |
Value | Count | Frequency (%) |
02-3443-4050 | 3 | 5.2% |
02-953-2225 | 3 | 5.2% |
02-953-2226 | 2 | 3.4% |
02-6959-4783 | 2 | 3.4% |
02-458-4181 | 2 | 3.4% |
02-2273-5842 | 1 | 1.7% |
02-3481-5222 | 1 | 1.7% |
02-964-9777 | 1 | 1.7% |
070-5214-0030 | 1 | 1.7% |
02-582-0369 | 1 | 1.7% |
Other values (41) | 41 |
Most occurring characters
Value | Count | Frequency (%) |
- | 116 | |
2 | 109 | |
0 | 103 | |
4 | 57 | |
3 | 55 | |
5 | 55 | |
9 | 41 | 6.2% |
1 | 36 | 5.4% |
6 | 34 | 5.1% |
7 | 33 | 5.0% |
Other values (2) | 26 | 3.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 548 | |
Dash Punctuation | 116 | 17.4% |
Space Separator | 1 | 0.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 109 | |
0 | 103 | |
4 | 57 | |
3 | 55 | |
5 | 55 | |
9 | 41 | 7.5% |
1 | 36 | 6.6% |
6 | 34 | 6.2% |
7 | 33 | 6.0% |
8 | 25 | 4.6% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 116 |
Space Separator
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 665 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 116 | |
2 | 109 | |
0 | 103 | |
4 | 57 | |
3 | 55 | |
5 | 55 | |
9 | 41 | 6.2% |
1 | 36 | 5.4% |
6 | 34 | 5.1% |
7 | 33 | 5.0% |
Other values (2) | 26 | 3.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 665 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 116 | |
2 | 109 | |
0 | 103 | |
4 | 57 | |
3 | 55 | |
5 | 55 | |
9 | 41 | 6.2% |
1 | 36 | 5.4% |
6 | 34 | 5.1% |
7 | 33 | 5.0% |
Other values (2) | 26 | 3.9% |
감리사무소명
Text
MISSING
 
Distinct | 75 |
---|---|
Distinct (%) | 86.2% |
Missing | 21 |
Missing (%) | 19.4% |
Memory size | 996.0 B |
Length
Max length | 19 |
---|---|
Median length | 16 |
Mean length | 11.45977 |
Min length | 8 |
Characters and Unicode
Total characters | 997 |
---|---|
Distinct characters | 122 |
Distinct categories | 6 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 67 ? |
---|---|
Unique (%) | 77.0% |
Sample
1st row | 건축사사무소 루연 |
---|---|
2nd row | 예마루종합건축사사무소 |
3rd row | 열린건축사사무소 |
4th row | 정원건축사사무소 |
5th row | (주)영화건축사사무소 |
Value | Count | Frequency (%) |
건축사사무소 | 22 | 17.7% |
주 | 5 | 4.0% |
주식회사 | 5 | 4.0% |
상진엔지니어링건축사사무소 | 5 | 4.0% |
주)기하건축사사무소 | 4 | 3.2% |
종합건축사사무소 | 3 | 2.4% |
주)국전건축사사무소 | 3 | 2.4% |
주)영화건축사사무소 | 2 | 1.6% |
문 | 2 | 1.6% |
아성 | 2 | 1.6% |
Other values (70) | 71 |
Most occurring characters
Value | Count | Frequency (%) |
사 | 178 | |
건 | 89 | 8.9% |
축 | 88 | 8.8% |
소 | 87 | 8.7% |
무 | 86 | 8.6% |
주 | 48 | 4.8% |
) | 42 | 4.2% |
( | 41 | 4.1% |
38 | 3.8% | |
종 | 12 | 1.2% |
Other values (112) | 288 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 873 | |
Close Punctuation | 42 | 4.2% |
Open Punctuation | 41 | 4.1% |
Space Separator | 38 | 3.8% |
Uppercase Letter | 2 | 0.2% |
Lowercase Letter | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
사 | 178 | |
건 | 89 | 10.2% |
축 | 88 | 10.1% |
소 | 87 | 10.0% |
무 | 86 | 9.9% |
주 | 48 | 5.5% |
종 | 12 | 1.4% |
합 | 12 | 1.4% |
이 | 11 | 1.3% |
니 | 8 | 0.9% |
Other values (106) | 254 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 1 | |
J | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 42 |
Open Punctuation
Value | Count | Frequency (%) |
( | 41 |
Space Separator
Value | Count | Frequency (%) |
38 |
Lowercase Letter
Value | Count | Frequency (%) |
m | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 873 | |
Common | 121 | 12.1% |
Latin | 3 | 0.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
사 | 178 | |
건 | 89 | 10.2% |
축 | 88 | 10.1% |
소 | 87 | 10.0% |
무 | 86 | 9.9% |
주 | 48 | 5.5% |
종 | 12 | 1.4% |
합 | 12 | 1.4% |
이 | 11 | 1.3% |
니 | 8 | 0.9% |
Other values (106) | 254 |
Common
Value | Count | Frequency (%) |
) | 42 | |
( | 41 | |
38 |
Latin
Value | Count | Frequency (%) |
C | 1 | |
m | 1 | |
J | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 873 | |
ASCII | 124 | 12.4% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
사 | 178 | |
건 | 89 | 10.2% |
축 | 88 | 10.1% |
소 | 87 | 10.0% |
무 | 86 | 9.9% |
주 | 48 | 5.5% |
종 | 12 | 1.4% |
합 | 12 | 1.4% |
이 | 11 | 1.3% |
니 | 8 | 0.9% |
Other values (106) | 254 |
ASCII
Value | Count | Frequency (%) |
) | 42 | |
( | 41 | |
38 | ||
C | 1 | 0.8% |
m | 1 | 0.8% |
J | 1 | 0.8% |
데이터기준일자
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 996.0 B |
Minimum | 2024-03-25 00:00:00 |
---|---|
Maximum | 2024-03-25 00:00:00 |
건축구분 | 연면적(m2)_증축연면적(m2) | 허가(신고)일자 | 착공일자 | 최대지상층수 | 최대지하층수 | 주용도 | 부속용도 | 세대수 | 호수 | 가구수 | 시공업체전화번호 | 시공업체명 | 설계사무소전화번호 | 설계사무소명 | 감리사무소전화번호 | 감리사무소명 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
건축구분 | 1.000 | 0.000 | 0.960 | 0.982 | 0.226 | 0.127 | 0.550 | 1.000 | 0.803 | 0.000 | 0.383 | 1.000 | 0.974 | NaN | 1.000 | 1.000 | 1.000 |
연면적(m2)_증축연면적(m2) | 0.000 | 1.000 | 1.000 | 0.000 | 0.818 | 0.731 | 0.000 | 0.000 | 0.919 | 0.841 | NaN | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
허가(신고)일자 | 0.960 | 1.000 | 1.000 | 0.990 | 0.996 | 0.993 | 0.984 | 0.989 | 0.987 | 1.000 | 1.000 | 0.969 | 0.991 | 0.909 | 0.976 | 0.921 | 0.986 |
착공일자 | 0.982 | 0.000 | 0.990 | 1.000 | 0.344 | 0.916 | 0.952 | 0.971 | 1.000 | 0.000 | 1.000 | 0.997 | 0.995 | 0.992 | 0.990 | 0.930 | 0.905 |
최대지상층수 | 0.226 | 0.818 | 0.996 | 0.344 | 1.000 | 0.649 | 0.534 | 0.586 | 0.833 | 0.608 | 0.369 | 0.000 | 0.000 | 0.000 | 0.000 | 0.935 | 0.926 |
최대지하층수 | 0.127 | 0.731 | 0.993 | 0.916 | 0.649 | 1.000 | 0.419 | 0.883 | 0.937 | 0.729 | 0.234 | 0.000 | 0.793 | 0.000 | 0.000 | 0.000 | 0.000 |
주용도 | 0.550 | 0.000 | 0.984 | 0.952 | 0.534 | 0.419 | 1.000 | 0.950 | 0.000 | 0.000 | 0.442 | 0.988 | 0.826 | 0.878 | 0.938 | 0.766 | 0.923 |
부속용도 | 1.000 | 0.000 | 0.989 | 0.971 | 0.586 | 0.883 | 0.950 | 1.000 | 0.992 | 0.000 | 1.000 | 0.970 | 0.960 | 0.962 | 0.967 | 0.930 | 0.981 |
세대수 | 0.803 | 0.919 | 0.987 | 1.000 | 0.833 | 0.937 | 0.000 | 0.992 | 1.000 | 0.678 | NaN | 1.000 | 0.991 | 0.816 | 0.958 | 0.897 | 0.673 |
호수 | 0.000 | 0.841 | 1.000 | 0.000 | 0.608 | 0.729 | 0.000 | 0.000 | 0.678 | 1.000 | 0.000 | 0.714 | 0.931 | 0.746 | 0.790 | 1.000 | 1.000 |
가구수 | 0.383 | NaN | 1.000 | 1.000 | 0.369 | 0.234 | 0.442 | 1.000 | NaN | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
시공업체전화번호 | 1.000 | 0.000 | 0.969 | 0.997 | 0.000 | 0.000 | 0.988 | 0.970 | 1.000 | 0.714 | 1.000 | 1.000 | 1.000 | 1.000 | 0.998 | 0.962 | 0.943 |
시공업체명 | 0.974 | 0.000 | 0.991 | 0.995 | 0.000 | 0.793 | 0.826 | 0.960 | 0.991 | 0.931 | 1.000 | 1.000 | 1.000 | 0.996 | 0.997 | 0.982 | 0.991 |
설계사무소전화번호 | NaN | 0.000 | 0.909 | 0.992 | 0.000 | 0.000 | 0.878 | 0.962 | 0.816 | 0.746 | 1.000 | 1.000 | 0.996 | 1.000 | 0.999 | 0.994 | 0.998 |
설계사무소명 | 1.000 | 0.000 | 0.976 | 0.990 | 0.000 | 0.000 | 0.938 | 0.967 | 0.958 | 0.790 | 1.000 | 0.998 | 0.997 | 0.999 | 1.000 | 0.996 | 0.998 |
감리사무소전화번호 | 1.000 | 0.000 | 0.921 | 0.930 | 0.935 | 0.000 | 0.766 | 0.930 | 0.897 | 1.000 | 1.000 | 0.962 | 0.982 | 0.994 | 0.996 | 1.000 | 1.000 |
감리사무소명 | 1.000 | 0.000 | 0.986 | 0.905 | 0.926 | 0.000 | 0.923 | 0.981 | 0.673 | 1.000 | 1.000 | 0.943 | 0.991 | 0.998 | 0.998 | 1.000 | 1.000 |
가구수 | 주용도 | 건축구분 | |
---|---|---|---|
가구수 | 1.000 | 0.309 | 0.369 |
주용도 | 0.309 | 1.000 | 0.331 |
건축구분 | 0.369 | 0.331 | 1.000 |
연면적(m2)_증축연면적(m2) | 최대지상층수 | 최대지하층수 | 세대수 | 호수 | 건축구분 | 주용도 | 가구수 | |
---|---|---|---|---|---|---|---|---|
연면적(m2)_증축연면적(m2) | 1.000 | 0.878 | 0.764 | 0.844 | 0.768 | 0.000 | 0.000 | 1.000 |
최대지상층수 | 0.878 | 1.000 | 0.688 | 0.645 | 0.721 | 0.143 | 0.286 | 0.211 |
최대지하층수 | 0.764 | 0.688 | 1.000 | 0.592 | 0.701 | 0.072 | 0.185 | 0.000 |
세대수 | 0.844 | 0.645 | 0.592 | 1.000 | 0.460 | 0.574 | 0.000 | 1.000 |
호수 | 0.768 | 0.721 | 0.701 | 0.460 | 1.000 | 0.000 | 0.000 | 0.000 |
건축구분 | 0.000 | 0.143 | 0.072 | 0.574 | 0.000 | 1.000 | 0.331 | 0.369 |
주용도 | 0.000 | 0.286 | 0.185 | 0.000 | 0.000 | 0.331 | 1.000 | 0.309 |
가구수 | 1.000 | 0.211 | 0.000 | 1.000 | 0.000 | 0.369 | 0.309 | 1.000 |
건축구분 | 대지위치 | 연면적(m2)_증축연면적(m2) | 허가(신고)일자 | 착공일자 | 최대지상층수 | 최대지하층수 | 주용도 | 부속용도 | 세대수 | 호수 | 가구수 | 시공업체전화번호 | 시공업체명 | 설계사무소전화번호 | 설계사무소명 | 감리사무소전화번호 | 감리사무소명 | 데이터기준일자 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 신축 | 서울특별시 동대문구 답십리동 21-78 | 517.82 | 2022-06-15 | 2022-10-25 | 4 | 1 | 제2종근린생활시설 | 사무소 | <NA> | <NA> | <NA> | 031-5177-7300 | 에이치와이종합건설(주) | 02-3481-5222 | 건축사사무소 루연 | 02-3481-5222 | 건축사사무소 루연 | 2024-03-25 |
1 | 신축 | 서울특별시 동대문구 답십리동 22-3 외2필지 | 625.4 | 2022-10-18 | 2023-02-28 | 6 | 1 | 공동주택 | 도시형생활주택(단지형다세대주택) | 16 | 2 | <NA> | <NA> | 주식회사큰대종합건설 | 031-284-3361 | 아름다운건축사사무소 | 02-945-9564 | 예마루종합건축사사무소 | 2024-03-25 |
2 | 증축 | 서울특별시 동대문구 답십리동 252-13 | 1189.79 | 2020-11-09 | 2020-11-20 | 5 | 1 | 제1종근린생활시설 | 소매점,의원,사무소 | <NA> | <NA> | <NA> | <NA> | (주)세호종합건설 | <NA> | 미성건축사사무소 | 02-3436-3404 | 열린건축사사무소 | 2024-03-25 |
3 | 신축 | 서울특별시 동대문구 답십리동 266-1 | 3147.96 | 2022-04-14 | 2022-08-03 | 9 | 2 | 제1종근린생활시설 | 제2종근린생활시설 | <NA> | <NA> | <NA> | 02-471-7708 | (주)담을건설 | <NA> | 핍스알엔디 건축사사무소 | <NA> | 정원건축사사무소 | 2024-03-25 |
4 | 신축 | 서울특별시 동대문구 답십리동 467-12 외1필지 | 420.87 | 2022-04-08 | 2022-06-24 | 5 | 0 | 공동주택 | 도시형생활주택(단지형다세대주택) | 12 | <NA> | <NA> | 032-425-7679 | (주)큰대종합건설 | <NA> | (주)영화건축사사무소 | <NA> | (주)영화건축사사무소 | 2024-03-25 |
5 | 신축 | 서울특별시 동대문구 답십리동 482-5 | 628.83 | 2022-06-30 | 2022-09-23 | 6 | 1 | 공동주택 | 도시형생활주택(단지형다세대주택,근린생활시설) | 16 | 2 | <NA> | <NA> | (주)큰대종합건설 | <NA> | (주)영화건축사사무소 | <NA> | (주)영화건축사사무소 | 2024-03-25 |
6 | 증축 | 서울특별시 동대문구 답십리동 483-8 | 209.43 | 2021-09-29 | 2021-11-02 | 3 | 1 | 제2종근린생활시설 | 일반음식점.사무소 | <NA> | <NA> | <NA> | <NA> | 몰드재팬주식회사 | <NA> | 천지인종합건축사사무소(주) | <NA> | 천지인종합건축사사무소(주) | 2024-03-25 |
7 | 신축 | 서울특별시 동대문구 답십리동 487-2 | 2108.69 | 2020-12-24 | 2022-01-03 | 13 | 1 | 업무시설 | 오피스텔 | 10 | 70 | <NA> | 02-980-8000 | (주)반석 | 053-813-7920 | 건축사사무소 한영 | 02-953-2226 | (주) 기하건축사사무소 | 2024-03-25 |
8 | 신축 | 서울특별시 동대문구 답십리동 487-31 | 121.36 | 2022-04-12 | 2022-06-16 | 3 | 0 | 제2종근린생활시설 | 일반음식점 | <NA> | <NA> | 1 | 02-355-4458 | (주)시티종합건설 | 02-3143-7716 | (주)건축사사무소 모도건축 | <NA> | 담우건축사사무소 | 2024-03-25 |
9 | 신축 | 서울특별시 동대문구 답십리동 493-1 외2필지 | 18500.55 | 2021-06-14 | 2021-11-15 | 20 | 6 | 업무시설 | 오피스텔 | <NA> | 144 | <NA> | <NA> | 신영건설(주) | <NA> | 건축사사무소아라그룹 | 02-549-6693 | (주)건축사사무소아라그룹 | 2024-03-25 |
건축구분 | 대지위치 | 연면적(m2)_증축연면적(m2) | 허가(신고)일자 | 착공일자 | 최대지상층수 | 최대지하층수 | 주용도 | 부속용도 | 세대수 | 호수 | 가구수 | 시공업체전화번호 | 시공업체명 | 설계사무소전화번호 | 설계사무소명 | 감리사무소전화번호 | 감리사무소명 | 데이터기준일자 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
98 | 신축 | 서울특별시 동대문구 휘경동 187-5 외1필지 | 2940.21 | 2021-02-03 | 2021-08-10 | 14 | 1 | 업무시설 | 오피스텔 | <NA> | 98 | <NA> | 031-406-4354 | 신림종합건설(주) | <NA> | (주)동심원건축사사무소 | 02-3452-0597 | (주)동심원건축사사무소 | 2024-03-25 |
99 | 주택사업승인(신축) | 서울특별시 동대문구 휘경동 244-1 | 16659.87 | 2020-12-30 | 2022-07-01 | 19 | 3 | 공동주택 | 청년주택 | 349 | <NA> | <NA> | <NA> | (주)우방 | <NA> | (주)A&G건축사사무소 | <NA> | <NA> | 2024-03-25 |
100 | 신축 | 서울특별시 동대문구 휘경동 267-86 | 4621.78 | 2022-05-11 | 2022-08-16 | 17 | 1 | 업무시설 | 오피스텔 | 26 | 2 | <NA> | <NA> | <NA> | 02-6959-4783 | 건축사사무소 아성 | 02-6959-4783 | 건축사사무소 아성 | 2024-03-25 |
101 | 해체신고 | 서울특별시 동대문구 용두동 184-2 | 32.5 | 2023-03-09 | 2023-03-09 | 1 | 0 | 제2종근린생활시설 | <NA> | <NA> | <NA> | <NA> | <NA> | 미래종합중기 | <NA> | <NA> | <NA> | <NA> | 2024-03-25 |
102 | 해체허가 | 서울특별시 동대문구 전농동 213-28 | 258.3 | 2023-03-09 | 2023-03-22 | 4 | 0 | 제2종근린생활시설 | <NA> | <NA> | <NA> | <NA> | <NA> | 정인건설산업 주식회사 | <NA> | <NA> | <NA> | <NA> | 2024-03-25 |
103 | 신축 | 서울특별시 동대문구 휘경동 293-155 외1필지 | 515.96 | 2021-09-28 | 2022-04-20 | 5 | 0 | 단독주택 | 다중주택(12호), 근린생활시설(일반음식점) | <NA> | <NA> | 1 | 02-2213-0691 | (주)이도인건설 | 02-928-4484 | 건축사사무소 하늘 | 02-933-5021 | 이현 건축사사무소 | 2024-03-25 |
104 | 신축 | 서울특별시 동대문구 휘경동 294-29 | 303.92 | 2022-03-25 | 2022-05-31 | 4 | 0 | 단독주택 | 근린생활시설 및 다중주택 | <NA> | <NA> | 1 | 02-6925-1260 | (주)하우올리씨앤디 | 02-333-1220 | (주)필아트건축사사무소 | 02-333-1220 | (주)필아트건축사사무소 | 2024-03-25 |
105 | 신축 | 서울특별시 동대문구 휘경동 43-121 외1필지 | 131.1 | 2018-11-06 | 2020-02-10 | 2 | 0 | 제2종근린생활시설 | 사무소 | <NA> | <NA> | <NA> | 02-3295-1500 | 태우종합건설㈜ | 02-953-2225 | (주)기하건축사사무소 | 02-953-2225 | (주)기하건축사사무소 | 2024-03-25 |
106 | 해체신고 | 서울특별시 동대문구 휘경동 43-143 | 146.47 | <NA> | 중지 | 2 | 0 | 단독주택 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2024-03-25 |
107 | 신축 | 서울특별시 동대문구 휘경동 75-6 | 3905.27 | 2022-08-26 | 2022-10-18 | 10 | 2 | 공동주택 | 업무시설 | 28 | 65 | <NA> | 02-351-4081 | (주)에스하임월드 | 032-464-6775 | 건축사사무소한다스 | 02-925-8800 | 새빛건축사사무소 | 2024-03-25 |