Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 21 |
Missing cells | 89 |
Missing cells (%) | 47.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.6 KiB |
Average record size in memory | 79.3 B |
Variable types
Unsupported | 4 |
---|---|
Text | 4 |
Categorical | 1 |
Dataset
Description | 개별주택 가격정보 |
---|---|
Author | 국토교통부 |
URL | https://www.vworld.kr/dtmk/dtmk_ntads_s002.do?dsId=30520 |
Unnamed: 8 has constant value "" | Constant |
Unnamed: 1 has 6 (28.6%) missing values | Missing |
Unnamed: 2 has 1 (4.8%) missing values | Missing |
Unnamed: 4 has 5 (23.8%) missing values | Missing |
Unnamed: 5 has 21 (100.0%) missing values | Missing |
Unnamed: 6 has 18 (85.7%) missing values | Missing |
Unnamed: 7 has 18 (85.7%) missing values | Missing |
Unnamed: 8 has 20 (95.2%) missing values | Missing |
테이블정의서 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2024-04-16 02:28:54.680732 |
---|---|
Analysis finished | 2024-04-16 02:28:56.502381 |
Duration | 1.82 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
테이블정의서
Unsupported
REJECTED
  UNSUPPORTED
 
Missing | 0 |
---|---|
Missing (%) | 0.0% |
Memory size | 300.0 B |
Unnamed: 1
Text
MISSING
 
Distinct | 15 |
---|---|
Distinct (%) | 100.0% |
Missing | 6 |
Missing (%) | 28.6% |
Memory size | 300.0 B |
Value | Count | Frequency (%) |
컬럼id | 1 | 6.7% |
pnu | 1 | 6.7% |
bild_regstr_unqno | 1 | 6.7% |
dong_no | 1 | 6.7% |
pann_year | 1 | 6.7% |
stdmt | 1 | 6.7% |
potvale | 1 | 6.7% |
pjji_yn | 1 | 6.7% |
pann_gbn | 1 | 6.7% |
lndbuk_area | 1 | 6.7% |
Other values (5) | 5 |
Most occurring characters
Value | Count | Frequency (%) |
A | 15 | |
N | 14 | 10.9% |
_ | 14 | 10.9% |
R | 9 | 7.0% |
E | 9 | 7.0% |
D | 8 | 6.2% |
P | 7 | 5.4% |
C | 6 | 4.7% |
L | 6 | 4.7% |
O | 5 | 3.9% |
Other values (15) | 36 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 113 | |
Connector Punctuation | 14 | 10.9% |
Other Letter | 2 | 1.6% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
A | 15 | |
N | 14 | |
R | 9 | 8.0% |
E | 9 | 8.0% |
D | 8 | 7.1% |
P | 7 | 6.2% |
C | 6 | 5.3% |
L | 6 | 5.3% |
O | 5 | 4.4% |
T | 5 | 4.4% |
Other values (12) | 29 |
Other Letter
Value | Count | Frequency (%) |
컬 | 1 | |
럼 | 1 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 14 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 113 | |
Common | 14 | 10.9% |
Hangul | 2 | 1.6% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
A | 15 | |
N | 14 | |
R | 9 | 8.0% |
E | 9 | 8.0% |
D | 8 | 7.1% |
P | 7 | 6.2% |
C | 6 | 5.3% |
L | 6 | 5.3% |
O | 5 | 4.4% |
T | 5 | 4.4% |
Other values (12) | 29 |
Hangul
Value | Count | Frequency (%) |
컬 | 1 | |
럼 | 1 |
Common
Value | Count | Frequency (%) |
_ | 14 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 127 | |
Hangul | 2 | 1.6% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
A | 15 | |
N | 14 | |
_ | 14 | |
R | 9 | 7.1% |
E | 9 | 7.1% |
D | 8 | 6.3% |
P | 7 | 5.5% |
C | 6 | 4.7% |
L | 6 | 4.7% |
O | 5 | 3.9% |
Other values (13) | 34 |
Hangul
Value | Count | Frequency (%) |
컬 | 1 | |
럼 | 1 |
Unnamed: 2
Text
MISSING
 
Distinct | 20 |
---|---|
Distinct (%) | 100.0% |
Missing | 1 |
Missing (%) | 4.8% |
Memory size | 300.0 B |
Value | Count | Frequency (%) |
허재민 | 1 | 4.0% |
토지대장면적 | 1 | 4.0% |
pann_year | 1 | 4.0% |
dong_no | 1 | 4.0% |
bild_regstr_unqno | 1 | 4.0% |
pnu | 1 | 4.0% |
인덱스키 | 1 | 4.0% |
공시일자 | 1 | 4.0% |
원천시군구코드 | 1 | 4.0% |
주거면적 | 1 | 4.0% |
Other values (15) | 15 |
Most occurring characters
Value | Count | Frequency (%) |
N | 7 | 4.9% |
시 | 5 | 3.5% |
5 | 3.5% | |
_ | 4 | 2.8% |
지 | 4 | 2.8% |
, | 4 | 2.8% |
공 | 4 | 2.8% |
면 | 4 | 2.8% |
적 | 4 | 2.8% |
격 | 4 | 2.8% |
Other values (66) | 97 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 92 | |
Uppercase Letter | 37 | |
Space Separator | 5 | 3.5% |
Connector Punctuation | 4 | 2.8% |
Other Punctuation | 4 | 2.8% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
시 | 5 | 5.4% |
지 | 4 | 4.3% |
공 | 4 | 4.3% |
면 | 4 | 4.3% |
적 | 4 | 4.3% |
격 | 4 | 4.3% |
가 | 4 | 4.3% |
주 | 3 | 3.3% |
대 | 3 | 3.3% |
호 | 2 | 2.2% |
Other values (46) | 55 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 7 | |
T | 3 | 8.1% |
O | 3 | 8.1% |
R | 3 | 8.1% |
D | 3 | 8.1% |
U | 2 | 5.4% |
P | 2 | 5.4% |
E | 2 | 5.4% |
G | 2 | 5.4% |
S | 2 | 5.4% |
Other values (7) | 8 |
Space Separator
Value | Count | Frequency (%) |
5 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 4 |
Other Punctuation
Value | Count | Frequency (%) |
, | 4 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 92 | |
Latin | 37 | |
Common | 13 | 9.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
시 | 5 | 5.4% |
지 | 4 | 4.3% |
공 | 4 | 4.3% |
면 | 4 | 4.3% |
적 | 4 | 4.3% |
격 | 4 | 4.3% |
가 | 4 | 4.3% |
주 | 3 | 3.3% |
대 | 3 | 3.3% |
호 | 2 | 2.2% |
Other values (46) | 55 |
Latin
Value | Count | Frequency (%) |
N | 7 | |
T | 3 | 8.1% |
O | 3 | 8.1% |
R | 3 | 8.1% |
D | 3 | 8.1% |
U | 2 | 5.4% |
P | 2 | 5.4% |
E | 2 | 5.4% |
G | 2 | 5.4% |
S | 2 | 5.4% |
Other values (7) | 8 |
Common
Value | Count | Frequency (%) |
5 | ||
_ | 4 | |
, | 4 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 92 | |
ASCII | 50 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
N | 7 | |
5 | 10.0% | |
_ | 4 | 8.0% |
, | 4 | 8.0% |
T | 3 | 6.0% |
O | 3 | 6.0% |
R | 3 | 6.0% |
D | 3 | 6.0% |
U | 2 | 4.0% |
P | 2 | 4.0% |
Other values (10) | 14 |
Hangul
Value | Count | Frequency (%) |
시 | 5 | 5.4% |
지 | 4 | 4.3% |
공 | 4 | 4.3% |
면 | 4 | 4.3% |
적 | 4 | 4.3% |
격 | 4 | 4.3% |
가 | 4 | 4.3% |
주 | 3 | 3.3% |
대 | 3 | 3.3% |
호 | 2 | 2.2% |
Other values (46) | 55 |
Unnamed: 3
Categorical
Distinct | 6 |
---|---|
Distinct (%) | 28.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 300.0 B |
VARCHAR2 | |
---|---|
<NA> | |
NUMBER | |
CHAR | |
테이블ID |
Length
Max length | 8 |
---|---|
Median length | 6 |
Mean length | 5.5714286 |
Min length | 2 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 9.5% |
Sample
1st row | <NA> |
---|---|
2nd row | 테이블ID |
3rd row | <NA> |
4th row | 타입 |
5th row | VARCHAR2 |
Common Values
Value | Count | Frequency (%) |
VARCHAR2 | 6 | |
<NA> | 5 | |
NUMBER | 5 | |
CHAR | 3 | |
테이블ID | 1 | 4.8% |
타입 | 1 | 4.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
varchar2 | 6 | |
na | 5 | |
number | 5 | |
char | 3 | |
테이블id | 1 | 4.8% |
타입 | 1 | 4.8% |
Unnamed: 4
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 5 |
---|---|
Missing (%) | 23.8% |
Memory size | 300.0 B |
Unnamed: 5
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 21 |
---|---|
Missing (%) | 100.0% |
Memory size | 321.0 B |
Unnamed: 6
Text
MISSING
 
Distinct | 3 |
---|---|
Distinct (%) | 100.0% |
Missing | 18 |
Missing (%) | 85.7% |
Memory size | 300.0 B |
Value | Count | Frequency (%) |
작성일 | 1 | |
테이블명 | 1 | |
pk/fk | 1 |
Most occurring characters
Value | Count | Frequency (%) |
K | 2 | |
작 | 1 | |
성 | 1 | |
일 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 | |
P | 1 | |
/ | 1 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 7 | |
Uppercase Letter | 4 | |
Other Punctuation | 1 | 8.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
작 | 1 | |
성 | 1 | |
일 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 |
Uppercase Letter
Value | Count | Frequency (%) |
K | 2 | |
P | 1 | |
F | 1 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 7 | |
Latin | 4 | |
Common | 1 | 8.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
작 | 1 | |
성 | 1 | |
일 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 |
Latin
Value | Count | Frequency (%) |
K | 2 | |
P | 1 | |
F | 1 |
Common
Value | Count | Frequency (%) |
/ | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 7 | |
ASCII | 5 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
K | 2 | |
P | 1 | |
/ | 1 | |
F | 1 |
Hangul
Value | Count | Frequency (%) |
작 | 1 | |
성 | 1 | |
일 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 |
Unnamed: 7
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 18 |
---|---|
Missing (%) | 85.7% |
Memory size | 300.0 B |
Unnamed: 8
Text
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 100.0% |
Missing | 20 |
Missing (%) | 95.2% |
Memory size | 300.0 B |
Value | Count | Frequency (%) |
참조테이블명/비고 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
참 | 1 | |
조 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 | |
/ | 1 | |
비 | 1 | |
고 | 1 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 8 | |
Other Punctuation | 1 | 11.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
참 | 1 | |
조 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 | |
비 | 1 | |
고 | 1 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 8 | |
Common | 1 | 11.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
참 | 1 | |
조 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 | |
비 | 1 | |
고 | 1 |
Common
Value | Count | Frequency (%) |
/ | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 8 | |
ASCII | 1 | 11.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
참 | 1 | |
조 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 | |
비 | 1 | |
고 | 1 |
ASCII
Value | Count | Frequency (%) |
/ | 1 |
Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 6 | |
---|---|---|---|---|
Unnamed: 1 | 1.000 | 1.000 | 1.000 | NaN |
Unnamed: 2 | 1.000 | 1.000 | 1.000 | 1.000 |
Unnamed: 3 | 1.000 | 1.000 | 1.000 | 0.000 |
Unnamed: 6 | NaN | 1.000 | 0.000 | 1.000 |
테이블정의서 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | |
---|---|---|---|---|---|---|---|---|---|
0 | 작성자 | <NA> | 허재민 | <NA> | NaN | <NA> | 작성일 | 2016-01-19 00:00:00 | <NA> |
1 | 주제영역명 | <NA> | 가격업무 | 테이블ID | APMM_HP_PRC_MNG | <NA> | 테이블명 | 개별주택 가격정보 | <NA> |
2 | 테이블설명 | <NA> | 개별주택 가격정보 | <NA> | NaN | <NA> | <NA> | NaN | <NA> |
3 | No | 컬럼ID | 컬럼명 | 타입 | 길이(Byte) | <NA> | PK/FK | Default | 참조테이블명/비고 |
4 | 1 | PNU | 토지코드 | VARCHAR2 | 19 | <NA> | <NA> | NaN | <NA> |
5 | 2 | BILD_REGSTR_UNQNO | 건축물대장고유번호 | VARCHAR2 | 19 | <NA> | <NA> | NaN | <NA> |
6 | 3 | DONG_NO | 동번호 | VARCHAR2 | 5 | <NA> | <NA> | NaN | <NA> |
7 | 4 | PANN_YEAR | 공시년도 | VARCHAR2 | 4 | <NA> | <NA> | NaN | <NA> |
8 | 5 | STDMT | 기준월 | CHAR | 2 | <NA> | <NA> | NaN | <NA> |
9 | 6 | POTVALE | 공시가격 | NUMBER | 13 | <NA> | <NA> | NaN | <NA> |
테이블정의서 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | |
---|---|---|---|---|---|---|---|---|---|
11 | 8 | PANN_GBN | 공시구분 | CHAR | 1 | <NA> | <NA> | NaN | <NA> |
12 | 9 | LNDBUK_AREA | 토지대장면적 | NUMBER | 13,2 | <NA> | <NA> | NaN | <NA> |
13 | 10 | CALC_LAREA | 산정대지면적 | NUMBER | 13,2 | <NA> | <NA> | NaN | <NA> |
14 | 11 | HPRC_GAREA | 주택가격연면적 | NUMBER | 13,2 | <NA> | <NA> | NaN | <NA> |
15 | 12 | RES_AREA | 주거면적 | NUMBER | 13,2 | <NA> | <NA> | NaN | <NA> |
16 | 13 | COL_ADM_SECT_CD | 원천시군구코드 | VARCHAR2 | 5 | <NA> | <NA> | NaN | <NA> |
17 | 14 | PANN_YMD | 공시일자 | VARCHAR2 | 8 | <NA> | <NA> | NaN | <NA> |
18 | 인덱스명 | <NA> | 인덱스키 | <NA> | NaN | <NA> | <NA> | NaN | <NA> |
19 | APMM_HP_PRC_MNG_INX1 | <NA> | PNU, BILD_REGSTR_UNQNO, DONG_NO, PANN_YEAR, STDMT | <NA> | NaN | <NA> | <NA> | NaN | <NA> |
20 | 업무규칙 | <NA> | <NA> | <NA> | NaN | <NA> | <NA> | NaN | <NA> |