Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 78 |
Missing cells | 244 |
Missing cells (%) | 34.8% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 5.6 KiB |
Average record size in memory | 73.7 B |
Variable types
Unsupported | 3 |
---|---|
Text | 4 |
Categorical | 1 |
Boolean | 1 |
Dataset
Description | 공시지가 토지특성 2016 |
---|---|
Author | 국토교통부 |
URL | https://www.vworld.kr/dtmk/dtmk_ntads_s002.do?dsId=30536 |
Unnamed: 8 has constant value "" | Constant |
Unnamed: 5 is highly imbalanced (68.7%) | Imbalance |
Unnamed: 1 has 6 (7.7%) missing values | Missing |
Unnamed: 2 has 1 (1.3%) missing values | Missing |
Unnamed: 4 has 5 (6.4%) missing values | Missing |
Unnamed: 5 has 7 (9.0%) missing values | Missing |
Unnamed: 6 has 73 (93.6%) missing values | Missing |
Unnamed: 7 has 75 (96.2%) missing values | Missing |
Unnamed: 8 has 77 (98.7%) missing values | Missing |
테이블정의서 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2024-04-18 00:53:06.880295 |
---|---|
Analysis finished | 2024-04-18 00:53:08.359536 |
Duration | 1.48 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
테이블정의서
Unsupported
REJECTED
  UNSUPPORTED
 
Missing | 0 |
---|---|
Missing (%) | 0.0% |
Memory size | 756.0 B |
Unnamed: 1
Text
MISSING
 
Distinct | 72 |
---|---|
Distinct (%) | 100.0% |
Missing | 6 |
Missing (%) | 7.7% |
Memory size | 756.0 B |
Value | Count | Frequency (%) |
컬럼id | 1 | 1.4% |
stdmt | 1 | 1.4% |
calc_jiga | 1 | 1.4% |
prev_jiga | 1 | 1.4% |
py_jiga | 1 | 1.4% |
handwk_yn | 1 | 1.4% |
lclw_step_cd | 1 | 1.4% |
lclw_mthd_cd | 1 | 1.4% |
harm_wast | 1 | 1.4% |
harm_rail | 1 | 1.4% |
Other values (62) | 62 |
Most occurring characters
Value | Count | Frequency (%) |
_ | 77 | 12.2% |
A | 59 | 9.3% |
R | 41 | 6.5% |
C | 37 | 5.9% |
D | 34 | 5.4% |
N | 34 | 5.4% |
E | 32 | 5.1% |
T | 32 | 5.1% |
S | 30 | 4.7% |
P | 27 | 4.3% |
Other values (23) | 229 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 532 | |
Connector Punctuation | 77 | 12.2% |
Decimal Number | 21 | 3.3% |
Other Letter | 2 | 0.3% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
A | 59 | 11.1% |
R | 41 | 7.7% |
C | 37 | 7.0% |
D | 34 | 6.4% |
N | 34 | 6.4% |
E | 32 | 6.0% |
T | 32 | 6.0% |
S | 30 | 5.6% |
P | 27 | 5.1% |
L | 25 | 4.7% |
Other values (16) | 181 |
Decimal Number
Value | Count | Frequency (%) |
2 | 12 | |
1 | 7 | |
3 | 1 | 4.8% |
4 | 1 | 4.8% |
Other Letter
Value | Count | Frequency (%) |
럼 | 1 | |
컬 | 1 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 77 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 532 | |
Common | 98 | 15.5% |
Hangul | 2 | 0.3% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
A | 59 | 11.1% |
R | 41 | 7.7% |
C | 37 | 7.0% |
D | 34 | 6.4% |
N | 34 | 6.4% |
E | 32 | 6.0% |
T | 32 | 6.0% |
S | 30 | 5.6% |
P | 27 | 5.1% |
L | 25 | 4.7% |
Other values (16) | 181 |
Common
Value | Count | Frequency (%) |
_ | 77 | |
2 | 12 | 12.2% |
1 | 7 | 7.1% |
3 | 1 | 1.0% |
4 | 1 | 1.0% |
Hangul
Value | Count | Frequency (%) |
럼 | 1 | |
컬 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 630 | |
Hangul | 2 | 0.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
_ | 77 | 12.2% |
A | 59 | 9.4% |
R | 41 | 6.5% |
C | 37 | 5.9% |
D | 34 | 5.4% |
N | 34 | 5.4% |
E | 32 | 5.1% |
T | 32 | 5.1% |
S | 30 | 4.8% |
P | 27 | 4.3% |
Other values (21) | 227 |
Hangul
Value | Count | Frequency (%) |
럼 | 1 | |
컬 | 1 |
Unnamed: 2
Text
MISSING
 
Distinct | 77 |
---|---|
Distinct (%) | 100.0% |
Missing | 1 |
Missing (%) | 1.3% |
Memory size | 756.0 B |
Value | Count | Frequency (%) |
김민호 | 1 | 1.2% |
토지구분 | 1 | 1.2% |
산정지가 | 1 | 1.2% |
종전지가 | 1 | 1.2% |
전년지가 | 1 | 1.2% |
수작업여부 | 1 | 1.2% |
대규모개발사업단계코드 | 1 | 1.2% |
대규모개발사업방식코드 | 1 | 1.2% |
3년전지가 | 1 | 1.2% |
유해철도 | 1 | 1.2% |
Other values (70) | 70 |
Most occurring characters
Value | Count | Frequency (%) |
지 | 38 | 9.0% |
구 | 17 | 4.0% |
가 | 13 | 3.1% |
2 | 13 | 3.1% |
드 | 12 | 2.8% |
역 | 12 | 2.8% |
코 | 12 | 2.8% |
도 | 11 | 2.6% |
토 | 10 | 2.4% |
제 | 10 | 2.4% |
Other values (120) | 274 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 385 | |
Decimal Number | 25 | 5.9% |
Uppercase Letter | 8 | 1.9% |
Space Separator | 3 | 0.7% |
Other Punctuation | 1 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
지 | 38 | 9.9% |
구 | 17 | 4.4% |
가 | 13 | 3.4% |
드 | 12 | 3.1% |
역 | 12 | 3.1% |
코 | 12 | 3.1% |
도 | 11 | 2.9% |
토 | 10 | 2.6% |
제 | 10 | 2.6% |
시 | 9 | 2.3% |
Other values (105) | 241 |
Uppercase Letter
Value | Count | Frequency (%) |
T | 2 | |
N | 1 | |
P | 1 | |
M | 1 | |
D | 1 | |
S | 1 | |
U | 1 |
Decimal Number
Value | Count | Frequency (%) |
2 | 13 | |
1 | 8 | |
4 | 1 | 4.0% |
0 | 1 | 4.0% |
6 | 1 | 4.0% |
3 | 1 | 4.0% |
Space Separator
Value | Count | Frequency (%) |
3 |
Other Punctuation
Value | Count | Frequency (%) |
, | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 385 | |
Common | 29 | 6.9% |
Latin | 8 | 1.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
지 | 38 | 9.9% |
구 | 17 | 4.4% |
가 | 13 | 3.4% |
드 | 12 | 3.1% |
역 | 12 | 3.1% |
코 | 12 | 3.1% |
도 | 11 | 2.9% |
토 | 10 | 2.6% |
제 | 10 | 2.6% |
시 | 9 | 2.3% |
Other values (105) | 241 |
Common
Value | Count | Frequency (%) |
2 | 13 | |
1 | 8 | |
3 | 10.3% | |
, | 1 | 3.4% |
4 | 1 | 3.4% |
0 | 1 | 3.4% |
6 | 1 | 3.4% |
3 | 1 | 3.4% |
Latin
Value | Count | Frequency (%) |
T | 2 | |
N | 1 | |
P | 1 | |
M | 1 | |
D | 1 | |
S | 1 | |
U | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 385 | |
ASCII | 37 | 8.8% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
지 | 38 | 9.9% |
구 | 17 | 4.4% |
가 | 13 | 3.4% |
드 | 12 | 3.1% |
역 | 12 | 3.1% |
코 | 12 | 3.1% |
도 | 11 | 2.9% |
토 | 10 | 2.6% |
제 | 10 | 2.6% |
시 | 9 | 2.3% |
Other values (105) | 241 |
ASCII
Value | Count | Frequency (%) |
2 | 13 | |
1 | 8 | |
3 | 8.1% | |
T | 2 | 5.4% |
N | 1 | 2.7% |
P | 1 | 2.7% |
, | 1 | 2.7% |
M | 1 | 2.7% |
D | 1 | 2.7% |
S | 1 | 2.7% |
Other values (5) | 5 | 13.5% |
Unnamed: 3
Categorical
Distinct | 6 |
---|---|
Distinct (%) | 7.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 756.0 B |
CHAR | |
---|---|
NUMBER | |
VARCHAR2 | |
<NA> | |
테이블ID | 1 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 5.1923077 |
Min length | 2 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 2.6% |
Sample
1st row | <NA> |
---|---|
2nd row | 테이블ID |
3rd row | <NA> |
4th row | 타입 |
5th row | CHAR |
Common Values
Value | Count | Frequency (%) |
CHAR | 37 | |
NUMBER | 21 | |
VARCHAR2 | 13 | 16.7% |
<NA> | 5 | 6.4% |
테이블ID | 1 | 1.3% |
타입 | 1 | 1.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
char | 37 | |
number | 21 | |
varchar2 | 13 | 16.7% |
na | 5 | 6.4% |
테이블id | 1 | 1.3% |
타입 | 1 | 1.3% |
Unnamed: 4
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 5 |
---|---|
Missing (%) | 6.4% |
Memory size | 756.0 B |
Unnamed: 5
Boolean
IMBALANCE
  MISSING
 
Distinct | 2 |
---|---|
Distinct (%) | 2.8% |
Missing | 7 |
Missing (%) | 9.0% |
Memory size | 288.0 B |
True | |
---|---|
False | 4 |
(Missing) |
Value | Count | Frequency (%) |
True | 67 | |
False | 4 | 5.1% |
(Missing) | 7 | 9.0% |
Unnamed: 6
Text
MISSING
 
Distinct | 5 |
---|---|
Distinct (%) | 100.0% |
Missing | 73 |
Missing (%) | 93.6% |
Memory size | 756.0 B |
Value | Count | Frequency (%) |
작성일 | 1 | |
테이블명 | 1 | |
pk/fk | 1 | |
pk1 | 1 | |
pk2 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
K | 4 | |
P | 3 | |
작 | 1 | 5.6% |
성 | 1 | 5.6% |
일 | 1 | 5.6% |
테 | 1 | 5.6% |
이 | 1 | 5.6% |
블 | 1 | 5.6% |
명 | 1 | 5.6% |
/ | 1 | 5.6% |
Other values (3) | 3 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 8 | |
Other Letter | 7 | |
Decimal Number | 2 | 11.1% |
Other Punctuation | 1 | 5.6% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
작 | 1 | |
성 | 1 | |
일 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 |
Uppercase Letter
Value | Count | Frequency (%) |
K | 4 | |
P | 3 | |
F | 1 | 12.5% |
Decimal Number
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 8 | |
Hangul | 7 | |
Common | 3 | 16.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
작 | 1 | |
성 | 1 | |
일 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 |
Latin
Value | Count | Frequency (%) |
K | 4 | |
P | 3 | |
F | 1 | 12.5% |
Common
Value | Count | Frequency (%) |
/ | 1 | |
1 | 1 | |
2 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 11 | |
Hangul | 7 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
K | 4 | |
P | 3 | |
/ | 1 | 9.1% |
F | 1 | 9.1% |
1 | 1 | 9.1% |
2 | 1 | 9.1% |
Hangul
Value | Count | Frequency (%) |
작 | 1 | |
성 | 1 | |
일 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 |
Unnamed: 7
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 75 |
---|---|
Missing (%) | 96.2% |
Memory size | 756.0 B |
Unnamed: 8
Text
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 100.0% |
Missing | 77 |
Missing (%) | 98.7% |
Memory size | 756.0 B |
Value | Count | Frequency (%) |
참조테이블명/비고 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
참 | 1 | |
조 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 | |
/ | 1 | |
비 | 1 | |
고 | 1 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 8 | |
Other Punctuation | 1 | 11.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
참 | 1 | |
조 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 | |
비 | 1 | |
고 | 1 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 8 | |
Common | 1 | 11.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
참 | 1 | |
조 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 | |
비 | 1 | |
고 | 1 |
Common
Value | Count | Frequency (%) |
/ | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 8 | |
ASCII | 1 | 11.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
참 | 1 | |
조 | 1 | |
테 | 1 | |
이 | 1 | |
블 | 1 | |
명 | 1 | |
비 | 1 | |
고 | 1 |
ASCII
Value | Count | Frequency (%) |
/ | 1 |
Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 5 | Unnamed: 6 | |
---|---|---|---|---|---|
Unnamed: 1 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
Unnamed: 2 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
Unnamed: 3 | 1.000 | 1.000 | 1.000 | 0.071 | 1.000 |
Unnamed: 5 | 1.000 | 1.000 | 0.071 | 1.000 | NaN |
Unnamed: 6 | 1.000 | 1.000 | 1.000 | NaN | 1.000 |
Unnamed: 3 | Unnamed: 5 | |
---|---|---|
Unnamed: 3 | 1.000 | 0.115 |
Unnamed: 5 | 0.115 | 1.000 |
Unnamed: 3 | Unnamed: 5 | |
---|---|---|
Unnamed: 3 | 1.000 | 0.115 |
Unnamed: 5 | 0.115 | 1.000 |
테이블정의서 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | |
---|---|---|---|---|---|---|---|---|---|
0 | 작성자 | <NA> | 김민호 | <NA> | NaN | <NA> | 작성일 | 2017-05-10 00:00:00 | <NA> |
1 | 주제영역명 | <NA> | 가격업무 | 테이블ID | APMM_NV_LAND_2016 | <NA> | 테이블명 | 공시지가 토지특성 2016 | <NA> |
2 | 테이블설명 | <NA> | 공시지가 토지특성 2016 | <NA> | NaN | <NA> | <NA> | NaN | <NA> |
3 | No | 컬럼ID | 컬럼명 | 타입 | 길이(Byte) | <NA> | PK/FK | Default | 참조테이블명/비고 |
4 | 1 | STDMT | 기준월 | CHAR | 2 | N | PK1 | NaN | <NA> |
5 | 2 | PNU | 토지코드 | VARCHAR2 | 19 | N | PK2 | NaN | <NA> |
6 | 3 | LAND_SEQNO | 토지일련번호 | NUMBER | 6,0 | N | <NA> | NaN | <NA> |
7 | 4 | SGG_CD | 시군구코드 | CHAR | 5 | Y | <NA> | NaN | <NA> |
8 | 5 | LAND_LOC_CD | 토지소재지코드 | CHAR | 5 | Y | <NA> | NaN | <NA> |
9 | 6 | LAND_GBN | 토지구분 | CHAR | 1 | Y | <NA> | NaN | <NA> |
테이블정의서 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | |
---|---|---|---|---|---|---|---|---|---|
68 | 65 | CNFER_CD | 확인자코드 | VARCHAR2 | 3 | Y | <NA> | NaN | <NA> |
69 | 66 | VRFY_GBN | 검증구분 | CHAR | 2 | Y | <NA> | NaN | <NA> |
70 | 67 | PY_VRFY_GBN | 전년검증구분 | CHAR | 2 | Y | <NA> | NaN | <NA> |
71 | 68 | LAND_MOV_YMD | 토지이동일자 | VARCHAR2 | 8 | N | <NA> | NaN | <NA> |
72 | 69 | LAND_MOV_RSN_CD | 토지이동사유코드 | VARCHAR2 | 5 | Y | <NA> | NaN | <NA> |
73 | 70 | HOUSE_PANN_YN | 주택공시여부 | CHAR | 1 | Y | <NA> | NaN | <NA> |
74 | 71 | COL_ADM_SECT_CD | 원천시군구코드 | VARCHAR2 | 5 | Y | <NA> | NaN | <NA> |
75 | 인덱스명 | <NA> | 인덱스키 | <NA> | NaN | <NA> | <NA> | NaN | <NA> |
76 | APMM_NV_LAND_2016_INX1 | <NA> | STDMT, PNU | <NA> | NaN | <NA> | <NA> | NaN | <NA> |
77 | 업무규칙 | <NA> | <NA> | <NA> | NaN | <NA> | <NA> | NaN | <NA> |