Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 1000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 1 |
Duplicate rows (%) | 0.1% |
Total size in memory | 32.4 KiB |
Average record size in memory | 33.1 B |
Variable types
Categorical | 3 |
---|---|
Text | 1 |
Dataset
Description | 한국주택금융공사의 모기지연계보증요율참고에 대한 정보이며 기준일자 법인명 등록사번이 포함된 데이터를 제공합니다. |
---|---|
Author | 한국주택금융공사 |
URL | https://www.data.go.kr/data/15049771/fileData.do |
Dataset has 1 (0.1%) duplicate rows | Duplicates |
등록사번 is highly overall correlated with 기준일자 and 1 other fields | High correlation |
등록일시 is highly overall correlated with 기준일자 and 1 other fields | High correlation |
기준일자 is highly overall correlated with 등록사번 and 1 other fields | High correlation |
등록사번 is highly imbalanced (75.8%) | Imbalance |
Reproduction
Analysis started | 2023-12-12 06:49:10.707362 |
---|---|
Analysis finished | 2023-12-12 06:49:11.136061 |
Duration | 0.43 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
기준일자
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
2020-08-01 | |
---|---|
2019-08-01 | |
2018-08-01 | |
2017-08-01 |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020-08-01 |
---|---|
2nd row | 2020-08-01 |
3rd row | 2020-08-01 |
4th row | 2020-08-01 |
5th row | 2020-08-01 |
Common Values
Value | Count | Frequency (%) |
2020-08-01 | 320 | |
2019-08-01 | 320 | |
2018-08-01 | 320 | |
2017-08-01 | 40 | 4.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020-08-01 | 320 | |
2019-08-01 | 320 | |
2018-08-01 | 320 | |
2017-08-01 | 40 | 4.0% |
법인명
Text
Distinct | 395 |
---|---|
Distinct (%) | 39.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
Value | Count | Frequency (%) |
주식회사 | 16 | 1.5% |
개발공사 | 15 | 1.4% |
도시개발공사 | 12 | 1.1% |
지방공사 | 6 | 0.6% |
유)한백종합건설 | 4 | 0.4% |
주)신 | 4 | 0.4% |
주)대창건설 | 4 | 0.4% |
피엔지건설(주 | 4 | 0.4% |
동성건설(주 | 4 | 0.4% |
정우개발(주 | 4 | 0.4% |
Other values (402) | 1010 |
Most occurring characters
Value | Count | Frequency (%) |
주 | 968 | 12.4% |
( | 918 | 11.7% |
) | 918 | 11.7% |
건 | 632 | 8.1% |
설 | 561 | 7.2% |
합 | 140 | 1.8% |
종 | 140 | 1.8% |
140 | 1.8% | |
업 | 111 | 1.4% |
산 | 111 | 1.4% |
Other values (201) | 3196 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 5853 | |
Open Punctuation | 918 | 11.7% |
Close Punctuation | 918 | 11.7% |
Space Separator | 140 | 1.8% |
Uppercase Letter | 6 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
주 | 968 | 16.5% |
건 | 632 | 10.8% |
설 | 561 | 9.6% |
합 | 140 | 2.4% |
종 | 140 | 2.4% |
업 | 111 | 1.9% |
산 | 111 | 1.9% |
대 | 105 | 1.8% |
동 | 103 | 1.8% |
이 | 102 | 1.7% |
Other values (196) | 2880 |
Uppercase Letter
Value | Count | Frequency (%) |
H | 3 | |
S | 3 |
Open Punctuation
Value | Count | Frequency (%) |
( | 918 |
Close Punctuation
Value | Count | Frequency (%) |
) | 918 |
Space Separator
Value | Count | Frequency (%) |
140 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 5853 | |
Common | 1976 | 25.2% |
Latin | 6 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
주 | 968 | 16.5% |
건 | 632 | 10.8% |
설 | 561 | 9.6% |
합 | 140 | 2.4% |
종 | 140 | 2.4% |
업 | 111 | 1.9% |
산 | 111 | 1.9% |
대 | 105 | 1.8% |
동 | 103 | 1.8% |
이 | 102 | 1.7% |
Other values (196) | 2880 |
Common
Value | Count | Frequency (%) |
( | 918 | |
) | 918 | |
140 | 7.1% |
Latin
Value | Count | Frequency (%) |
H | 3 | |
S | 3 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 5853 | |
ASCII | 1982 | 25.3% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
주 | 968 | 16.5% |
건 | 632 | 10.8% |
설 | 561 | 9.6% |
합 | 140 | 2.4% |
종 | 140 | 2.4% |
업 | 111 | 1.9% |
산 | 111 | 1.9% |
대 | 105 | 1.8% |
동 | 103 | 1.8% |
이 | 102 | 1.7% |
Other values (196) | 2880 |
ASCII
Value | Count | Frequency (%) |
( | 918 | |
) | 918 | |
140 | 7.1% | |
H | 3 | 0.2% |
S | 3 | 0.2% |
등록사번
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
1505 | |
---|---|
1249 | 40 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1505 |
---|---|
2nd row | 1505 |
3rd row | 1505 |
4th row | 1505 |
5th row | 1505 |
Common Values
Value | Count | Frequency (%) |
1505 | 960 | |
1249 | 40 | 4.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1505 | 960 | |
1249 | 40 | 4.0% |
등록일시
Categorical
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
2020-07-31 17:04 | |
---|---|
2019-08-01 10:17 | |
2018-07-31 17:47 | |
2018-07-31 17:49 | |
2017-08-07 13:19 |
Length
Max length | 16 |
---|---|
Median length | 16 |
Mean length | 16 |
Min length | 16 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020-07-31 17:04 |
---|---|
2nd row | 2020-07-31 17:04 |
3rd row | 2020-07-31 17:04 |
4th row | 2020-07-31 17:04 |
5th row | 2020-07-31 17:04 |
Common Values
Value | Count | Frequency (%) |
2020-07-31 17:04 | 320 | |
2019-08-01 10:17 | 320 | |
2018-07-31 17:47 | 200 | |
2018-07-31 17:49 | 100 | 10.0% |
2017-08-07 13:19 | 40 | 4.0% |
2018-07-31 17:50 | 20 | 2.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020-07-31 | 320 | |
17:04 | 320 | |
2019-08-01 | 320 | |
10:17 | 320 | |
2018-07-31 | 320 | |
17:47 | 200 | |
17:49 | 100 | 5.0% |
2017-08-07 | 40 | 2.0% |
13:19 | 40 | 2.0% |
17:50 | 20 | 1.0% |
기준일자 | 등록사번 | 등록일시 | |
---|---|---|---|
기준일자 | 1.000 | 1.000 | 1.000 |
등록사번 | 1.000 | 1.000 | 1.000 |
등록일시 | 1.000 | 1.000 | 1.000 |
등록사번 | 등록일시 | 기준일자 | |
---|---|---|---|
등록사번 | 1.000 | 0.998 | 0.999 |
등록일시 | 0.998 | 1.000 | 0.999 |
기준일자 | 0.999 | 0.999 | 1.000 |
기준일자 | 등록사번 | 등록일시 | |
---|---|---|---|
기준일자 | 1.000 | 0.999 | 0.999 |
등록사번 | 0.999 | 1.000 | 0.998 |
등록일시 | 0.999 | 0.998 | 1.000 |
기준일자 | 법인명 | 등록사번 | 등록일시 | |
---|---|---|---|---|
0 | 2020-08-01 | 동신건설(주) | 1505 | 2020-07-31 17:04 |
1 | 2020-08-01 | 신해공영(주) | 1505 | 2020-07-31 17:04 |
2 | 2020-08-01 | 래미안건설(주) | 1505 | 2020-07-31 17:04 |
3 | 2020-08-01 | (주)대양산업건설 | 1505 | 2020-07-31 17:04 |
4 | 2020-08-01 | 동호건설(주) | 1505 | 2020-07-31 17:04 |
5 | 2020-08-01 | 은일종합건설(주) | 1505 | 2020-07-31 17:04 |
6 | 2020-08-01 | (주)광양종합건설 | 1505 | 2020-07-31 17:04 |
7 | 2020-08-01 | (주)두손건설 | 1505 | 2020-07-31 17:04 |
8 | 2020-08-01 | 서림종합건설(주) | 1505 | 2020-07-31 17:04 |
9 | 2020-08-01 | 화성종합건설(주) | 1505 | 2020-07-31 17:04 |
기준일자 | 법인명 | 등록사번 | 등록일시 | |
---|---|---|---|---|
990 | 2017-08-01 | 고운시티아이(주) | 1249 | 2017-08-07 13:19 |
991 | 2017-08-01 | (주)삼희종합건설 | 1249 | 2017-08-07 13:19 |
992 | 2017-08-01 | (주)문영엔지니어링 | 1249 | 2017-08-07 13:19 |
993 | 2017-08-01 | 정상종합건설(주) | 1249 | 2017-08-07 13:19 |
994 | 2017-08-01 | (유)한백종합건설 | 1249 | 2017-08-07 13:19 |
995 | 2017-08-01 | (주)신화종합건설 | 1249 | 2017-08-07 13:19 |
996 | 2017-08-01 | (주)송학건설 | 1249 | 2017-08-07 13:19 |
997 | 2017-08-01 | (주)대창건설 | 1249 | 2017-08-07 13:19 |
998 | 2017-08-01 | 신안종합건설 | 1249 | 2017-08-07 13:19 |
999 | 2017-08-01 | (주)대건 | 1249 | 2017-08-07 13:19 |
Most frequently occurring
기준일자 | 법인명 | 등록사번 | 등록일시 | # duplicates | |
---|---|---|---|---|---|
0 | 2020-08-01 | 우경건설(주) | 1505 | 2020-07-31 17:04 | 2 |