Overview

Dataset statistics

Number of variables5
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 KiB
Average record size in memory45.4 B

Variable types

Categorical3
DateTime1
Text1

Dataset

Description샘플 데이터
Author한국신용데이터
URLhttps://bigdata-region.kr/#/dataset/587504f7-1373-430e-884c-d72b79798570

Alerts

년월 has constant value ""Constant
평균매입금액 has constant value ""Constant
유형명 is highly overall correlated with 업종명High correlation
업종명 is highly overall correlated with 유형명High correlation

Reproduction

Analysis started2023-12-10 14:25:28.045471
Analysis finished2023-12-10 14:25:28.328741
Duration0.28 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

유형명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
지역
17 
업종
지역x업종

Length

Max length5
Median length2
Mean length2.4
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row업종
2nd row업종
3rd row업종
4th row업종
5th row업종

Common Values

ValueCountFrequency (%)
지역 17
56.7%
업종 9
30.0%
지역x업종 4
 
13.3%

Length

2023-12-10T23:25:28.394015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:25:28.487412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지역 17
56.7%
업종 9
30.0%
지역x업종 4
 
13.3%

년월
Date

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
Minimum2021-10-01 00:00:00
Maximum2021-10-01 00:00:00
2023-12-10T23:25:28.566030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:25:28.647021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

업종명
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
전체
17 
건설업
제조업
기타
유통업
 
1
Other values (5)

Length

Max length8
Median length2
Mean length2.6666667
Min length2

Unique

Unique6 ?
Unique (%)20.0%

Sample

1st row유통업
2nd row건설업
3rd row제조업
4th row부동산업
5th row서비스업

Common Values

ValueCountFrequency (%)
전체 17
56.7%
건설업 3
 
10.0%
제조업 2
 
6.7%
기타 2
 
6.7%
유통업 1
 
3.3%
부동산업 1
 
3.3%
서비스업 1
 
3.3%
농업/임업/어업 1
 
3.3%
외식업 1
 
3.3%
정보통신업 1
 
3.3%

Length

2023-12-10T23:25:28.752454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:25:28.883615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전체 17
56.7%
건설업 3
 
10.0%
제조업 2
 
6.7%
기타 2
 
6.7%
유통업 1
 
3.3%
부동산업 1
 
3.3%
서비스업 1
 
3.3%
농업/임업/어업 1
 
3.3%
외식업 1
 
3.3%
정보통신업 1
 
3.3%
Distinct18
Distinct (%)60.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:25:29.069197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length3.7333333
Min length2

Characters and Unicode

Total characters112
Distinct characters32
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)46.7%

Sample

1st row전국
2nd row전국
3rd row전국
4th row전국
5th row전국
ValueCountFrequency (%)
전국 9
30.0%
경상남도 3
 
10.0%
전라북도 2
 
6.7%
강원도 2
 
6.7%
경상북도 1
 
3.3%
충청남도 1
 
3.3%
서울특별시 1
 
3.3%
경기도 1
 
3.3%
부산광역시 1
 
3.3%
충청북도 1
 
3.3%
Other values (8) 8
26.7%
2023-12-10T23:25:29.355113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13
 
11.6%
13
 
11.6%
9
 
8.0%
8
 
7.1%
7
 
6.2%
6
 
5.4%
5
 
4.5%
5
 
4.5%
4
 
3.6%
4
 
3.6%
Other values (22) 38
33.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 112
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
11.6%
13
 
11.6%
9
 
8.0%
8
 
7.1%
7
 
6.2%
6
 
5.4%
5
 
4.5%
5
 
4.5%
4
 
3.6%
4
 
3.6%
Other values (22) 38
33.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 112
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13
 
11.6%
13
 
11.6%
9
 
8.0%
8
 
7.1%
7
 
6.2%
6
 
5.4%
5
 
4.5%
5
 
4.5%
4
 
3.6%
4
 
3.6%
Other values (22) 38
33.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 112
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
13
 
11.6%
13
 
11.6%
9
 
8.0%
8
 
7.1%
7
 
6.2%
6
 
5.4%
5
 
4.5%
5
 
4.5%
4
 
3.6%
4
 
3.6%
Other values (22) 38
33.9%

평균매입금액
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
20000000
30 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20000000
2nd row20000000
3rd row20000000
4th row20000000
5th row20000000

Common Values

ValueCountFrequency (%)
20000000 30
100.0%

Length

2023-12-10T23:25:29.489295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:25:29.567940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20000000 30
100.0%

Correlations

2023-12-10T23:25:29.612944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
유형명업종명시도명
유형명1.0000.8420.877
업종명0.8421.0000.000
시도명0.8770.0001.000
2023-12-10T23:25:29.885682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
유형명업종명
유형명1.0000.646
업종명0.6461.000
2023-12-10T23:25:29.943729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
유형명업종명
유형명1.0000.646
업종명0.6461.000

Missing values

2023-12-10T23:25:28.202024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T23:25:28.294878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

유형명년월업종명시도명평균매입금액
0업종2021-10-01유통업전국20000000
1업종2021-10-01건설업전국20000000
2업종2021-10-01제조업전국20000000
3업종2021-10-01부동산업전국20000000
4업종2021-10-01서비스업전국20000000
5업종2021-10-01농업/임업/어업전국20000000
6업종2021-10-01외식업전국20000000
7업종2021-10-01기타전국20000000
8업종2021-10-01정보통신업전국20000000
9지역2021-10-01전체경상북도20000000
유형명년월업종명시도명평균매입금액
20지역2021-10-01전체제주특별자치도20000000
21지역2021-10-01전체대구광역시20000000
22지역2021-10-01전체인천광역시20000000
23지역2021-10-01전체전라남도20000000
24지역2021-10-01전체대전광역시20000000
25지역2021-10-01전체광주광역시20000000
26지역x업종2021-10-01건설업강원도20000000
27지역x업종2021-10-01기타경상남도20000000
28지역x업종2021-10-01제조업전라북도20000000
29지역x업종2021-10-01건설업경상남도20000000