Overview

Dataset statistics

Number of variables10
Number of observations42
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.4 KiB
Average record size in memory83.1 B

Variable types

Categorical7
Text2
DateTime1

Dataset

Description경상북도 구미시 승마장에서 보유하고 있는 마필 현황으로 마명, 원명, 품종, 성별, 모색, 생년월일, 생산국등의 정보를 제공하고 있습니다.
URLhttps://www.data.go.kr/data/3071272/fileData.do

Alerts

구분 has constant value ""Constant
관리기관전화번호 has constant value ""Constant
관리기관명 has constant value ""Constant
생산국 is highly imbalanced (51.3%)Imbalance
마명 has unique valuesUnique
원명 has unique valuesUnique
생년월일(입사일) has unique valuesUnique

Reproduction

Analysis started2023-12-12 15:21:36.401753
Analysis finished2023-12-12 15:21:37.177015
Duration0.78 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

CONSTANT 

Distinct1
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size468.0 B
보유마
42 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row보유마
2nd row보유마
3rd row보유마
4th row보유마
5th row보유마

Common Values

ValueCountFrequency (%)
보유마 42
100.0%

Length

2023-12-13T00:21:37.259322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:21:37.388975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
보유마 42
100.0%

마명
Text

UNIQUE 

Distinct42
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size468.0 B
2023-12-13T00:21:37.658083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length3.047619
Min length3

Characters and Unicode

Total characters128
Distinct characters73
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)100.0%

Sample

1st row강 쇠
2nd row공 주
3rd row진 주
4th row명 문
5th row라 온
ValueCountFrequency (%)
2
 
2.9%
2
 
2.9%
2
 
2.9%
2
 
2.9%
2
 
2.9%
2
 
2.9%
2
 
2.9%
2
 
2.9%
1
 
1.4%
로사나 1
 
1.4%
Other values (52) 52
74.3%
2023-12-13T00:21:38.092076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28
 
21.9%
6
 
4.7%
6
 
4.7%
3
 
2.3%
3
 
2.3%
3
 
2.3%
2
 
1.6%
2
 
1.6%
2
 
1.6%
2
 
1.6%
Other values (63) 71
55.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 99
77.3%
Space Separator 28
 
21.9%
Uppercase Letter 1
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
 
6.1%
6
 
6.1%
3
 
3.0%
3
 
3.0%
3
 
3.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
Other values (61) 68
68.7%
Space Separator
ValueCountFrequency (%)
28
100.0%
Uppercase Letter
ValueCountFrequency (%)
G 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 99
77.3%
Common 28
 
21.9%
Latin 1
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6
 
6.1%
6
 
6.1%
3
 
3.0%
3
 
3.0%
3
 
3.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
Other values (61) 68
68.7%
Common
ValueCountFrequency (%)
28
100.0%
Latin
ValueCountFrequency (%)
G 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 99
77.3%
ASCII 29
 
22.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
28
96.6%
G 1
 
3.4%
Hangul
ValueCountFrequency (%)
6
 
6.1%
6
 
6.1%
3
 
3.0%
3
 
3.0%
3
 
3.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
Other values (61) 68
68.7%

원명
Text

UNIQUE 

Distinct42
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size468.0 B
2023-12-13T00:21:38.341635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length3
Mean length3.4761905
Min length3

Characters and Unicode

Total characters146
Distinct characters99
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)100.0%

Sample

1st row중 심
2nd row태풍공주
3rd row명문출신
4th row명문명가
5th row원더빅터
ValueCountFrequency (%)
1
 
1.8%
1
 
1.8%
1
 
1.8%
프로세상 1
 
1.8%
삼국정벌 1
 
1.8%
태양이 1
 
1.8%
1
 
1.8%
1
 
1.8%
1
 
1.8%
1
 
1.8%
Other values (45) 45
81.8%
2023-12-13T00:21:38.746469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13
 
8.9%
5
 
3.4%
4
 
2.7%
4
 
2.7%
4
 
2.7%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
Other values (89) 101
69.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 132
90.4%
Space Separator 13
 
8.9%
Letter Number 1
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5
 
3.8%
4
 
3.0%
4
 
3.0%
4
 
3.0%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
2
 
1.5%
Other values (87) 98
74.2%
Space Separator
ValueCountFrequency (%)
13
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 132
90.4%
Common 13
 
8.9%
Latin 1
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5
 
3.8%
4
 
3.0%
4
 
3.0%
4
 
3.0%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
2
 
1.5%
Other values (87) 98
74.2%
Common
ValueCountFrequency (%)
13
100.0%
Latin
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 132
90.4%
ASCII 13
 
8.9%
Number Forms 1
 
0.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13
100.0%
Hangul
ValueCountFrequency (%)
5
 
3.8%
4
 
3.0%
4
 
3.0%
4
 
3.0%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
2
 
1.5%
Other values (87) 98
74.2%
Number Forms
ValueCountFrequency (%)
1
100.0%

품종
Categorical

Distinct5
Distinct (%)11.9%
Missing0
Missing (%)0.0%
Memory size468.0 B
한라마
20 
더러버렛
14 
웰시포니
한국승용종
 
2
s포니
 
1

Length

Max length5
Median length4.5
Mean length3.547619
Min length3

Unique

Unique1 ?
Unique (%)2.4%

Sample

1st row더러버렛
2nd row더러버렛
3rd row더러버렛
4th row더러버렛
5th row더러버렛

Common Values

ValueCountFrequency (%)
한라마 20
47.6%
더러버렛 14
33.3%
웰시포니 5
 
11.9%
한국승용종 2
 
4.8%
s포니 1
 
2.4%

Length

2023-12-13T00:21:38.923178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:21:39.423467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한라마 20
47.6%
더러버렛 14
33.3%
웰시포니 5
 
11.9%
한국승용종 2
 
4.8%
s포니 1
 
2.4%

성별
Categorical

Distinct4
Distinct (%)9.5%
Missing0
Missing (%)0.0%
Memory size468.0 B
21 
19 
 
1
 
1

Length

Max length2
Median length1
Mean length1.0238095
Min length1

Unique

Unique2 ?
Unique (%)4.8%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
21
50.0%
19
45.2%
1
 
2.4%
1
 
2.4%

Length

2023-12-13T00:21:39.563333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:21:39.702538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
22
52.4%
19
45.2%
1
 
2.4%

모색
Categorical

Distinct6
Distinct (%)14.3%
Missing0
Missing (%)0.0%
Memory size468.0 B
갈 색
12 
회 색
11 
밤 색
흑갈색
얼룩이

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique1 ?
Unique (%)2.4%

Sample

1st row갈 색
2nd row흑갈색
3rd row갈 색
4th row갈 색
5th row흑갈색

Common Values

ValueCountFrequency (%)
갈 색 12
28.6%
회 색 11
26.2%
밤 색 7
16.7%
흑갈색 6
14.3%
얼룩이 5
11.9%
흑 색 1
 
2.4%

Length

2023-12-13T00:21:39.833707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:21:39.979960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
31
42.5%
12
 
16.4%
11
 
15.1%
7
 
9.6%
흑갈색 6
 
8.2%
얼룩이 5
 
6.8%
1
 
1.4%
Distinct42
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size468.0 B
Minimum2001-01-04 00:00:00
Maximum2031-07-05 00:00:00
2023-12-13T00:21:40.165885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:21:40.371518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=42)

생산국
Categorical

IMBALANCE 

Distinct6
Distinct (%)14.3%
Missing0
Missing (%)0.0%
Memory size468.0 B
한국
32 
미상
독일
 
2
호주
 
1
미국
 
1

Length

Max length3
Median length2
Mean length2.0238095
Min length2

Unique

Unique3 ?
Unique (%)7.1%

Sample

1st row한국
2nd row한국
3rd row한국
4th row한국
5th row호주

Common Values

ValueCountFrequency (%)
한국 32
76.2%
미상 5
 
11.9%
독일 2
 
4.8%
호주 1
 
2.4%
미국 1
 
2.4%
한국 1
 
2.4%

Length

2023-12-13T00:21:40.554583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:21:40.759264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한국 33
78.6%
미상 5
 
11.9%
독일 2
 
4.8%
호주 1
 
2.4%
미국 1
 
2.4%

관리기관전화번호
Categorical

CONSTANT 

Distinct1
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size468.0 B
054-480-5843
42 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row054-480-5843
2nd row054-480-5843
3rd row054-480-5843
4th row054-480-5843
5th row054-480-5843

Common Values

ValueCountFrequency (%)
054-480-5843 42
100.0%

Length

2023-12-13T00:21:40.934560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:21:41.069976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
054-480-5843 42
100.0%

관리기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size468.0 B
경상북도 구미시청
42 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경상북도 구미시청
2nd row경상북도 구미시청
3rd row경상북도 구미시청
4th row경상북도 구미시청
5th row경상북도 구미시청

Common Values

ValueCountFrequency (%)
경상북도 구미시청 42
100.0%

Length

2023-12-13T00:21:41.202612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:21:41.326586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경상북도 42
50.0%
구미시청 42
50.0%

Correlations

2023-12-13T00:21:41.433503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
마명원명품종성별모색생년월일(입사일)생산국
마명1.0001.0001.0001.0001.0001.0001.000
원명1.0001.0001.0001.0001.0001.0001.000
품종1.0001.0001.0000.0770.2701.0000.389
성별1.0001.0000.0771.0000.0001.0000.000
모색1.0001.0000.2700.0001.0001.0000.321
생년월일(입사일)1.0001.0001.0001.0001.0001.0001.000
생산국1.0001.0000.3890.0000.3211.0001.000
2023-12-13T00:21:41.583860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별모색생산국품종
성별1.0000.0000.0000.037
모색0.0001.0000.1070.175
생산국0.0000.1071.0000.267
품종0.0370.1750.2671.000
2023-12-13T00:21:41.709449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
품종성별모색생산국
품종1.0000.0370.1750.267
성별0.0371.0000.0000.000
모색0.1750.0001.0000.107
생산국0.2670.0000.1071.000

Missing values

2023-12-13T00:21:36.929971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:21:37.114896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분마명원명품종성별모색생년월일(입사일)생산국관리기관전화번호관리기관명
0보유마강 쇠중 심더러버렛갈 색06.03.30한국054-480-5843경상북도 구미시청
1보유마공 주태풍공주더러버렛흑갈색07.05.31한국054-480-5843경상북도 구미시청
2보유마진 주명문출신더러버렛갈 색07.03.29한국054-480-5843경상북도 구미시청
3보유마명 문명문명가더러버렛갈 색08.03.03한국054-480-5843경상북도 구미시청
4보유마라 온원더빅터더러버렛흑갈색05.09.25호주054-480-5843경상북도 구미시청
5보유마만 세대동만세더러버렛밤 색15.05.14한국054-480-5843경상북도 구미시청
6보유마체르니베누스더러버렛밤 색08.03.08한국054-480-5843경상북도 구미시청
7보유마라피드라피드스타더러버렛밤 색12.04.01한국054-480-5843경상북도 구미시청
8보유마아레스정글짐더러버렛갈 색11.04.15한국054-480-5843경상북도 구미시청
9보유마솔 라솔라시도더러버렛갈 색12.04.09한국054-480-5843경상북도 구미시청
구분마명원명품종성별모색생년월일(입사일)생산국관리기관전화번호관리기관명
32보유마밍 키밍 키웰시포니갈 색15.05.11한국054-480-5843경상북도 구미시청
33보유마자 두자 두웰시포니회 색17.04.07한국054-480-5843경상북도 구미시청
34보유마메이로즈메이로즈한국승용종갈 색18.05.23한국054-480-5843경상북도 구미시청
35보유마레 아백화씨챕한국승용종갈 색14.06.28한국054-480-5843경상북도 구미시청
36보유마대 범제주벌판한라마회 색14.06.24한국054-480-5843경상북도 구미시청
37보유마해 치라벤더한라마얼룩이15.05.25한국054-480-5843경상북도 구미시청
38보유마도 담브릿지한라마얼룩이14.04.01미상054-480-5843경상북도 구미시청
39보유마캐 리캐 리한라마얼룩이16.04.12한국054-480-5843경상북도 구미시청
40보유마거 루레 오한라마갈 색18.10.07한국054-480-5843경상북도 구미시청
41보유마아리아세명햇살더러버렛회 색18.03.28한국054-480-5843경상북도 구미시청