Overview

Dataset statistics

Number of variables11
Number of observations56
Missing cells56
Missing cells (%)9.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.1 KiB
Average record size in memory92.4 B

Variable types

Numeric1
Text1
Categorical7
DateTime1
Unsupported1

Dataset

Description상주국제승마장에서 활용하고 있는 마필에 관한 상세정보
Author경상북도 상주시
URLhttps://www.data.go.kr/data/3075068/fileData.do

Alerts

소속 has constant value ""Constant
데이터기준일 has constant value ""Constant
연번 is highly overall correlated with 종목High correlation
품종 is highly overall correlated with 생산지 and 1 other fieldsHigh correlation
생산지 is highly overall correlated with 품종High correlation
종목 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
Unnamed: 10 has 56 (100.0%) missing valuesMissing
연번 has unique valuesUnique
마명 has unique valuesUnique
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 07:11:25.833704
Analysis finished2023-12-12 07:11:26.701380
Duration0.87 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct56
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28.5
Minimum1
Maximum56
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size636.0 B
2023-12-12T16:11:26.803699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.75
Q114.75
median28.5
Q342.25
95-th percentile53.25
Maximum56
Range55
Interquartile range (IQR)27.5

Descriptive statistics

Standard deviation16.309506
Coefficient of variation (CV)0.57226338
Kurtosis-1.2
Mean28.5
Median Absolute Deviation (MAD)14
Skewness0
Sum1596
Variance266
MonotonicityStrictly increasing
2023-12-12T16:11:26.947062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.8%
30 1
 
1.8%
32 1
 
1.8%
33 1
 
1.8%
34 1
 
1.8%
35 1
 
1.8%
36 1
 
1.8%
37 1
 
1.8%
38 1
 
1.8%
39 1
 
1.8%
Other values (46) 46
82.1%
ValueCountFrequency (%)
1 1
1.8%
2 1
1.8%
3 1
1.8%
4 1
1.8%
5 1
1.8%
6 1
1.8%
7 1
1.8%
8 1
1.8%
9 1
1.8%
10 1
1.8%
ValueCountFrequency (%)
56 1
1.8%
55 1
1.8%
54 1
1.8%
53 1
1.8%
52 1
1.8%
51 1
1.8%
50 1
1.8%
49 1
1.8%
48 1
1.8%
47 1
1.8%

마명
Text

UNIQUE 

Distinct56
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size580.0 B
2023-12-12T16:11:27.175802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length7.5
Mean length4.1785714
Min length1

Characters and Unicode

Total characters234
Distinct characters127
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)100.0%

Sample

1st row알칸티노
2nd row맨인레드
3rd row웬드랜드
4th row던디
5th row둘리
ValueCountFrequency (%)
알칸티노 1
 
1.8%
맨인레드 1
 
1.8%
나토얀 1
 
1.8%
마루한 1
 
1.8%
마루나 1
 
1.8%
1
 
1.8%
아몬드 1
 
1.8%
제주한라 1
 
1.8%
뱅크 1
 
1.8%
루씨(풀향기자마 1
 
1.8%
Other values (46) 46
82.1%
2023-12-12T16:11:27.586931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 11
 
4.7%
11
 
4.7%
( 11
 
4.7%
9
 
3.8%
8
 
3.4%
6
 
2.6%
5
 
2.1%
5
 
2.1%
5
 
2.1%
4
 
1.7%
Other values (117) 159
67.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 212
90.6%
Close Punctuation 11
 
4.7%
Open Punctuation 11
 
4.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
5.2%
9
 
4.2%
8
 
3.8%
6
 
2.8%
5
 
2.4%
5
 
2.4%
5
 
2.4%
4
 
1.9%
4
 
1.9%
3
 
1.4%
Other values (115) 152
71.7%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 212
90.6%
Common 22
 
9.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
5.2%
9
 
4.2%
8
 
3.8%
6
 
2.8%
5
 
2.4%
5
 
2.4%
5
 
2.4%
4
 
1.9%
4
 
1.9%
3
 
1.4%
Other values (115) 152
71.7%
Common
ValueCountFrequency (%)
) 11
50.0%
( 11
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 212
90.6%
ASCII 22
 
9.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 11
50.0%
( 11
50.0%
Hangul
ValueCountFrequency (%)
11
 
5.2%
9
 
4.2%
8
 
3.8%
6
 
2.8%
5
 
2.4%
5
 
2.4%
5
 
2.4%
4
 
1.9%
4
 
1.9%
3
 
1.4%
Other values (115) 152
71.7%

품종
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)19.6%
Missing0
Missing (%)0.0%
Memory size580.0 B
더러브렛
14 
쿼터호스
12 
웜블러드
조랑말
소형마
Other values (6)

Length

Max length7
Median length4
Mean length3.9107143
Min length3

Unique

Unique3 ?
Unique (%)5.4%

Sample

1st row웜블러드
2nd row웜블러드
3rd row웜블러드
4th row웜블러드
5th row조랑말

Common Values

ValueCountFrequency (%)
더러브렛 14
25.0%
쿼터호스 12
21.4%
웜블러드 9
16.1%
조랑말 9
16.1%
소형마 3
 
5.4%
미니호스 2
 
3.6%
한라마 2
 
3.6%
셔틀랜드포니 2
 
3.6%
웨일즈포니 1
 
1.8%
하프링거 1
 
1.8%

Length

2023-12-12T16:11:27.768347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
더러브렛 14
25.0%
쿼터호스 12
21.4%
웜블러드 9
16.1%
조랑말 9
16.1%
소형마 3
 
5.4%
셔틀랜드포니 3
 
5.4%
미니호스 2
 
3.6%
한라마 2
 
3.6%
웨일즈포니 1
 
1.8%
하프링거 1
 
1.8%

성별
Categorical

Distinct3
Distinct (%)5.4%
Missing0
Missing (%)0.0%
Memory size580.0 B
27 
거세
18 
11 

Length

Max length2
Median length1
Mean length1.3214286
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row거세
2nd row거세
3rd row거세
4th row거세
5th row거세

Common Values

ValueCountFrequency (%)
27
48.2%
거세 18
32.1%
11
19.6%

Length

2023-12-12T16:11:27.988734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:11:28.159405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
27
48.2%
거세 18
32.1%
11
19.6%

생산지
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)10.7%
Missing0
Missing (%)0.0%
Memory size580.0 B
한국
33 
독일
11 
미국
프랑스
 
3
미상
 
1

Length

Max length3
Median length2
Mean length2.0357143
Min length1

Unique

Unique2 ?
Unique (%)3.6%

Sample

1st row독일
2nd row독일
3rd row독일
4th row독일
5th row한국

Common Values

ValueCountFrequency (%)
한국 33
58.9%
독일 11
 
19.6%
미국 7
 
12.5%
프랑스 3
 
5.4%
미상 1
 
1.8%
1
 
1.8%

Length

2023-12-12T16:11:28.311449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:11:28.465045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한국 33
58.9%
독일 11
 
19.6%
미국 7
 
12.5%
프랑스 3
 
5.4%
미상 1
 
1.8%
1
 
1.8%

출생년도
Categorical

Distinct19
Distinct (%)33.9%
Missing0
Missing (%)0.0%
Memory size580.0 B
2008
2001
2007
2015
2003
Other values (14)
29 

Length

Max length4
Median length4
Mean length3.9642857
Min length2

Unique

Unique6 ?
Unique (%)10.7%

Sample

1st row2003
2nd row2001
3rd row2000
4th row2001
5th row2002

Common Values

ValueCountFrequency (%)
2008 7
12.5%
2001 5
8.9%
2007 5
8.9%
2015 5
8.9%
2003 5
8.9%
2002 4
 
7.1%
2006 4
 
7.1%
2011 3
 
5.4%
2010 3
 
5.4%
2013 3
 
5.4%
Other values (9) 12
21.4%

Length

2023-12-12T16:11:28.625972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2008 7
12.5%
2007 5
8.9%
2015 5
8.9%
2003 5
8.9%
2001 5
8.9%
2002 4
 
7.1%
2006 4
 
7.1%
2010 3
 
5.4%
2013 3
 
5.4%
2011 3
 
5.4%
Other values (9) 12
21.4%

종목
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)10.7%
Missing0
Missing (%)0.0%
Memory size580.0 B
일반승용마
23 
<NA>
15 
승용마
장애물비월
번식용

Length

Max length5
Median length4
Mean length4.2678571
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row장애물비월
2nd row장애물비월
3rd row마장마술
4th row마장마술
5th row일반승용마

Common Values

ValueCountFrequency (%)
일반승용마 23
41.1%
<NA> 15
26.8%
승용마 9
 
16.1%
장애물비월 4
 
7.1%
번식용 3
 
5.4%
마장마술 2
 
3.6%

Length

2023-12-12T16:11:28.811795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:11:28.949101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반승용마 23
41.1%
na 15
26.8%
승용마 9
 
16.1%
장애물비월 4
 
7.1%
번식용 3
 
5.4%
마장마술 2
 
3.6%

소속
Categorical

CONSTANT 

Distinct1
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size580.0 B
상주시
56 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row상주시
2nd row상주시
3rd row상주시
4th row상주시
5th row상주시

Common Values

ValueCountFrequency (%)
상주시 56
100.0%

Length

2023-12-12T16:11:29.102770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:11:29.210683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
상주시 56
100.0%

모색
Categorical

Distinct10
Distinct (%)17.9%
Missing0
Missing (%)0.0%
Memory size580.0 B
갈색
21 
밤색
16 
<NA>
황갈색
흑색
Other values (5)

Length

Max length4
Median length2
Mean length2.2857143
Min length2

Unique

Unique3 ?
Unique (%)5.4%

Sample

1st row갈색
2nd row밤색
3rd row밤색
4th row갈색
5th row흑색

Common Values

ValueCountFrequency (%)
갈색 21
37.5%
밤색 16
28.6%
<NA> 4
 
7.1%
황갈색 4
 
7.1%
흑색 3
 
5.4%
회색 3
 
5.4%
얼룩이 2
 
3.6%
흑갈색 1
 
1.8%
갈 색 1
 
1.8%
황색 1
 
1.8%

Length

2023-12-12T16:11:29.325318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:11:29.488665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
갈색 21
36.8%
밤색 16
28.1%
na 4
 
7.0%
황갈색 4
 
7.0%
흑색 3
 
5.3%
회색 3
 
5.3%
얼룩이 2
 
3.5%
흑갈색 1
 
1.8%
1
 
1.8%
1
 
1.8%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size580.0 B
Minimum2017-10-24 00:00:00
Maximum2017-10-24 00:00:00
2023-12-12T16:11:29.627533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:11:29.735161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing56
Missing (%)100.0%
Memory size636.0 B

Interactions

2023-12-12T16:11:26.355219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:11:29.808374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번마명품종성별생산지출생년도종목모색
연번1.0001.0000.6870.5050.4380.7510.9480.188
마명1.0001.0001.0001.0001.0001.0001.0001.000
품종0.6871.0001.0000.6750.8170.8090.8590.538
성별0.5051.0000.6751.0000.1310.6200.5540.000
생산지0.4381.0000.8170.1311.0000.4360.4930.702
출생년도0.7511.0000.8090.6200.4361.0000.8640.684
종목0.9481.0000.8590.5540.4930.8641.0000.232
모색0.1881.0000.5380.0000.7020.6840.2321.000
2023-12-12T16:11:29.934702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출생년도모색종목품종생산지성별
출생년도1.0000.2510.4660.4110.1710.339
모색0.2511.0000.1200.2640.4220.000
종목0.4660.1201.0000.6760.3520.482
품종0.4110.2640.6761.0000.5670.472
생산지0.1710.4220.3520.5671.0000.033
성별0.3390.0000.4820.4720.0331.000
2023-12-12T16:11:30.080439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번품종성별생산지출생년도종목모색
연번1.0000.3750.3330.2350.3270.6400.114
품종0.3751.0000.4720.5670.4110.6760.264
성별0.3330.4721.0000.0330.3390.4820.000
생산지0.2350.5670.0331.0000.1710.3520.422
출생년도0.3270.4110.3390.1711.0000.4660.251
종목0.6400.6760.4820.3520.4661.0000.120
모색0.1140.2640.0000.4220.2510.1201.000

Missing values

2023-12-12T16:11:26.468746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:11:26.643445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번마명품종성별생산지출생년도종목소속모색데이터기준일Unnamed: 10
01알칸티노웜블러드거세독일2003장애물비월상주시갈색2017-10-24<NA>
12맨인레드웜블러드거세독일2001장애물비월상주시밤색2017-10-24<NA>
23웬드랜드웜블러드거세독일2000마장마술상주시밤색2017-10-24<NA>
34던디웜블러드거세독일2001마장마술상주시갈색2017-10-24<NA>
45둘리조랑말거세한국2002일반승용마상주시흑색2017-10-24<NA>
56일호(써니보이)웜블러드거세독일2001장애물비월상주시갈색2017-10-24<NA>
67지프댄서더러브렛한국2007일반승용마상주시밤색2017-10-24<NA>
78에바(솔푸른)더러브렛한국2001일반승용마상주시갈색2017-10-24<NA>
89비단이조랑말한국2003일반승용마상주시밤색2017-10-24<NA>
910서빈이조랑말거세한국2006일반승용마상주시얼룩이2017-10-24<NA>
연번마명품종성별생산지출생년도종목소속모색데이터기준일Unnamed: 10
4647더나이트리스완쿼터호스미국2008승용마상주시황갈색2017-10-24<NA>
4748로켓트리플모카쿼터호스2007승용마상주시갈 색2017-10-24<NA>
4849알라딘웜블러드독일1993승용마상주시밤색2017-10-24<NA>
4950멘토웜블러드거세독일2001승용마상주시회색2017-10-24<NA>
5051아드리안웜블러드거세독일1996승용마상주시밤색2017-10-24<NA>
5152크롱쿼터호스미국2011<NA>상주시황갈색2017-10-24<NA>
5253모모쿼터호스한국2017<NA>상주시갈색2017-10-24<NA>
5354로미쿼터호스한국2017<NA>상주시갈색2017-10-24<NA>
5455밴스텐셔틀랜드포니미국2002<NA>상주시황색2017-10-24<NA>
5556사바나셔틀랜드포니프랑스2003<NA>상주시갈색2017-10-24<NA>