Overview

Dataset statistics

Number of variables6
Number of observations124
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.1 KiB
Average record size in memory50.1 B

Variable types

Numeric1
Categorical1
Text4

Dataset

Description경상남도 내 택시 업체 현황입니다.
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3083979

Alerts

연번 is highly overall correlated with 시군명High correlation
시군명 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 22:38:42.897999
Analysis finished2023-12-10 22:38:43.417594
Duration0.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct124
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean62.5
Minimum1
Maximum124
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2023-12-11T07:38:43.483899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.15
Q131.75
median62.5
Q393.25
95-th percentile117.85
Maximum124
Range123
Interquartile range (IQR)61.5

Descriptive statistics

Standard deviation35.939764
Coefficient of variation (CV)0.57503623
Kurtosis-1.2
Mean62.5
Median Absolute Deviation (MAD)31
Skewness0
Sum7750
Variance1291.6667
MonotonicityStrictly increasing
2023-12-11T07:38:43.618028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.8%
80 1
 
0.8%
93 1
 
0.8%
92 1
 
0.8%
91 1
 
0.8%
90 1
 
0.8%
89 1
 
0.8%
88 1
 
0.8%
87 1
 
0.8%
86 1
 
0.8%
Other values (114) 114
91.9%
ValueCountFrequency (%)
1 1
0.8%
2 1
0.8%
3 1
0.8%
4 1
0.8%
5 1
0.8%
6 1
0.8%
7 1
0.8%
8 1
0.8%
9 1
0.8%
10 1
0.8%
ValueCountFrequency (%)
124 1
0.8%
123 1
0.8%
122 1
0.8%
121 1
0.8%
120 1
0.8%
119 1
0.8%
118 1
0.8%
117 1
0.8%
116 1
0.8%
115 1
0.8%

시군명
Categorical

HIGH CORRELATION 

Distinct18
Distinct (%)14.5%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
창원시
35 
진주시
15 
사천시
남해군
김해시
Other values (13)
54 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row창원시
2nd row창원시
3rd row창원시
4th row창원시
5th row창원시

Common Values

ValueCountFrequency (%)
창원시 35
28.2%
진주시 15
12.1%
사천시 7
 
5.6%
남해군 7
 
5.6%
김해시 6
 
4.8%
밀양시 6
 
4.8%
창녕군 6
 
4.8%
양산시 5
 
4.0%
거창군 5
 
4.0%
통영시 5
 
4.0%
Other values (8) 27
21.8%

Length

2023-12-11T07:38:43.739862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
창원시 35
28.2%
진주시 15
12.1%
사천시 7
 
5.6%
남해군 7
 
5.6%
김해시 6
 
4.8%
밀양시 6
 
4.8%
창녕군 6
 
4.8%
거창군 5
 
4.0%
통영시 5
 
4.0%
양산시 5
 
4.0%
Other values (8) 27
21.8%
Distinct120
Distinct (%)96.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-11T07:38:43.993902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length7
Mean length6.5645161
Min length4

Characters and Unicode

Total characters814
Distinct characters92
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique117 ?
Unique (%)94.4%

Sample

1st row(주)창원택시
2nd row(유)신일교통
3rd row(주)금호운수
4th row(주)광덕운수
5th row(주)동성택시
ValueCountFrequency (%)
천일택시㈜ 3
 
2.4%
제일택시 2
 
1.6%
남양운수(주 2
 
1.6%
㈜경일교통 1
 
0.8%
에니콜택시(주 1
 
0.8%
㈜칠원택시 1
 
0.8%
유)삼성택시 1
 
0.8%
유)금성운수 1
 
0.8%
합)신반택시 1
 
0.8%
㈜의령택시 1
 
0.8%
Other values (110) 110
88.7%
2023-12-11T07:38:44.371924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 92
 
11.3%
( 92
 
11.3%
85
 
10.4%
85
 
10.4%
37
 
4.5%
30
 
3.7%
28
 
3.4%
28
 
3.4%
25
 
3.1%
22
 
2.7%
Other values (82) 290
35.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 598
73.5%
Close Punctuation 92
 
11.3%
Open Punctuation 92
 
11.3%
Other Symbol 18
 
2.2%
Decimal Number 11
 
1.4%
Uppercase Letter 2
 
0.2%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
85
 
14.2%
85
 
14.2%
37
 
6.2%
30
 
5.0%
28
 
4.7%
28
 
4.7%
25
 
4.2%
22
 
3.7%
15
 
2.5%
13
 
2.2%
Other values (71) 230
38.5%
Decimal Number
ValueCountFrequency (%)
0 7
63.6%
1 1
 
9.1%
2 1
 
9.1%
7 1
 
9.1%
8 1
 
9.1%
Uppercase Letter
ValueCountFrequency (%)
M 1
50.0%
S 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 92
100.0%
Open Punctuation
ValueCountFrequency (%)
( 92
100.0%
Other Symbol
ValueCountFrequency (%)
18
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 616
75.7%
Common 196
 
24.1%
Latin 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
85
 
13.8%
85
 
13.8%
37
 
6.0%
30
 
4.9%
28
 
4.5%
28
 
4.5%
25
 
4.1%
22
 
3.6%
18
 
2.9%
15
 
2.4%
Other values (72) 243
39.4%
Common
ValueCountFrequency (%)
) 92
46.9%
( 92
46.9%
0 7
 
3.6%
. 1
 
0.5%
1 1
 
0.5%
2 1
 
0.5%
7 1
 
0.5%
8 1
 
0.5%
Latin
ValueCountFrequency (%)
M 1
50.0%
S 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 598
73.5%
ASCII 198
 
24.3%
None 18
 
2.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 92
46.5%
( 92
46.5%
0 7
 
3.5%
. 1
 
0.5%
M 1
 
0.5%
S 1
 
0.5%
1 1
 
0.5%
2 1
 
0.5%
7 1
 
0.5%
8 1
 
0.5%
Hangul
ValueCountFrequency (%)
85
 
14.2%
85
 
14.2%
37
 
6.2%
30
 
5.0%
28
 
4.7%
28
 
4.7%
25
 
4.2%
22
 
3.7%
15
 
2.5%
13
 
2.2%
Other values (71) 230
38.5%
None
ValueCountFrequency (%)
18
100.0%
Distinct115
Distinct (%)92.7%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-11T07:38:44.656151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters372
Distinct characters118
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)87.1%

Sample

1st row서준수
2nd row박구영
3rd row김기동
4th row조기춘
5th row허종길
ValueCountFrequency (%)
이행식 3
 
2.4%
박세곤 3
 
2.4%
조기춘 2
 
1.6%
조재화 2
 
1.6%
김미령 2
 
1.6%
주장천 2
 
1.6%
박재선 2
 
1.6%
손태영 1
 
0.8%
유의태 1
 
0.8%
이정기 1
 
0.8%
Other values (105) 105
84.7%
2023-12-11T07:38:45.037292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
20
 
5.4%
16
 
4.3%
13
 
3.5%
12
 
3.2%
12
 
3.2%
10
 
2.7%
10
 
2.7%
10
 
2.7%
9
 
2.4%
8
 
2.2%
Other values (108) 252
67.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 372
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
20
 
5.4%
16
 
4.3%
13
 
3.5%
12
 
3.2%
12
 
3.2%
10
 
2.7%
10
 
2.7%
10
 
2.7%
9
 
2.4%
8
 
2.2%
Other values (108) 252
67.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 372
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
20
 
5.4%
16
 
4.3%
13
 
3.5%
12
 
3.2%
12
 
3.2%
10
 
2.7%
10
 
2.7%
10
 
2.7%
9
 
2.4%
8
 
2.2%
Other values (108) 252
67.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 372
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
20
 
5.4%
16
 
4.3%
13
 
3.5%
12
 
3.2%
12
 
3.2%
10
 
2.7%
10
 
2.7%
10
 
2.7%
9
 
2.4%
8
 
2.2%
Other values (108) 252
67.7%

주소
Text

Distinct115
Distinct (%)92.7%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-11T07:38:45.310602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length28
Mean length22.379032
Min length16

Characters and Unicode

Total characters2775
Distinct characters155
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique107 ?
Unique (%)86.3%

Sample

1st row경상남도 창원시 성산구 정동로162번길 36
2nd row경상남도 창원시 의창구 차상로185
3rd row경상남도 창원시 성산구 정동로162번길 18
4th row경상남도 창원시 성산구 정동로162번길 20
5th row경상남도 창원시 의창구 사화로302
ValueCountFrequency (%)
경상남도 124
 
21.3%
창원시 35
 
6.0%
마산회원구 17
 
2.9%
진주시 15
 
2.6%
마산합포구 7
 
1.2%
사천시 7
 
1.2%
남해군 7
 
1.2%
내서읍 7
 
1.2%
창녕군 6
 
1.0%
밀양시 6
 
1.0%
Other values (254) 351
60.3%
2023-12-11T07:38:45.726078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
458
 
16.5%
154
 
5.5%
131
 
4.7%
125
 
4.5%
124
 
4.5%
1 110
 
4.0%
92
 
3.3%
86
 
3.1%
2 61
 
2.2%
58
 
2.1%
Other values (145) 1376
49.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1797
64.8%
Space Separator 458
 
16.5%
Decimal Number 437
 
15.7%
Dash Punctuation 33
 
1.2%
Close Punctuation 25
 
0.9%
Open Punctuation 25
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
154
 
8.6%
131
 
7.3%
125
 
7.0%
124
 
6.9%
92
 
5.1%
86
 
4.8%
58
 
3.2%
57
 
3.2%
53
 
2.9%
51
 
2.8%
Other values (131) 866
48.2%
Decimal Number
ValueCountFrequency (%)
1 110
25.2%
2 61
14.0%
3 50
11.4%
4 41
 
9.4%
6 39
 
8.9%
5 36
 
8.2%
8 27
 
6.2%
9 26
 
5.9%
7 25
 
5.7%
0 22
 
5.0%
Space Separator
ValueCountFrequency (%)
458
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 33
100.0%
Close Punctuation
ValueCountFrequency (%)
) 25
100.0%
Open Punctuation
ValueCountFrequency (%)
( 25
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1797
64.8%
Common 978
35.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
154
 
8.6%
131
 
7.3%
125
 
7.0%
124
 
6.9%
92
 
5.1%
86
 
4.8%
58
 
3.2%
57
 
3.2%
53
 
2.9%
51
 
2.8%
Other values (131) 866
48.2%
Common
ValueCountFrequency (%)
458
46.8%
1 110
 
11.2%
2 61
 
6.2%
3 50
 
5.1%
4 41
 
4.2%
6 39
 
4.0%
5 36
 
3.7%
- 33
 
3.4%
8 27
 
2.8%
9 26
 
2.7%
Other values (4) 97
 
9.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1797
64.8%
ASCII 978
35.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
458
46.8%
1 110
 
11.2%
2 61
 
6.2%
3 50
 
5.1%
4 41
 
4.2%
6 39
 
4.0%
5 36
 
3.7%
- 33
 
3.4%
8 27
 
2.8%
9 26
 
2.7%
Other values (4) 97
 
9.9%
Hangul
ValueCountFrequency (%)
154
 
8.6%
131
 
7.3%
125
 
7.0%
124
 
6.9%
92
 
5.1%
86
 
4.8%
58
 
3.2%
57
 
3.2%
53
 
2.9%
51
 
2.8%
Other values (131) 866
48.2%
Distinct122
Distinct (%)98.4%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-11T07:38:45.975020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.008065
Min length11

Characters and Unicode

Total characters1489
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique120 ?
Unique (%)96.8%

Sample

1st row055-282-3532
2nd row055-298-0754
3rd row055-262-0764
4th row055-281-2111
5th row055-296-0646
ValueCountFrequency (%)
070-4226-4730 2
 
1.6%
055-272-0051 2
 
1.6%
055-674-3938 1
 
0.8%
055-382-0531 1
 
0.8%
055-282-3532 1
 
0.8%
055-383-5000 1
 
0.8%
055-533-2103 1
 
0.8%
055-526-2101 1
 
0.8%
055-585-0501 1
 
0.8%
055-582-6000 1
 
0.8%
Other values (112) 112
90.3%
2023-12-11T07:38:46.375594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 331
22.2%
- 248
16.7%
0 210
14.1%
3 125
 
8.4%
2 119
 
8.0%
6 89
 
6.0%
1 86
 
5.8%
7 82
 
5.5%
8 77
 
5.2%
4 76
 
5.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1241
83.3%
Dash Punctuation 248
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 331
26.7%
0 210
16.9%
3 125
 
10.1%
2 119
 
9.6%
6 89
 
7.2%
1 86
 
6.9%
7 82
 
6.6%
8 77
 
6.2%
4 76
 
6.1%
9 46
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 248
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1489
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 331
22.2%
- 248
16.7%
0 210
14.1%
3 125
 
8.4%
2 119
 
8.0%
6 89
 
6.0%
1 86
 
5.8%
7 82
 
5.5%
8 77
 
5.2%
4 76
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1489
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 331
22.2%
- 248
16.7%
0 210
14.1%
3 125
 
8.4%
2 119
 
8.0%
6 89
 
6.0%
1 86
 
5.8%
7 82
 
5.5%
8 77
 
5.2%
4 76
 
5.1%

Interactions

2023-12-11T07:38:43.155128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:38:46.467354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시군명
연번1.0000.965
시군명0.9651.000
2023-12-11T07:38:46.541287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시군명
연번1.0000.802
시군명0.8021.000

Missing values

2023-12-11T07:38:43.292071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:38:43.383559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시군명업체명대표자주소전화번호
01창원시(주)창원택시서준수경상남도 창원시 성산구 정동로162번길 36055-282-3532
12창원시(유)신일교통박구영경상남도 창원시 의창구 차상로185055-298-0754
23창원시(주)금호운수김기동경상남도 창원시 성산구 정동로162번길 18055-262-0764
34창원시(주)광덕운수조기춘경상남도 창원시 성산구 정동로162번길 20055-281-2111
45창원시(주)동성택시허종길경상남도 창원시 의창구 사화로302055-296-0646
56창원시(유)동호택시정호식경상남도 창원시 의창구 북면 천주로929055-298-0500
67창원시(유)일광택시변홍섭경상남도 창원시 의창구 북면 천주로862번길 82055-251-0005
78창원시(유)경남교통류재원경상남도 창원시 마산회원구 회성북1길 80055-231-7766
89창원시(유)광동택시김현동경상남도 창원시 마산회원구 봉양로187055-242-9103
910창원시(주)남성택시이준호경상남도 창원시 마산회원구 내서읍 호계본동1길 115055-232-3222
연번시군명업체명대표자주소전화번호
114115함양군(자)세일택시임봉택경상남도 함양군 함양읍 용평3길 9055-963-4000
115116함양군(합자)지리산택시박충성경상남도 함양군 함양읍 함양로 1243-1055-963-3456
116117거창군(합)신택시이재청경상남도 거창군 가조면 지산로 1496번지055-942-1231
117118거창군(합)신창택시남기현경상남도 거창군 거창읍 거열로 133번지055-943-9992
118119거창군(합)거창택시정옥식경상남도 거창군 거창읍 강남로 52-1번지055-944-7077
119120거창군(합)삼양택시전병두경상남도 거창군 거창읍 거함대로5길 51055-944-3838
120121거창군(주)거창80번택시박현보경상남도 거창군 거창읍 중앙로1길 124번지055-944-2080
121122합천군(유)삼일교통백종범경상남도 합천군 합천읍 중앙로 81-1055-931-3131
122123합천군㈜합천택시전충의경상남도 합천군 합천읍 대야로 901055-931-2757
123124합천군동성택시임영구경상남도 합천군 삼가면 금리4길 35055-933-6663