Overview

Dataset statistics

Number of variables: 5
Number of observations: 71
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.0 KiB
Average record size in memory: 43.8 B

Variable types

Numeric: 2
Text: 1
Categorical: 1
DateTime: 1

Dataset

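Description: Registration status of safety inspection specialist agencies in Jeonbuk State (전북특별자치도), 1998-2020. Columns: 등록번호 (registration number), 업체명 (company name), 등록일자 (registration date), 등록분야 (registration field: 건축 = building, 교량/터널 = bridge/tunnel, 수리 = hydraulic structures, etc.)
Author: 전북특별자치도 (Jeonbuk State)
URL: https://www.data.go.kr/data/15119303/fileData.do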

Alerts

연번 is highly overall correlated with 등록번호 (High correlation)
등록번호 is highly overall correlated with 연번 (High correlation)
연번 has unique values (Unique)
등록번호 has unique values (Unique)
업 체 명 has unique values (Unique)
등록일자 has unique values (Unique)

Reproduction

Analysis started: 2024-03-15 00:18:16.674684
Analysis finished: 2024-03-15 00:18:18.163807
Duration: 1.49 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
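
The reported duration follows directly from the two timestamps. A minimal check using only the Python standard library (the variable names are illustrative, not part of the report):

```python
from datetime import datetime

# Timestamps copied from the Reproduction block of the report.
started = datetime.fromisoformat("2024-03-15 00:18:16.674684")
finished = datetime.fromisoformat("2024-03-15 00:18:18.163807")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```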

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 71
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 36
Minimum: 1
Maximum: 71
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 767.0 B

Quantile statistics

Minimum: 1
5th percentile: 4.5
Q1: 18.5
Median: 36
Q3: 53.5
95th percentile: 67.5
Maximum: 71
Range: 70
Interquartile range (IQR): 35

Descriptive statistics

Standard deviation: 20.639767
Coefficient of variation (CV): 0.57332687
Kurtosis: -1.2
Mean: 36
Median Absolute Deviation (MAD): 18
Skewness: 0
Sum: 2556
Variance: 426
Monotonicity: Strictly increasing
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 1 | 1.4%
2 | 1 | 1.4%
53 | 1 | 1.4%
52 | 1 | 1.4%
51 | 1 | 1.4%
50 | 1 | 1.4%
49 | 1 | 1.4%
48 | 1 | 1.4%
47 | 1 | 1.4%
46 | 1 | 1.4%
Other values (61) | 61 | 85.9%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 1.4%
2 | 1 | 1.4%
3 | 1 | 1.4%
4 | 1 | 1.4%
5 | 1 | 1.4%
6 | 1 | 1.4%
7 | 1 | 1.4%
8 | 1 | 1.4%
9 | 1 | 1.4%
10 | 1 | 1.4%

Maximum 10 values
Value | Count | Frequency (%)
71 | 1 | 1.4%
70 | 1 | 1.4%
69 | 1 | 1.4%
68 | 1 | 1.4%
67 | 1 | 1.4%
66 | 1 | 1.4%
65 | 1 | 1.4%
64 | 1 | 1.4%
63 | 1 | 1.4%
62 | 1 | 1.4%
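
Because 연번 is strictly increasing, unique, and runs from 1 to 71, it is simply the sequence 1..71, so its reported statistics can be recomputed by hand. The sketch below uses Python's `statistics` module; it assumes the report uses the sample variance (n - 1 denominator) and linearly interpolated quartiles, which is what reproduces the figures above.

```python
import statistics

# 연번 has 71 distinct values, minimum 1, maximum 71: the integers 1..71.
values = list(range(1, 72))

total = sum(values)                        # reported Sum
mean = statistics.mean(values)             # reported Mean
variance = statistics.variance(values)     # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)         # square root of the sample variance
cv = std_dev / mean                        # coefficient of variation

# Median absolute deviation: median of |x - median(x)|.
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

# Quartiles with linear interpolation between the min and max.
q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")

print(total, mean, variance, round(std_dev, 6), round(cv, 8), mad, q1, q2, q3)
```

The variance also follows from the closed form for consecutive integers, n(n + 1)/12 = 71 × 72 / 12 = 426.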

등록번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 71
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 57.380282
Minimum: 1
Maximum: 242
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 767.0 B

Quantile statistics

Minimum: 1
5th percentile: 7.5
Q1: 32.5
Median: 56
Q3: 77.5
95th percentile: 92.5
Maximum: 242
Range: 241
Interquartile range (IQR): 45

Descriptive statistics

Standard deviation: 38.259029
Coefficient of variation (CV): 0.66676266
Kurtosis: 8.1940849
Mean: 57.380282
Median Absolute Deviation (MAD): 23
Skewness: 2.0178391
Sum: 4074
Variance: 1463.7533
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
191 | 1 | 1.4%
242 | 1 | 1.4%
75 | 1 | 1.4%
74 | 1 | 1.4%
73 | 1 | 1.4%
72 | 1 | 1.4%
69 | 1 | 1.4%
68 | 1 | 1.4%
67 | 1 | 1.4%
66 | 1 | 1.4%
Other values (61) | 61 | 85.9%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 1.4%
4 | 1 | 1.4%
6 | 1 | 1.4%
7 | 1 | 1.4%
8 | 1 | 1.4%
11 | 1 | 1.4%
12 | 1 | 1.4%
14 | 1 | 1.4%
15 | 1 | 1.4%
16 | 1 | 1.4%

Maximum 10 values
Value | Count | Frequency (%)
242 | 1 | 1.4%
191 | 1 | 1.4%
94 | 1 | 1.4%
93 | 1 | 1.4%
92 | 1 | 1.4%
91 | 1 | 1.4%
90 | 1 | 1.4%
89 | 1 | 1.4%
88 | 1 | 1.4%
87 | 1 | 1.4%
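
The high skewness and kurtosis of 등록번호 come from the two largest values, 191 and 242, which sit far above the rest. One way to see this is Tukey's 1.5 × IQR rule, sketched below. The rule is a common convention chosen here for illustration; the report itself does not apply it.

```python
# Quartiles copied from the quantile statistics of 등록번호.
q1, q3 = 32.5, 77.5
iqr = q3 - q1

# Tukey fences: values beyond 1.5 * IQR from the quartiles are flagged.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# The ten largest values listed in the report.
largest = [242, 191, 94, 93, 92, 91, 90, 89, 88, 87]
flagged = sorted(v for v in largest if v < lower_fence or v > upper_fence)

print(f"fences: [{lower_fence}, {upper_fence}], flagged: {flagged}")
```

Only the ten largest values are checked here. The lower fence is negative, so no value at the low end (minimum 1) can be flagged.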

업 체 명
Text

UNIQUE 

Distinct: 71
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 696.0 B

Length

Max length: 31
Median length: 21
Mean length: 10.098592
Min length: 3

Characters and Unicode

Total characters: 717
Distinct characters: 120
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 71
Unique (%): 100.0%

Sample

1st row: ㈜건설방재기술연구원
2nd row: 기술사건축사사무소 미문건설기술연구원㈜
3rd row: ㈜건설품질시험원
4th row: (유)센이엔지건축안전진단[구: (유)센이엔지건축사사무소]
5th row: ㈜대한건설연구원
Value | Count | Frequency (%)
주식회사 | 13 | 13.8%
유한회사 | 3 | 3.2%
㈜한아 | 2 | 2.1%
누리종합건축사사무소 | 1 | 1.1%
㈜남지건설이앤씨 | 1 | 1.1%
서현이앤씨 | 1 | 1.1%
㈜영광이엔씨 | 1 | 1.1%
㈜에스이 | 1 | 1.1%
㈜성진건설기술단 | 1 | 1.1%
㈜대승엔지니어링(구:(유)대승엔지니어링 | 1 | 1.1%
Other values (69) | 69 | 73.4%

Most occurring characters

ValueCountFrequency (%)
39
 
5.4%
39
 
5.4%
32
 
4.5%
24
 
3.3%
) 23
 
3.2%
23
 
3.2%
( 23
 
3.2%
22
 
3.1%
22
 
3.1%
22
 
3.1%
Other values (110) 448
62.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 585 | 81.6%
Other Symbol | 39 | 5.4%
Close Punctuation | 29 | 4.0%
Open Punctuation | 29 | 4.0%
Space Separator | 23 | 3.2%
Other Punctuation | 8 | 1.1%
Decimal Number | 4 | 0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
39
 
6.7%
32
 
5.5%
24
 
4.1%
22
 
3.8%
22
 
3.8%
22
 
3.8%
22
 
3.8%
21
 
3.6%
18
 
3.1%
18
 
3.1%
Other values (101) 345
59.0%
Close Punctuation
ValueCountFrequency (%)
) 23
79.3%
] 6
 
20.7%
Open Punctuation
ValueCountFrequency (%)
( 23
79.3%
[ 6
 
20.7%
Decimal Number
ValueCountFrequency (%)
0 2
50.0%
1 2
50.0%
Other Symbol
ValueCountFrequency (%)
39
100.0%
Space Separator
ValueCountFrequency (%)
23
100.0%
Other Punctuation
ValueCountFrequency (%)
: 8
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 624 | 87.0%
Common | 93 | 13.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
 
6.2%
39
 
6.2%
32
 
5.1%
24
 
3.8%
22
 
3.5%
22
 
3.5%
22
 
3.5%
22
 
3.5%
21
 
3.4%
18
 
2.9%
Other values (102) 363
58.2%
Common
ValueCountFrequency (%)
) 23
24.7%
23
24.7%
( 23
24.7%
: 8
 
8.6%
] 6
 
6.5%
[ 6
 
6.5%
0 2
 
2.2%
1 2
 
2.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 585 | 81.6%
ASCII | 93 | 13.0%
None | 39 | 5.4%

Most frequent character per block

None
ValueCountFrequency (%)
39
100.0%
Hangul
ValueCountFrequency (%)
39
 
6.7%
32
 
5.5%
24
 
4.1%
22
 
3.8%
22
 
3.8%
22
 
3.8%
22
 
3.8%
21
 
3.6%
18
 
3.1%
18
 
3.1%
Other values (101) 345
59.0%
ASCII
ValueCountFrequency (%)
) 23
24.7%
23
24.7%
( 23
24.7%
: 8
 
8.6%
] 6
 
6.5%
[ 6
 
6.5%
0 2
 
2.2%
1 2
 
2.2%

등록분야
Categorical

Distinct: 9
Distinct (%): 12.7%
Missing: 0
Missing (%): 0.0%
Memory size: 696.0 B

Length

Max length: 12
Median length: 11
Mean length: 6.0140845
Min length: 2

Unique

Unique: 3
Unique (%): 4.2%

Sample

1st row: 교량/터널,수리,건축
2nd row: 건축
3rd row: 교량/터널,수리
4th row: 건축
5th row: 교량/터널,수리

Common Values

Value | Count | Frequency (%)
교량/터널,수리 | 21 | 29.6%
교량/터널 | 20 | 28.2%
건축 | 17 | 23.9%
교량/터널,수리,건축 | 4 | 5.6%
교량/터널,건축 | 4 | 5.6%
교량/터널, 수리 | 2 | 2.8%
교량/터널,항만 | 1 | 1.4%
교량/터널,수리,항만 | 1 | 1.4%
교량/터널,수리,건축 | 1 | 1.4%
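
Several of the nine categories look like spelling variants of each other: "교량/터널, 수리" differs from "교량/터널,수리" only by a space, and "교량/터널,수리,건축" appears twice. A minimal normalization sketch follows. The counts are copied from the table above; the trailing space in the last key is an assumption, since the report shows only that the two labels are counted separately, and `normalize_fields` is a hypothetical helper name.

```python
from collections import Counter

# Counts copied from the Common Values table.
raw_counts = {
    "교량/터널,수리": 21,
    "교량/터널": 20,
    "건축": 17,
    "교량/터널,수리,건축": 4,
    "교량/터널,건축": 4,
    "교량/터널, 수리": 2,
    "교량/터널,항만": 1,
    "교량/터널,수리,항만": 1,
    "교량/터널,수리,건축 ": 1,  # ASSUMED trailing space; exact difference unknown
}


def normalize_fields(label: str) -> str:
    """Split on commas, trim whitespace around each field, and rejoin."""
    parts = [part.strip() for part in label.split(",")]
    return ",".join(part for part in parts if part)


# Merge counts of labels that normalize to the same string.
normalized = Counter()
for label, count in raw_counts.items():
    normalized[normalize_fields(label)] += count

for label, count in normalized.most_common():
    print(label, count)
```

Under that assumption the nine raw labels collapse to seven, with the total of 71 observations unchanged.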

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
교량/터널 | 22 | 30.1%
교량/터널,수리 | 21 | 28.8%
건축 | 17 | 23.3%
교량/터널,수리,건축 | 5 | 6.8%
교량/터널,건축 | 4 | 5.5%
수리 | 2 | 2.7%
교량/터널,항만 | 1 | 1.4%
교량/터널,수리,항만 | 1 | 1.4%

등록일자
Date

UNIQUE 

Distinct: 71
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 696.0 B
Minimum: 1998-11-09 00:00:00
Maximum: 2020-01-23 00:00:00
Histogram with fixed size bins (bins=50)

Interactions


Correlations

Variable | 연번 | 등록번호 | 업 체 명 | 등록분야 | 등록일자
연번 | 1.000 | 0.869 | 1.000 | 0.455 | 1.000
등록번호 | 0.869 | 1.000 | 1.000 | 0.569 | 1.000
업 체 명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
등록분야 | 0.455 | 0.569 | 1.000 | 1.000 | 1.000
등록일자 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000

Variable | 연번 | 등록번호 | 등록분야
연번 | 1.000 | 0.836 | 0.206
등록번호 | 0.836 | 1.000 | 0.314
등록분야 | 0.206 | 0.314 | 1.000
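
To give a feel for how a rank correlation between 연번 and 등록번호 behaves, the sketch below computes Spearman's rho on the 20 rows listed in the Sample section (the first 10 and last 10 records). This is an illustration only: it does not reproduce the coefficients in the tables, which are computed over all 71 rows, and this text does not state which correlation measure each table uses. The helper names are hypothetical.

```python
def ranks(values):
    """Return 1-based ranks; assumes all values are distinct (no ties)."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result


def spearman_no_ties(x, y):
    """Spearman's rho via 1 - 6 * sum(d^2) / (n * (n^2 - 1)), valid without ties."""
    n = len(x)
    d_squared = sum((a - b) ** 2 for a, b in zip(ranks(x), ranks(y)))
    return 1 - 6 * d_squared / (n * (n * n - 1))


# 연번 and 등록번호 for the 20 rows shown in the Sample section.
seq = list(range(1, 11)) + list(range(62, 72))
reg = [191, 242, 1, 4, 6, 7, 8, 11, 12, 14,
       85, 86, 87, 88, 89, 90, 91, 92, 93, 94]

rho_sample = spearman_no_ties(seq, reg)
# Dropping the first two rows (등록번호 191 and 242) leaves a monotonic relation.
rho_without_first_two = spearman_no_ties(seq[2:], reg[2:])

print(round(rho_sample, 3), rho_without_first_two)
```

On these rows the two early registrations with unusually large numbers (191 and 242) are what pull the coefficient below 1; without them the ordering of the two columns agrees exactly.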

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

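First rows
Index | 연번 | 등록번호 | 업 체 명 | 등록분야 | 등록일자
0 | 1 | 191 | ㈜건설방재기술연구원 | 교량/터널,수리,건축 | 1998-11-09
1 | 2 | 242 | 기술사건축사사무소 미문건설기술연구원㈜ | 건축 | 2000-04-08
2 | 3 | 1 | ㈜건설품질시험원 | 교량/터널,수리 | 2003-04-03
3 | 4 | 4 | (유)센이엔지건축안전진단[구: (유)센이엔지건축사사무소] | 건축 | 2005-02-23
4 | 5 | 6 | ㈜대한건설연구원 | 교량/터널,수리 | 2005-06-22
5 | 6 | 7 | ㈜대들보구조안전기술단 | 교량/터널,건축 | 2006-08-09
6 | 7 | 8 | ㈜한국건설기술공사 | 교량/터널,수리 | 2007-03-09
7 | 8 | 11 | ㈜세종건설기술 | 교량/터널,수리 | 2009-05-11
8 | 9 | 12 | (유)쎈구조엔지니어링 | 건축 | 2009-05-28
9 | 10 | 14 | 제이씨엔㈜ | 교량/터널 | 2010-11-11

Last rows
Index | 연번 | 등록번호 | 업 체 명 | 등록분야 | 등록일자
61 | 62 | 85 | 1010건축사사무소 | 건축 | 2019-03-11
62 | 63 | 86 | (유)영화이엔지 | 교량/터널 | 2019-05-07
63 | 64 | 87 | 주식회사 신화기술 | 건축 | 2019-06-10
64 | 65 | 88 | 유한회사 금강기술 | 건축 | 2019-06-17
65 | 66 | 89 | 유한회사 라온건설기술사사무소 | 건축 | 2019-07-18
66 | 67 | 90 | 주식회사 온길 | 교량/터널 | 2019-07-29
67 | 68 | 91 | 선인건축사사무소 | 건축 | 2019-08-26
68 | 69 | 92 | 태안특수건설㈜ | 교량/터널 | 2019-11-28
69 | 70 | 93 | (유)장원종합건축사사무소 | 건축 | 2019-12-03
70 | 71 | 94 | (유)큰길이엔지 | 교량/터널 | 2020-01-23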