Amino acid dipeptide frequency for Gordonia sp. NB4-1Y

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
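As a rough illustration of what such a calculation involves, the sketch below counts adjacent residue pairs in a set of protein sequences and reports each pair's share in per mille. It assumes overlapping pairs that never span two proteins and one-letter sequence input. The function name `dipeptide_per_mille` and the toy sequences are hypothetical, and the page does not state how its own values were computed.

```python
from collections import Counter

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumption: dipeptides are overlapping adjacent pairs, counted
    within each protein only (no pairs across protein boundaries).
    """
    counts = Counter()
    for seq in sequences:
        # zip(seq, seq[1:]) yields every adjacent residue pair
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (not real Gordonia proteins)
toy = ["MAAL", "AAG"]
freqs = dipeptide_per_mille(toy)
```

With real data, `sequences` would hold every protein of the proteome, for example read from a FASTA file.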

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
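The arithmetic behind these numbers can be sketched in a few lines: the count of possible dipeptides, the uniform expectation in per mille, and a comparison of table values against it. The helper names `to_fraction` and `classify` are hypothetical, and treating exactly 2.5‰ as the threshold for the red/blue marking is an assumption based on the description above. The three observed values are taken from the table.

```python
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids

n_pairs = len(list(product(STANDARD, repeat=2)))            # 20 * 20
n_pairs_xaa = len(list(product(STANDARD + "X", repeat=2)))  # 21 * 21

# Uniform expectation: each dipeptide gets 1/400 of the total, in per mille
expected_per_mille = 1000.0 / n_pairs

def to_fraction(per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return per_mille * 1e-3

def classify(per_mille, expected=expected_per_mille):
    """Label a value relative to the uniform expectation (assumed threshold)."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "as expected"

# Values copied from the table (in per mille)
observed = {"AlaAla": 18.645, "CysCys": 0.094, "AspIle": 2.725}
labels = {name: classify(value) for name, value in observed.items()}
```

A baseline built from the single amino acid frequencies would give a different expectation for each dipeptide. The uniform 1/400 used here simply mirrors the 0.25% figure quoted in the text.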

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 18.645 ± 0.16
AlaCys: 0.943 ± 0.029
AlaAsp: 9.778 ± 0.096
AlaGlu: 7.567 ± 0.097
AlaPhe: 3.468 ± 0.056
AlaGly: 11.927 ± 0.104
AlaHis: 2.743 ± 0.048
AlaIle: 5.663 ± 0.071
AlaLys: 2.633 ± 0.055
AlaLeu: 12.049 ± 0.119
AlaMet: 2.676 ± 0.044
AlaAsn: 2.289 ± 0.042
AlaPro: 6.025 ± 0.091
AlaGln: 3.784 ± 0.059
AlaArg: 8.626 ± 0.094
AlaSer: 5.919 ± 0.074
AlaThr: 7.853 ± 0.083
AlaVal: 11.31 ± 0.124
AlaTrp: 1.575 ± 0.038
AlaTyr: 2.238 ± 0.045
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.972 ± 0.032
CysCys: 0.094 ± 0.011
CysAsp: 0.481 ± 0.02
CysGlu: 0.348 ± 0.018
CysPhe: 0.195 ± 0.012
CysGly: 0.914 ± 0.026
CysHis: 0.182 ± 0.012
CysIle: 0.258 ± 0.014
CysLys: 0.115 ± 0.009
CysLeu: 0.631 ± 0.02
CysMet: 0.12 ± 0.009
CysAsn: 0.158 ± 0.011
CysPro: 0.427 ± 0.02
CysGln: 0.153 ± 0.01
CysArg: 0.589 ± 0.023
CysSer: 0.477 ± 0.02
CysThr: 0.5 ± 0.019
CysVal: 0.637 ± 0.023
CysTrp: 0.124 ± 0.012
CysTyr: 0.151 ± 0.009
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.585 ± 0.1
AspCys: 0.398 ± 0.016
AspAsp: 5.336 ± 0.081
AspGlu: 4.314 ± 0.068
AspPhe: 1.804 ± 0.04
AspGly: 6.302 ± 0.071
AspHis: 1.763 ± 0.04
AspIle: 2.725 ± 0.048
AspLys: 1.268 ± 0.037
AspLeu: 6.758 ± 0.07
AspMet: 1.039 ± 0.029
AspAsn: 1.216 ± 0.037
AspPro: 4.8 ± 0.068
AspGln: 1.722 ± 0.038
AspArg: 5.028 ± 0.06
AspSer: 2.913 ± 0.046
AspThr: 3.665 ± 0.053
AspVal: 5.534 ± 0.074
AspTrp: 0.956 ± 0.024
AspTyr: 1.29 ± 0.034
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.545 ± 0.076
GluCys: 0.358 ± 0.016
GluAsp: 2.207 ± 0.043
GluGlu: 2.348 ± 0.046
GluPhe: 1.859 ± 0.036
GluGly: 3.318 ± 0.064
GluHis: 1.502 ± 0.035
GluIle: 3.112 ± 0.051
GluLys: 1.408 ± 0.041
GluLeu: 6.209 ± 0.071
GluMet: 1.15 ± 0.029
GluAsn: 1.184 ± 0.032
GluPro: 2.819 ± 0.057
GluGln: 2.145 ± 0.047
GluArg: 4.341 ± 0.071
GluSer: 2.981 ± 0.046
GluThr: 2.857 ± 0.048
GluVal: 4.65 ± 0.065
GluTrp: 0.8 ± 0.024
GluTyr: 1.207 ± 0.03
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.095 ± 0.056
PheCys: 0.299 ± 0.016
PheAsp: 2.509 ± 0.048
PheGlu: 1.458 ± 0.033
PhePhe: 0.988 ± 0.032
PheGly: 3.559 ± 0.059
PheHis: 0.615 ± 0.022
PheIle: 1.103 ± 0.033
PheLys: 0.441 ± 0.019
PheLeu: 2.574 ± 0.048
PheMet: 0.476 ± 0.018
PheAsn: 0.629 ± 0.025
PhePro: 1.354 ± 0.033
PheGln: 0.578 ± 0.02
PheArg: 1.721 ± 0.037
PheSer: 1.575 ± 0.038
PheThr: 2.194 ± 0.04
PheVal: 2.625 ± 0.045
PheTrp: 0.426 ± 0.019
PheTyr: 0.686 ± 0.026
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.182 ± 0.096
GlyCys: 0.74 ± 0.025
GlyAsp: 5.319 ± 0.062
GlyGlu: 4.545 ± 0.07
GlyPhe: 3.182 ± 0.044
GlyGly: 7.816 ± 0.104
GlyHis: 2.211 ± 0.042
GlyIle: 4.511 ± 0.068
GlyLys: 2.296 ± 0.048
GlyLeu: 8.628 ± 0.071
GlyMet: 2.114 ± 0.039
GlyAsn: 1.86 ± 0.041
GlyPro: 4.526 ± 0.056
GlyGln: 2.647 ± 0.047
GlyArg: 6.585 ± 0.083
GlySer: 5.482 ± 0.067
GlyThr: 5.895 ± 0.07
GlyVal: 8.022 ± 0.086
GlyTrp: 1.544 ± 0.035
GlyTyr: 2.289 ± 0.043
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.484 ± 0.043
HisCys: 0.194 ± 0.012
HisAsp: 1.46 ± 0.039
HisGlu: 1.086 ± 0.028
HisPhe: 0.639 ± 0.021
HisGly: 2.139 ± 0.04
HisHis: 0.807 ± 0.028
HisIle: 0.925 ± 0.027
HisLys: 0.387 ± 0.016
HisLeu: 2.227 ± 0.035
HisMet: 0.407 ± 0.018
HisAsn: 0.45 ± 0.019
HisPro: 1.697 ± 0.044
HisGln: 0.638 ± 0.021
HisArg: 2.019 ± 0.047
HisSer: 1.105 ± 0.032
HisThr: 1.33 ± 0.038
HisVal: 1.763 ± 0.04
HisTrp: 0.351 ± 0.016
HisTyr: 0.498 ± 0.022
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.907 ± 0.076
IleCys: 0.4 ± 0.017
IleAsp: 3.668 ± 0.059
IleGlu: 2.63 ± 0.05
IlePhe: 1.077 ± 0.026
IleGly: 5.133 ± 0.065
IleHis: 0.876 ± 0.028
IleIle: 1.689 ± 0.039
IleLys: 0.825 ± 0.028
IleLeu: 3.597 ± 0.059
IleMet: 0.626 ± 0.022
IleAsn: 1.063 ± 0.029
IlePro: 2.603 ± 0.05
IleGln: 0.854 ± 0.028
IleArg: 2.933 ± 0.044
IleSer: 2.504 ± 0.042
IleThr: 3.233 ± 0.05
IleVal: 4.358 ± 0.063
IleTrp: 0.503 ± 0.022
IleTyr: 0.849 ± 0.029
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.557 ± 0.057
LysCys: 0.106 ± 0.009
LysAsp: 1.065 ± 0.029
LysGlu: 0.904 ± 0.032
LysPhe: 0.562 ± 0.022
LysGly: 1.573 ± 0.039
LysHis: 0.436 ± 0.016
LysIle: 1.073 ± 0.03
LysLys: 0.73 ± 0.032
LysLeu: 1.875 ± 0.044
LysMet: 0.447 ± 0.02
LysAsn: 0.528 ± 0.02
LysPro: 1.196 ± 0.036
LysGln: 0.695 ± 0.024
LysArg: 1.505 ± 0.033
LysSer: 1.213 ± 0.031
LysThr: 1.374 ± 0.036
LysVal: 1.966 ± 0.039
LysTrp: 0.281 ± 0.015
LysTyr: 0.468 ± 0.021
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.154 ± 0.098
LeuCys: 0.756 ± 0.026
LeuAsp: 6.922 ± 0.077
LeuGlu: 4.238 ± 0.06
LeuPhe: 2.692 ± 0.042
LeuGly: 8.801 ± 0.082
LeuHis: 1.953 ± 0.038
LeuIle: 4.427 ± 0.073
LeuLys: 1.628 ± 0.039
LeuLeu: 8.826 ± 0.11
LeuMet: 1.658 ± 0.037
LeuAsn: 1.788 ± 0.042
LeuPro: 5.327 ± 0.062
LeuGln: 2.217 ± 0.04
LeuArg: 7.123 ± 0.084
LeuSer: 5.374 ± 0.062
LeuThr: 6.601 ± 0.073
LeuVal: 8.342 ± 0.094
LeuTrp: 1.129 ± 0.027
LeuTyr: 1.596 ± 0.034
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.435 ± 0.044
MetCys: 0.17 ± 0.01
MetAsp: 0.906 ± 0.028
MetGlu: 0.689 ± 0.021
MetPhe: 0.599 ± 0.022
MetGly: 1.425 ± 0.033
MetHis: 0.409 ± 0.02
MetIle: 1.018 ± 0.024
MetLys: 0.413 ± 0.017
MetLeu: 1.875 ± 0.035
MetMet: 0.44 ± 0.019
MetAsn: 0.496 ± 0.019
MetPro: 1.166 ± 0.033
MetGln: 0.53 ± 0.021
MetArg: 1.524 ± 0.03
MetSer: 1.798 ± 0.035
MetThr: 1.925 ± 0.035
MetVal: 1.721 ± 0.04
MetTrp: 0.268 ± 0.014
MetTyr: 0.328 ± 0.015
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.435 ± 0.054
AsnCys: 0.161 ± 0.011
AsnAsp: 1.103 ± 0.03
AsnGlu: 0.889 ± 0.028
AsnPhe: 0.601 ± 0.025
AsnGly: 2.039 ± 0.041
AsnHis: 0.469 ± 0.021
AsnIle: 0.946 ± 0.029
AsnLys: 0.484 ± 0.022
AsnLeu: 1.951 ± 0.039
AsnMet: 0.375 ± 0.019
AsnAsn: 0.507 ± 0.022
AsnPro: 1.695 ± 0.036
AsnGln: 0.599 ± 0.022
AsnArg: 1.499 ± 0.038
AsnSer: 1.132 ± 0.03
AsnThr: 1.283 ± 0.037
AsnVal: 1.608 ± 0.036
AsnTrp: 0.34 ± 0.015
AsnTyr: 0.485 ± 0.02
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.303 ± 0.091
ProCys: 0.262 ± 0.015
ProAsp: 4.694 ± 0.066
ProGlu: 3.6 ± 0.063
ProPhe: 1.563 ± 0.037
ProGly: 5.487 ± 0.084
ProHis: 1.199 ± 0.031
ProIle: 2.357 ± 0.036
ProLys: 1.135 ± 0.03
ProLeu: 4.491 ± 0.063
ProMet: 1.16 ± 0.032
ProAsn: 1.205 ± 0.035
ProPro: 2.822 ± 0.06
ProGln: 1.615 ± 0.036
ProArg: 3.456 ± 0.051
ProSer: 2.97 ± 0.049
ProThr: 3.807 ± 0.064
ProVal: 5.042 ± 0.075
ProTrp: 0.812 ± 0.024
ProTyr: 1.047 ± 0.026
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.335 ± 0.058
GlnCys: 0.182 ± 0.012
GlnAsp: 1.039 ± 0.03
GlnGlu: 1.068 ± 0.033
GlnPhe: 0.878 ± 0.027
GlnGly: 1.922 ± 0.034
GlnHis: 0.604 ± 0.02
GlnIle: 1.677 ± 0.038
GlnLys: 0.632 ± 0.025
GlnLeu: 2.86 ± 0.052
GlnMet: 0.662 ± 0.023
GlnAsn: 0.637 ± 0.025
GlnPro: 1.56 ± 0.04
GlnGln: 1.123 ± 0.034
GlnArg: 2.246 ± 0.039
GlnSer: 1.356 ± 0.034
GlnThr: 1.703 ± 0.033
GlnVal: 2.616 ± 0.048
GlnTrp: 0.513 ± 0.021
GlnTyr: 0.588 ± 0.022
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.04 ± 0.091
ArgCys: 0.538 ± 0.02
ArgAsp: 4.3 ± 0.065
ArgGlu: 4.019 ± 0.049
ArgPhe: 2.406 ± 0.045
ArgGly: 5.046 ± 0.062
ArgHis: 1.831 ± 0.038
ArgIle: 3.696 ± 0.06
ArgLys: 1.627 ± 0.04
ArgLeu: 7.076 ± 0.077
ArgMet: 1.884 ± 0.038
ArgAsn: 1.618 ± 0.038
ArgPro: 4.066 ± 0.057
ArgGln: 2.004 ± 0.041
ArgArg: 6.831 ± 0.094
ArgSer: 4.334 ± 0.056
ArgThr: 4.633 ± 0.065
ArgVal: 5.632 ± 0.07
ArgTrp: 1.311 ± 0.035
ArgTyr: 1.756 ± 0.036
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.155 ± 0.081
SerCys: 0.392 ± 0.018
SerAsp: 3.35 ± 0.048
SerGlu: 2.571 ± 0.045
SerPhe: 1.61 ± 0.037
SerGly: 5.969 ± 0.07
SerHis: 1.034 ± 0.028
SerIle: 2.326 ± 0.041
SerLys: 1.098 ± 0.033
SerLeu: 4.779 ± 0.061
SerMet: 1.374 ± 0.031
SerAsn: 1.039 ± 0.029
SerPro: 3.092 ± 0.046
SerGln: 1.341 ± 0.035
SerArg: 3.732 ± 0.059
SerSer: 3.46 ± 0.06
SerThr: 3.91 ± 0.059
SerVal: 4.931 ± 0.063
SerTrp: 0.891 ± 0.024
SerTyr: 1.093 ± 0.03
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.396 ± 0.077
ThrCys: 0.432 ± 0.017
ThrAsp: 4.675 ± 0.074
ThrGlu: 3.582 ± 0.051
ThrPhe: 1.857 ± 0.035
ThrGly: 6.499 ± 0.078
ThrHis: 1.356 ± 0.033
ThrIle: 2.875 ± 0.044
ThrLys: 1.228 ± 0.034
ThrLeu: 5.664 ± 0.065
ThrMet: 1.18 ± 0.031
ThrAsn: 1.224 ± 0.032
ThrPro: 4.287 ± 0.068
ThrGln: 1.48 ± 0.039
ThrArg: 4.078 ± 0.063
ThrSer: 3.64 ± 0.057
ThrThr: 4.638 ± 0.074
ThrVal: 6.224 ± 0.063
ThrTrp: 0.902 ± 0.027
ThrTyr: 1.371 ± 0.032
ThrXaa: 0.0 ± 0.0
Val
ValAla: 11.614 ± 0.131
ValCys: 0.75 ± 0.024
ValAsp: 6.52 ± 0.066
ValGlu: 4.538 ± 0.058
ValPhe: 2.647 ± 0.048
ValGly: 7.523 ± 0.086
ValHis: 1.849 ± 0.037
ValIle: 4.532 ± 0.057
ValLys: 1.521 ± 0.042
ValLeu: 8.715 ± 0.095
ValMet: 1.685 ± 0.037
ValAsn: 1.855 ± 0.037
ValPro: 4.713 ± 0.067
ValGln: 1.884 ± 0.043
ValArg: 5.834 ± 0.062
ValSer: 4.841 ± 0.065
ValThr: 6.033 ± 0.072
ValVal: 9.046 ± 0.105
ValTrp: 1.091 ± 0.029
ValTyr: 1.585 ± 0.032
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.558 ± 0.037
TrpCys: 0.149 ± 0.011
TrpAsp: 0.783 ± 0.025
TrpGlu: 0.613 ± 0.02
TrpPhe: 0.567 ± 0.023
TrpGly: 1.056 ± 0.032
TrpHis: 0.324 ± 0.016
TrpIle: 0.722 ± 0.023
TrpLys: 0.318 ± 0.014
TrpLeu: 1.548 ± 0.038
TrpMet: 0.363 ± 0.017
TrpAsn: 0.409 ± 0.016
TrpPro: 0.715 ± 0.023
TrpGln: 0.548 ± 0.018
TrpArg: 1.17 ± 0.032
TrpSer: 0.928 ± 0.027
TrpThr: 0.924 ± 0.029
TrpVal: 1.127 ± 0.028
TrpTrp: 0.373 ± 0.019
TrpTyr: 0.342 ± 0.018
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.314 ± 0.039
TyrCys: 0.2 ± 0.015
TyrAsp: 1.3 ± 0.033
TyrGlu: 0.995 ± 0.026
TyrPhe: 0.712 ± 0.022
TyrGly: 1.944 ± 0.04
TyrHis: 0.445 ± 0.019
TyrIle: 0.696 ± 0.025
TyrLys: 0.367 ± 0.018
TyrLeu: 2.219 ± 0.045
TyrMet: 0.296 ± 0.014
TyrAsn: 0.448 ± 0.017
TyrPro: 1.135 ± 0.035
TyrGln: 0.641 ± 0.021
TyrArg: 1.811 ± 0.038
TyrSer: 1.147 ± 0.031
TyrThr: 1.231 ± 0.029
TyrVal: 1.624 ± 0.038
TyrTrp: 0.344 ± 0.018
TyrTyr: 0.479 ± 0.021
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4263 proteins (1343207 amino acids)
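For readers who want to work with the listing above programmatically, the following sketch parses entries of the form `AlaAla: 18.645 ± 0.16` into a dictionary. The regular expression and the function name `parse_entries` are illustrative assumptions, not part of Proteome-pI. The pattern also tolerates a value fused in front of the dipeptide name.

```python
import re

# Matches entries such as "AlaAla: 18.645 ± 0.16"; any text directly
# before the six-letter dipeptide name is ignored.
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}): ([0-9.]+) ± ([0-9.]+)")

def parse_entries(text):
    """Return {(first, second): (per_mille, error)} for each entry found."""
    table = {}
    for match in ENTRY.finditer(text):
        first, second, value, error = match.groups()
        table[(first, second)] = (float(value), float(error))
    return table

# Three entries copied from the table
sample = """AlaAla: 18.645 ± 0.16
AlaCys: 0.943 ± 0.029
CysAla: 0.972 ± 0.032"""
table = parse_entries(sample)

# Dipeptides are ordered, so AlaCys and CysAla are separate entries
ala_cys = table[("Ala", "Cys")][0]
cys_ala = table[("Cys", "Ala")][0]
```

Keying on the ordered pair keeps the distinction between, for example, AlaCys (0.943‰) and CysAla (0.972‰).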

Note: The error was estimated by bootstrapping (×100) at the protein level.
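A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed on each resample, and the spread of those estimates is taken as the error. Using the standard deviation as the reported error, counting overlapping pairs, and the names `pair_per_mille` and `bootstrap_error` are all assumptions; the page does not describe its exact procedure.

```python
import random
from collections import Counter
from statistics import stdev

def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(resample, pair))
    return stdev(estimates)

# Toy proteome (not real Gordonia proteins)
toy = ["MAAL", "AAG", "MKV", "GAAV", "LLK"]
error = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins contribute to the estimated spread.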

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski