Amino acid dipeptide frequency for Streptomyces rubrolavendulae

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
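The counting described above can be sketched in Python. This is a minimal illustration and not the code used to build this page: it assumes one-letter sequences, overlapping two-residue windows, counts pooled over all proteins, and any non-standard letter mapped to X (Xaa). The names `dipeptide_permille` and `classify` are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows: a protein of length n contributes n - 1 dipeptides.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Uniform expectation for 400 equally likely dipeptides: 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400

def classify(freq_permille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"

# Toy input (not real proteome data): "B" is non-standard and is mapped to X.
toy = ["MAAL", "AAGB"]
freqs = dipeptide_permille(toy)
print(freqs["AA"], classify(freqs["AA"]))
```

With the values from the table, `classify(25.396)` (AlaAla) returns "over" and `classify(0.082)` (CysCys) returns "under".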

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 25.396 ± 0.219
AlaCys: 1.055 ± 0.025
AlaAsp: 8.733 ± 0.093
AlaGlu: 9.28 ± 0.115
AlaPhe: 3.482 ± 0.049
AlaGly: 15.711 ± 0.129
AlaHis: 3.179 ± 0.05
AlaIle: 2.789 ± 0.05
AlaLys: 2.509 ± 0.048
AlaLeu: 15.264 ± 0.133
AlaMet: 2.433 ± 0.045
AlaAsn: 1.677 ± 0.034
AlaPro: 9.89 ± 0.152
AlaGln: 3.449 ± 0.052
AlaArg: 12.327 ± 0.106
AlaSer: 5.762 ± 0.063
AlaThr: 6.741 ± 0.07
AlaVal: 13.568 ± 0.114
AlaTrp: 1.952 ± 0.034
AlaTyr: 3.013 ± 0.041
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.193 ± 0.031
CysCys: 0.082 ± 0.006
CysAsp: 0.452 ± 0.019
CysGlu: 0.365 ± 0.013
CysPhe: 0.202 ± 0.01
CysGly: 0.994 ± 0.026
CysHis: 0.183 ± 0.011
CysIle: 0.121 ± 0.008
CysLys: 0.093 ± 0.008
CysLeu: 0.71 ± 0.02
CysMet: 0.105 ± 0.008
CysAsn: 0.115 ± 0.008
CysPro: 0.501 ± 0.015
CysGln: 0.138 ± 0.01
CysArg: 0.646 ± 0.018
CysSer: 0.371 ± 0.016
CysThr: 0.443 ± 0.016
CysVal: 0.689 ± 0.022
CysTrp: 0.122 ± 0.008
CysTyr: 0.133 ± 0.009
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.233 ± 0.078
AspCys: 0.415 ± 0.014
AspAsp: 3.359 ± 0.052
AspGlu: 3.842 ± 0.05
AspPhe: 1.454 ± 0.029
AspGly: 6.936 ± 0.076
AspHis: 1.328 ± 0.03
AspIle: 1.54 ± 0.034
AspLys: 0.956 ± 0.029
AspLeu: 6.017 ± 0.064
AspMet: 0.785 ± 0.023
AspAsn: 0.758 ± 0.023
AspPro: 4.8 ± 0.063
AspGln: 1.304 ± 0.029
AspArg: 5.369 ± 0.065
AspSer: 1.991 ± 0.042
AspThr: 3.041 ± 0.045
AspVal: 4.62 ± 0.049
AspTrp: 0.973 ± 0.026
AspTyr: 0.94 ± 0.028
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.255 ± 0.111
GluCys: 0.357 ± 0.013
GluAsp: 2.862 ± 0.042
GluGlu: 3.731 ± 0.057
GluPhe: 1.26 ± 0.028
GluGly: 4.809 ± 0.061
GluHis: 1.433 ± 0.028
GluIle: 1.859 ± 0.039
GluLys: 1.334 ± 0.034
GluLeu: 6.449 ± 0.071
GluMet: 0.846 ± 0.021
GluAsn: 0.911 ± 0.025
GluPro: 3.667 ± 0.051
GluGln: 1.872 ± 0.036
GluArg: 6.072 ± 0.069
GluSer: 2.254 ± 0.038
GluThr: 2.743 ± 0.039
GluVal: 4.588 ± 0.067
GluTrp: 0.737 ± 0.021
GluTyr: 1.071 ± 0.024
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.547 ± 0.045
PheCys: 0.251 ± 0.01
PheAsp: 1.766 ± 0.034
PheGlu: 1.255 ± 0.026
PhePhe: 0.786 ± 0.026
PheGly: 2.769 ± 0.043
PheHis: 0.606 ± 0.018
PheIle: 0.56 ± 0.02
PheLys: 0.415 ± 0.017
PheLeu: 2.441 ± 0.039
PheMet: 0.353 ± 0.014
PheAsn: 0.463 ± 0.016
PhePro: 1.302 ± 0.032
PheGln: 0.6 ± 0.018
PheArg: 1.799 ± 0.033
PheSer: 1.233 ± 0.025
PheThr: 1.87 ± 0.03
PheVal: 2.043 ± 0.035
PheTrp: 0.394 ± 0.016
PheTyr: 0.489 ± 0.016
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 14.091 ± 0.148
GlyCys: 0.854 ± 0.024
GlyAsp: 5.554 ± 0.062
GlyGlu: 5.482 ± 0.061
GlyPhe: 2.685 ± 0.038
GlyGly: 11.284 ± 0.147
GlyHis: 2.402 ± 0.038
GlyIle: 2.799 ± 0.043
GlyLys: 2.055 ± 0.041
GlyLeu: 9.395 ± 0.087
GlyMet: 1.973 ± 0.034
GlyAsn: 1.51 ± 0.034
GlyPro: 6.703 ± 0.099
GlyGln: 2.374 ± 0.045
GlyArg: 9.342 ± 0.088
GlySer: 5.071 ± 0.07
GlyThr: 6.932 ± 0.08
GlyVal: 7.909 ± 0.071
GlyTrp: 1.68 ± 0.034
GlyTyr: 2.216 ± 0.031
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.843 ± 0.045
HisCys: 0.204 ± 0.011
HisAsp: 1.301 ± 0.028
HisGlu: 1.142 ± 0.024
HisPhe: 0.582 ± 0.019
HisGly: 2.586 ± 0.044
HisHis: 0.703 ± 0.022
HisIle: 0.536 ± 0.016
HisLys: 0.3 ± 0.015
HisLeu: 2.362 ± 0.04
HisMet: 0.314 ± 0.013
HisAsn: 0.323 ± 0.014
HisPro: 2.005 ± 0.036
HisGln: 0.597 ± 0.021
HisArg: 2.292 ± 0.046
HisSer: 0.837 ± 0.021
HisThr: 1.342 ± 0.028
HisVal: 1.695 ± 0.032
HisTrp: 0.339 ± 0.013
HisTyr: 0.422 ± 0.014
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.976 ± 0.057
IleCys: 0.221 ± 0.013
IleAsp: 1.759 ± 0.037
IleGlu: 1.625 ± 0.038
IlePhe: 0.52 ± 0.02
IleGly: 2.954 ± 0.048
IleHis: 0.496 ± 0.017
IleIle: 0.671 ± 0.022
IleLys: 0.583 ± 0.019
IleLeu: 1.913 ± 0.034
IleMet: 0.368 ± 0.016
IleAsn: 0.513 ± 0.019
IlePro: 1.419 ± 0.03
IleGln: 0.549 ± 0.018
IleArg: 1.902 ± 0.038
IleSer: 1.167 ± 0.028
IleThr: 1.782 ± 0.039
IleVal: 2.263 ± 0.036
IleTrp: 0.276 ± 0.013
IleTyr: 0.397 ± 0.014
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.654 ± 0.049
LysCys: 0.094 ± 0.006
LysAsp: 1.134 ± 0.032
LysGlu: 1.014 ± 0.026
LysPhe: 0.354 ± 0.015
LysGly: 1.617 ± 0.038
LysHis: 0.345 ± 0.015
LysIle: 0.681 ± 0.022
LysLys: 0.669 ± 0.028
LysLeu: 1.688 ± 0.039
LysMet: 0.323 ± 0.015
LysAsn: 0.439 ± 0.02
LysPro: 1.158 ± 0.027
LysGln: 0.516 ± 0.019
LysArg: 1.321 ± 0.033
LysSer: 0.883 ± 0.025
LysThr: 1.098 ± 0.029
LysVal: 1.635 ± 0.038
LysTrp: 0.201 ± 0.011
LysTyr: 0.384 ± 0.016
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 15.75 ± 0.134
LeuCys: 0.815 ± 0.022
LeuAsp: 6.585 ± 0.057
LeuGlu: 4.744 ± 0.053
LeuPhe: 2.389 ± 0.045
LeuGly: 9.175 ± 0.082
LeuHis: 2.204 ± 0.041
LeuIle: 2.538 ± 0.041
LeuLys: 1.731 ± 0.037
LeuLeu: 11.376 ± 0.119
LeuMet: 1.527 ± 0.033
LeuAsn: 1.383 ± 0.03
LeuPro: 6.608 ± 0.071
LeuGln: 1.897 ± 0.032
LeuArg: 9.187 ± 0.082
LeuSer: 4.671 ± 0.056
LeuThr: 6.346 ± 0.069
LeuVal: 9.03 ± 0.082
LeuTrp: 1.25 ± 0.033
LeuTyr: 1.832 ± 0.034
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.235 ± 0.036
MetCys: 0.134 ± 0.009
MetAsp: 0.847 ± 0.024
MetGlu: 0.715 ± 0.023
MetPhe: 0.414 ± 0.015
MetGly: 1.264 ± 0.029
MetHis: 0.307 ± 0.012
MetIle: 0.542 ± 0.02
MetLys: 0.368 ± 0.017
MetLeu: 1.553 ± 0.032
MetMet: 0.247 ± 0.014
MetAsn: 0.371 ± 0.015
MetPro: 1.071 ± 0.023
MetGln: 0.352 ± 0.017
MetArg: 1.458 ± 0.027
MetSer: 1.2 ± 0.026
MetThr: 1.438 ± 0.028
MetVal: 1.242 ± 0.029
MetTrp: 0.2 ± 0.011
MetTyr: 0.303 ± 0.013
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.914 ± 0.042
AsnCys: 0.138 ± 0.009
AsnAsp: 0.776 ± 0.022
AsnGlu: 0.7 ± 0.021
AsnPhe: 0.397 ± 0.015
AsnGly: 1.616 ± 0.034
AsnHis: 0.338 ± 0.014
AsnIle: 0.491 ± 0.017
AsnLys: 0.327 ± 0.016
AsnLeu: 1.366 ± 0.035
AsnMet: 0.266 ± 0.014
AsnAsn: 0.324 ± 0.015
AsnPro: 1.162 ± 0.027
AsnGln: 0.426 ± 0.015
AsnArg: 1.162 ± 0.027
AsnSer: 0.647 ± 0.019
AsnThr: 0.922 ± 0.028
AsnVal: 1.22 ± 0.029
AsnTrp: 0.235 ± 0.012
AsnTyr: 0.332 ± 0.014
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 11.316 ± 0.153
ProCys: 0.39 ± 0.016
ProAsp: 4.781 ± 0.06
ProGlu: 4.698 ± 0.055
ProPhe: 1.459 ± 0.032
ProGly: 8.351 ± 0.092
ProHis: 1.585 ± 0.032
ProIle: 1.006 ± 0.026
ProLys: 1.011 ± 0.029
ProLeu: 5.575 ± 0.061
ProMet: 0.975 ± 0.023
ProAsn: 0.752 ± 0.023
ProPro: 4.793 ± 0.097
ProGln: 1.596 ± 0.049
ProArg: 4.802 ± 0.07
ProSer: 3.373 ± 0.049
ProThr: 3.052 ± 0.049
ProVal: 5.913 ± 0.063
ProTrp: 0.925 ± 0.023
ProTyr: 1.564 ± 0.037
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.437 ± 0.052
GlnCys: 0.139 ± 0.01
GlnAsp: 1.206 ± 0.03
GlnGlu: 1.286 ± 0.03
GlnPhe: 0.528 ± 0.021
GlnGly: 2.082 ± 0.041
GlnHis: 0.566 ± 0.017
GlnIle: 0.834 ± 0.025
GlnLys: 0.517 ± 0.019
GlnLeu: 2.471 ± 0.04
GlnMet: 0.377 ± 0.015
GlnAsn: 0.405 ± 0.018
GlnPro: 1.595 ± 0.049
GlnGln: 0.939 ± 0.031
GlnArg: 2.194 ± 0.039
GlnSer: 0.98 ± 0.025
GlnThr: 1.114 ± 0.027
GlnVal: 2.07 ± 0.038
GlnTrp: 0.361 ± 0.015
GlnTyr: 0.546 ± 0.018
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 11.638 ± 0.12
ArgCys: 0.647 ± 0.02
ArgAsp: 4.9 ± 0.064
ArgGlu: 5.209 ± 0.055
ArgPhe: 2.372 ± 0.037
ArgGly: 6.987 ± 0.073
ArgHis: 2.298 ± 0.038
ArgIle: 2.792 ± 0.042
ArgLys: 1.465 ± 0.032
ArgLeu: 9.535 ± 0.087
ArgMet: 1.795 ± 0.028
ArgAsn: 1.205 ± 0.027
ArgPro: 6.169 ± 0.084
ArgGln: 2.154 ± 0.035
ArgArg: 9.053 ± 0.094
ArgSer: 3.978 ± 0.049
ArgThr: 5.657 ± 0.055
ArgVal: 6.751 ± 0.06
ArgTrp: 1.442 ± 0.034
ArgTyr: 1.897 ± 0.033
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.392 ± 0.069
SerCys: 0.349 ± 0.016
SerAsp: 2.177 ± 0.038
SerGlu: 2.074 ± 0.036
SerPhe: 1.294 ± 0.029
SerGly: 5.954 ± 0.068
SerHis: 0.901 ± 0.025
SerIle: 1.085 ± 0.027
SerLys: 0.802 ± 0.023
SerLeu: 4.357 ± 0.054
SerMet: 0.909 ± 0.024
SerAsn: 0.658 ± 0.02
SerPro: 3.095 ± 0.045
SerGln: 0.977 ± 0.028
SerArg: 3.49 ± 0.045
SerSer: 2.288 ± 0.048
SerThr: 2.62 ± 0.045
SerVal: 3.783 ± 0.047
SerTrp: 0.752 ± 0.021
SerTyr: 1.016 ± 0.026
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.35 ± 0.086
ThrCys: 0.394 ± 0.016
ThrAsp: 3.252 ± 0.045
ThrGlu: 2.961 ± 0.043
ThrPhe: 1.381 ± 0.031
ThrGly: 7.196 ± 0.076
ThrHis: 1.151 ± 0.03
ThrIle: 1.373 ± 0.026
ThrLys: 0.889 ± 0.023
ThrLeu: 5.305 ± 0.061
ThrMet: 0.811 ± 0.025
ThrAsn: 0.776 ± 0.02
ThrPro: 4.204 ± 0.058
ThrGln: 1.059 ± 0.029
ThrArg: 4.052 ± 0.047
ThrSer: 2.669 ± 0.043
ThrThr: 3.434 ± 0.056
ThrVal: 5.724 ± 0.067
ThrTrp: 0.819 ± 0.025
ThrTyr: 1.22 ± 0.034
ThrXaa: 0.0 ± 0.0
Val
ValAla: 11.391 ± 0.094
ValCys: 0.79 ± 0.019
ValAsp: 4.995 ± 0.059
ValGlu: 4.939 ± 0.056
ValPhe: 2.325 ± 0.038
ValGly: 6.588 ± 0.061
ValHis: 2.004 ± 0.037
ValIle: 2.309 ± 0.043
ValLys: 1.519 ± 0.034
ValLeu: 9.607 ± 0.086
ValMet: 1.347 ± 0.028
ValAsn: 1.502 ± 0.036
ValPro: 5.891 ± 0.068
ValGln: 1.797 ± 0.032
ValArg: 8.151 ± 0.088
ValSer: 3.964 ± 0.053
ValThr: 5.344 ± 0.064
ValVal: 8.316 ± 0.087
ValTrp: 1.198 ± 0.025
ValTyr: 1.645 ± 0.029
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.735 ± 0.038
TrpCys: 0.157 ± 0.01
TrpAsp: 0.778 ± 0.02
TrpGlu: 0.722 ± 0.019
TrpPhe: 0.487 ± 0.017
TrpGly: 1.026 ± 0.027
TrpHis: 0.318 ± 0.014
TrpIle: 0.444 ± 0.014
TrpLys: 0.32 ± 0.015
TrpLeu: 1.715 ± 0.038
TrpMet: 0.262 ± 0.014
TrpAsn: 0.353 ± 0.015
TrpPro: 0.806 ± 0.024
TrpGln: 0.519 ± 0.019
TrpArg: 1.403 ± 0.028
TrpSer: 0.835 ± 0.025
TrpThr: 1.006 ± 0.026
TrpVal: 0.925 ± 0.023
TrpTrp: 0.336 ± 0.014
TrpTyr: 0.352 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.864 ± 0.041
TyrCys: 0.171 ± 0.01
TyrAsp: 1.449 ± 0.035
TyrGlu: 1.327 ± 0.025
TyrPhe: 0.571 ± 0.016
TyrGly: 2.41 ± 0.043
TyrHis: 0.374 ± 0.013
TyrIle: 0.387 ± 0.014
TyrLys: 0.345 ± 0.014
TyrLeu: 1.937 ± 0.033
TyrMet: 0.244 ± 0.014
TyrAsn: 0.326 ± 0.014
TyrPro: 1.076 ± 0.022
TyrGln: 0.528 ± 0.022
TyrArg: 1.922 ± 0.036
TyrSer: 0.796 ± 0.023
TyrThr: 1.097 ± 0.024
TyrVal: 1.637 ± 0.032
TyrTrp: 0.309 ± 0.015
TyrTyr: 0.38 ± 0.014
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5418 proteins (1829293 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
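Protein-level bootstrapping can be sketched as follows. This is an illustration under stated assumptions, not the procedure used for this page: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of the 100 replicates, and the sample standard deviation across replicates is reported as the error (the page does not say which statistic it uses). The function names and the fixed seed are hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: the resampling unit is the whole protein, and the error is
    the sample standard deviation over the bootstrap replicates.
    """
    rng = random.Random(seed)
    # Count once per protein so each replicate only sums precomputed values.
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome holds, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy input (not real proteome data).
toy = ["MAAAL", "MKVLA", "AAGAA", "MSTAA"]
mean_aa, err_aa = bootstrap_error(toy, "AA")
print(f"AA: {mean_aa:.3f} ± {err_aa:.3f} per mille")
```

Resampling whole proteins instead of individual dipeptides keeps the dipeptides of one protein together, so the estimate reflects variation between proteins.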

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski