Amino acid dipeptide frequency for Alteromonadaceae bacterium Bs31

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
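As a rough illustration of how such a frequency can be obtained, the sketch below counts overlapping pairs of adjacent residues within each protein and normalises by the total number of pairs. The function name `dipeptide_frequencies`, the use of one-letter codes, and the toy sequences are assumptions made for this example; the exact procedure used by Proteome-pI is not described on this page.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences.

    Assumption: pairs are counted with an overlapping window of length 2
    and never span the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the proteome.
toy = ["MAAL", "MAL"]
freqs = dipeptide_frequencies(toy)
# Pairs: MA, AA, AL from the first protein and MA, AL from the second,
# so MA and AL are 400.0 per mille each and AA is 200.0 per mille.
```

A protein of length n contributes n - 1 pairs under this convention, so single-residue sequences contribute nothing.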

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
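The arithmetic behind the expected value and the unit conversion can be sketched as follows. The helper names `permille_to_fraction` and `classify` are hypothetical, and the plain comparison against the uniform expectation is an assumption; the page does not state the exact rule used for the red and blue marking. The four example values are copied from the table.

```python
N_STANDARD = 20  # standard amino acids; 21 if Xaa is included

# Uniform expectation: each of the 20 * 20 dipeptides is equally likely.
expected_fraction = 1 / N_STANDARD ** 2        # 1/400
expected_percent = 100 * expected_fraction     # about 0.25 %
expected_permille = 1000 * expected_fraction   # about 2.5 per mille

# With Xaa as a 21st symbol the expectation drops to roughly 2.27 per mille.
expected_permille_with_xaa = 1000 / 21 ** 2


def permille_to_fraction(value):
    """Convert a per mille value from the table to a plain fraction."""
    return value * 1e-3


def classify(value_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


# Values copied from the table, in per mille.
examples = {"AlaAla": 8.457, "LeuLeu": 10.81, "CysCys": 0.178, "TrpTrp": 0.218}
labels = {name: classify(value) for name, value in examples.items()}
```

Under this simple rule AlaAla and LeuLeu come out as overrepresented, while CysCys and TrpTrp come out as underrepresented.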

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide and columns the second. Each cell shows the frequency ± error in ‰.

| 1st/2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 8.457 ± 0.109 | 1.165 ± 0.032 | 4.758 ± 0.058 | 6.135 ± 0.079 | 3.492 ± 0.055 | 6.61 ± 0.097 | 1.731 ± 0.039 | 5.496 ± 0.069 | 4.55 ± 0.077 | 9.946 ± 0.103 | 2.221 ± 0.048 | 3.615 ± 0.061 | 3.16 ± 0.054 | 3.579 ± 0.059 | 4.094 ± 0.057 | 6.275 ± 0.083 | 4.038 ± 0.062 | 5.927 ± 0.079 | 1.096 ± 0.03 | 2.612 ± 0.046 | 0.0 ± 0.0 |
| Cys | 0.888 ± 0.029 | 0.178 ± 0.011 | 0.608 ± 0.022 | 0.705 ± 0.025 | 0.47 ± 0.018 | 0.989 ± 0.028 | 0.323 ± 0.02 | 0.59 ± 0.023 | 0.502 ± 0.023 | 1.068 ± 0.027 | 0.228 ± 0.012 | 0.413 ± 0.017 | 0.462 ± 0.019 | 0.361 ± 0.016 | 0.485 ± 0.019 | 0.887 ± 0.024 | 0.491 ± 0.019 | 0.743 ± 0.027 | 0.166 ± 0.01 | 0.336 ± 0.015 | 0.0 ± 0.0 |
| Asp | 4.57 ± 0.069 | 0.559 ± 0.026 | 3.041 ± 0.067 | 3.865 ± 0.067 | 2.485 ± 0.04 | 4.199 ± 0.099 | 1.029 ± 0.029 | 3.864 ± 0.061 | 3.051 ± 0.046 | 5.125 ± 0.076 | 1.244 ± 0.031 | 2.408 ± 0.046 | 2.221 ± 0.044 | 1.909 ± 0.044 | 2.379 ± 0.046 | 3.811 ± 0.084 | 2.927 ± 0.071 | 3.631 ± 0.059 | 0.902 ± 0.027 | 2.129 ± 0.042 | 0.0 ± 0.0 |
| Glu | 5.599 ± 0.072 | 0.519 ± 0.02 | 3.242 ± 0.055 | 4.145 ± 0.078 | 2.516 ± 0.047 | 4.124 ± 0.056 | 1.612 ± 0.037 | 3.831 ± 0.048 | 4.0 ± 0.067 | 7.261 ± 0.097 | 1.489 ± 0.035 | 3.19 ± 0.048 | 2.391 ± 0.05 | 3.792 ± 0.071 | 3.376 ± 0.063 | 3.962 ± 0.065 | 3.069 ± 0.051 | 4.048 ± 0.063 | 0.798 ± 0.024 | 1.914 ± 0.041 | 0.001 ± 0.001 |
| Phe | 3.436 ± 0.053 | 0.554 ± 0.021 | 2.805 ± 0.044 | 2.533 ± 0.047 | 1.812 ± 0.044 | 3.071 ± 0.057 | 0.8 ± 0.024 | 2.514 ± 0.052 | 1.938 ± 0.041 | 3.525 ± 0.066 | 0.88 ± 0.026 | 1.917 ± 0.04 | 1.536 ± 0.034 | 1.229 ± 0.03 | 1.727 ± 0.038 | 3.625 ± 0.054 | 2.353 ± 0.047 | 2.808 ± 0.049 | 0.552 ± 0.021 | 1.439 ± 0.028 | 0.0 ± 0.0 |
| Gly | 5.727 ± 0.089 | 0.855 ± 0.026 | 4.185 ± 0.078 | 4.915 ± 0.07 | 3.262 ± 0.047 | 5.693 ± 0.129 | 1.482 ± 0.031 | 4.293 ± 0.059 | 3.874 ± 0.057 | 7.015 ± 0.086 | 1.808 ± 0.04 | 2.971 ± 0.08 | 1.93 ± 0.042 | 2.372 ± 0.042 | 3.307 ± 0.059 | 5.333 ± 0.116 | 3.63 ± 0.092 | 5.127 ± 0.071 | 1.062 ± 0.032 | 2.448 ± 0.047 | 0.0 ± 0.0 |
| His | 1.633 ± 0.033 | 0.331 ± 0.016 | 0.975 ± 0.028 | 1.133 ± 0.031 | 1.077 ± 0.028 | 1.514 ± 0.037 | 0.611 ± 0.027 | 1.333 ± 0.025 | 1.062 ± 0.03 | 2.101 ± 0.038 | 0.504 ± 0.02 | 0.912 ± 0.025 | 1.07 ± 0.026 | 0.922 ± 0.028 | 1.083 ± 0.027 | 1.575 ± 0.031 | 1.094 ± 0.029 | 1.152 ± 0.028 | 0.37 ± 0.018 | 0.906 ± 0.027 | 0.0 ± 0.0 |
| Ile | 5.712 ± 0.077 | 0.655 ± 0.021 | 3.989 ± 0.059 | 4.311 ± 0.058 | 2.224 ± 0.048 | 4.035 ± 0.059 | 1.289 ± 0.034 | 3.114 ± 0.054 | 2.981 ± 0.052 | 4.827 ± 0.074 | 1.053 ± 0.031 | 2.872 ± 0.057 | 2.516 ± 0.041 | 2.005 ± 0.036 | 2.848 ± 0.049 | 4.619 ± 0.062 | 3.353 ± 0.077 | 3.995 ± 0.058 | 0.645 ± 0.021 | 1.799 ± 0.039 | 0.0 ± 0.0 |
| Lys | 4.697 ± 0.086 | 0.367 ± 0.018 | 2.579 ± 0.054 | 3.051 ± 0.063 | 1.532 ± 0.032 | 3.299 ± 0.051 | 1.282 ± 0.032 | 3.184 ± 0.057 | 3.268 ± 0.07 | 5.214 ± 0.079 | 1.185 ± 0.029 | 2.621 ± 0.053 | 2.457 ± 0.046 | 2.48 ± 0.05 | 2.684 ± 0.053 | 3.159 ± 0.051 | 2.788 ± 0.053 | 3.303 ± 0.065 | 0.537 ± 0.021 | 1.429 ± 0.035 | 0.0 ± 0.0 |
| Leu | 10.053 ± 0.107 | 1.175 ± 0.032 | 5.761 ± 0.081 | 6.522 ± 0.088 | 4.066 ± 0.075 | 6.862 ± 0.074 | 2.061 ± 0.045 | 5.628 ± 0.078 | 5.22 ± 0.08 | 10.81 ± 0.157 | 2.255 ± 0.044 | 4.351 ± 0.062 | 4.566 ± 0.064 | 4.17 ± 0.058 | 4.938 ± 0.065 | 8.354 ± 0.086 | 4.64 ± 0.057 | 6.974 ± 0.097 | 1.173 ± 0.033 | 2.768 ± 0.046 | 0.0 ± 0.0 |
| Met | 2.215 ± 0.04 | 0.186 ± 0.011 | 1.189 ± 0.027 | 1.401 ± 0.035 | 0.77 ± 0.023 | 1.581 ± 0.034 | 0.475 ± 0.018 | 1.127 ± 0.029 | 1.242 ± 0.032 | 2.41 ± 0.044 | 0.536 ± 0.021 | 0.99 ± 0.027 | 1.116 ± 0.031 | 0.947 ± 0.028 | 1.188 ± 0.03 | 1.735 ± 0.037 | 1.08 ± 0.031 | 1.47 ± 0.033 | 0.216 ± 0.014 | 0.505 ± 0.02 | 0.0 ± 0.0 |
| Asn | 3.631 ± 0.066 | 0.506 ± 0.018 | 2.288 ± 0.054 | 2.445 ± 0.04 | 1.74 ± 0.042 | 3.213 ± 0.082 | 0.813 ± 0.026 | 2.962 ± 0.055 | 2.298 ± 0.04 | 3.923 ± 0.061 | 0.947 ± 0.024 | 2.205 ± 0.049 | 2.164 ± 0.042 | 1.566 ± 0.034 | 2.037 ± 0.041 | 3.217 ± 0.065 | 2.703 ± 0.041 | 2.513 ± 0.054 | 0.753 ± 0.029 | 1.528 ± 0.032 | 0.0 ± 0.0 |
| Pro | 3.644 ± 0.063 | 0.352 ± 0.015 | 2.438 ± 0.044 | 3.433 ± 0.062 | 1.662 ± 0.032 | 2.962 ± 0.051 | 0.789 ± 0.023 | 2.115 ± 0.037 | 1.82 ± 0.035 | 4.118 ± 0.055 | 0.873 ± 0.025 | 1.592 ± 0.04 | 1.503 ± 0.042 | 1.641 ± 0.036 | 1.587 ± 0.038 | 2.827 ± 0.047 | 2.021 ± 0.073 | 3.01 ± 0.05 | 0.565 ± 0.023 | 1.262 ± 0.034 | 0.0 ± 0.0 |
| Gln | 4.284 ± 0.069 | 0.405 ± 0.018 | 1.737 ± 0.036 | 2.339 ± 0.042 | 1.57 ± 0.034 | 2.651 ± 0.042 | 1.088 ± 0.03 | 2.141 ± 0.039 | 2.114 ± 0.043 | 4.649 ± 0.072 | 0.965 ± 0.026 | 1.632 ± 0.033 | 1.486 ± 0.033 | 2.776 ± 0.059 | 2.244 ± 0.043 | 2.642 ± 0.048 | 1.824 ± 0.035 | 2.66 ± 0.048 | 0.68 ± 0.019 | 1.161 ± 0.031 | 0.0 ± 0.0 |
| Arg | 3.924 ± 0.058 | 0.478 ± 0.02 | 2.557 ± 0.045 | 3.368 ± 0.057 | 2.268 ± 0.038 | 2.925 ± 0.048 | 1.069 ± 0.03 | 2.987 ± 0.05 | 2.532 ± 0.054 | 5.071 ± 0.07 | 1.136 ± 0.026 | 1.945 ± 0.039 | 1.715 ± 0.042 | 1.93 ± 0.041 | 2.438 ± 0.048 | 3.157 ± 0.05 | 2.077 ± 0.045 | 3.364 ± 0.053 | 0.799 ± 0.027 | 1.941 ± 0.043 | 0.0 ± 0.0 |
| Ser | 6.509 ± 0.076 | 0.799 ± 0.03 | 4.04 ± 0.053 | 4.658 ± 0.059 | 2.998 ± 0.049 | 6.066 ± 0.114 | 1.5 ± 0.032 | 4.276 ± 0.071 | 3.349 ± 0.048 | 7.371 ± 0.086 | 1.593 ± 0.037 | 3.069 ± 0.056 | 2.898 ± 0.047 | 2.6 ± 0.047 | 3.214 ± 0.053 | 9.348 ± 0.507 | 3.88 ± 0.075 | 4.893 ± 0.07 | 1.105 ± 0.031 | 2.46 ± 0.048 | 0.0 ± 0.0 |
| Thr | 4.585 ± 0.071 | 0.448 ± 0.02 | 2.68 ± 0.066 | 3.147 ± 0.055 | 2.025 ± 0.047 | 4.028 ± 0.086 | 1.007 ± 0.025 | 3.12 ± 0.075 | 1.951 ± 0.037 | 5.829 ± 0.079 | 1.007 ± 0.023 | 1.907 ± 0.043 | 2.573 ± 0.062 | 1.956 ± 0.038 | 2.293 ± 0.041 | 3.466 ± 0.064 | 2.609 ± 0.057 | 3.61 ± 0.069 | 0.599 ± 0.026 | 1.448 ± 0.037 | 0.0 ± 0.0 |
| Val | 6.022 ± 0.068 | 0.771 ± 0.025 | 4.154 ± 0.066 | 4.397 ± 0.054 | 3.003 ± 0.051 | 4.314 ± 0.072 | 1.322 ± 0.029 | 3.883 ± 0.055 | 3.321 ± 0.057 | 6.979 ± 0.086 | 1.498 ± 0.033 | 3.029 ± 0.053 | 2.626 ± 0.049 | 2.221 ± 0.041 | 3.038 ± 0.048 | 5.201 ± 0.073 | 3.364 ± 0.068 | 5.061 ± 0.071 | 0.77 ± 0.024 | 2.021 ± 0.049 | 0.0 ± 0.0 |
| Trp | 0.954 ± 0.028 | 0.152 ± 0.01 | 0.677 ± 0.023 | 0.662 ± 0.024 | 0.59 ± 0.024 | 0.952 ± 0.029 | 0.356 ± 0.015 | 0.657 ± 0.024 | 0.531 ± 0.019 | 1.752 ± 0.043 | 0.334 ± 0.015 | 0.591 ± 0.026 | 0.543 ± 0.022 | 0.946 ± 0.025 | 0.815 ± 0.025 | 0.91 ± 0.027 | 0.57 ± 0.023 | 0.875 ± 0.027 | 0.218 ± 0.014 | 0.453 ± 0.019 | 0.0 ± 0.0 |
| Tyr | 2.419 ± 0.04 | 0.436 ± 0.021 | 1.648 ± 0.048 | 1.714 ± 0.035 | 1.493 ± 0.034 | 2.201 ± 0.039 | 0.72 ± 0.026 | 1.726 ± 0.037 | 1.513 ± 0.029 | 3.352 ± 0.055 | 0.618 ± 0.02 | 1.221 ± 0.034 | 1.28 ± 0.031 | 1.644 ± 0.035 | 1.903 ± 0.036 | 2.524 ± 0.043 | 1.774 ± 0.041 | 1.833 ± 0.034 | 0.535 ± 0.021 | 1.141 ± 0.032 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.001 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.01 ± 0.01 |

Statistics based on 4297 proteins (1444979 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
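A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the one-letter toy sequences, and the use of the sample standard deviation as the error are assumptions for illustration; the page only states that bootstrapping with 100 replicates at the protein level was used.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error (per mille) of one dipeptide frequency.

    Assumption: the error is the standard deviation over replicates,
    each built by drawing len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input, not taken from the proteome.
toy = ["MAAL", "MAL", "AAAA", "MKL"]
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual dipeptides keeps pairs from the same protein together, so a few unusual proteins (for example long repeats of one residue) show up as a larger error.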

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski