Amino acid dipepetide frequency for Methanosphaera sp. BMS

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.873AlaAla: 2.873 ± 0.09
0.675AlaCys: 0.675 ± 0.037
3.348AlaAsp: 3.348 ± 0.086
2.902AlaGlu: 2.902 ± 0.083
1.838AlaPhe: 1.838 ± 0.059
3.488AlaGly: 3.488 ± 0.091
0.843AlaHis: 0.843 ± 0.038
5.682AlaIle: 5.682 ± 0.134
3.786AlaLys: 3.786 ± 0.085
4.343AlaLeu: 4.343 ± 0.102
1.191AlaMet: 1.191 ± 0.047
3.633AlaAsn: 3.633 ± 0.103
1.342AlaPro: 1.342 ± 0.05
1.449AlaGln: 1.449 ± 0.04
1.766AlaArg: 1.766 ± 0.059
3.754AlaSer: 3.754 ± 0.082
4.317AlaThr: 4.317 ± 0.207
3.859AlaVal: 3.859 ± 0.078
0.239AlaTrp: 0.239 ± 0.021
2.205AlaTyr: 2.205 ± 0.059
0.0AlaXaa: 0.0 ± 0.0
Cys
0.571CysAla: 0.571 ± 0.035
0.16CysCys: 0.16 ± 0.016
0.809CysAsp: 0.809 ± 0.041
0.7CysGlu: 0.7 ± 0.038
0.364CysPhe: 0.364 ± 0.022
1.035CysGly: 1.035 ± 0.051
0.233CysHis: 0.233 ± 0.02
0.89CysIle: 0.89 ± 0.041
0.714CysLys: 0.714 ± 0.036
0.736CysLeu: 0.736 ± 0.039
0.271CysMet: 0.271 ± 0.022
0.683CysAsn: 0.683 ± 0.03
0.58CysPro: 0.58 ± 0.039
0.289CysGln: 0.289 ± 0.025
0.394CysArg: 0.394 ± 0.03
0.672CysSer: 0.672 ± 0.032
0.637CysThr: 0.637 ± 0.034
0.766CysVal: 0.766 ± 0.035
0.066CysTrp: 0.066 ± 0.009
0.467CysTyr: 0.467 ± 0.029
0.0CysXaa: 0.0 ± 0.0
Asp
3.571AspAla: 3.571 ± 0.086
0.769AspCys: 0.769 ± 0.037
5.159AspAsp: 5.159 ± 0.149
6.296AspGlu: 6.296 ± 0.153
2.354AspPhe: 2.354 ± 0.074
3.958AspGly: 3.958 ± 0.163
0.684AspHis: 0.684 ± 0.038
6.402AspIle: 6.402 ± 0.111
5.361AspLys: 5.361 ± 0.104
4.925AspLeu: 4.925 ± 0.124
1.722AspMet: 1.722 ± 0.063
5.505AspAsn: 5.505 ± 0.127
1.367AspPro: 1.367 ± 0.053
1.008AspGln: 1.008 ± 0.039
1.548AspArg: 1.548 ± 0.054
4.055AspSer: 4.055 ± 0.083
3.906AspThr: 3.906 ± 0.102
4.824AspVal: 4.824 ± 0.124
0.425AspTrp: 0.425 ± 0.024
3.525AspTyr: 3.525 ± 0.088
0.0AspXaa: 0.0 ± 0.0
Glu
3.504GluAla: 3.504 ± 0.101
0.743GluCys: 0.743 ± 0.037
4.957GluAsp: 4.957 ± 0.114
5.759GluGlu: 5.759 ± 0.164
2.591GluPhe: 2.591 ± 0.07
3.41GluGly: 3.41 ± 0.099
1.087GluHis: 1.087 ± 0.047
6.17GluIle: 6.17 ± 0.134
5.409GluLys: 5.409 ± 0.141
5.777GluLeu: 5.777 ± 0.133
1.397GluMet: 1.397 ± 0.051
5.874GluAsn: 5.874 ± 0.103
1.251GluPro: 1.251 ± 0.066
1.578GluGln: 1.578 ± 0.056
1.938GluArg: 1.938 ± 0.063
3.91GluSer: 3.91 ± 0.088
3.079GluThr: 3.079 ± 0.095
4.478GluVal: 4.478 ± 0.133
0.323GluTrp: 0.323 ± 0.022
3.979GluTyr: 3.979 ± 0.094
0.0GluXaa: 0.0 ± 0.0
Phe
1.822PheAla: 1.822 ± 0.064
0.356PheCys: 0.356 ± 0.026
2.87PheAsp: 2.87 ± 0.073
2.492PheGlu: 2.492 ± 0.063
1.346PhePhe: 1.346 ± 0.052
2.0PheGly: 2.0 ± 0.065
0.536PheHis: 0.536 ± 0.027
3.7PheIle: 3.7 ± 0.098
3.077PheLys: 3.077 ± 0.09
2.703PheLeu: 2.703 ± 0.092
0.918PheMet: 0.918 ± 0.039
3.085PheAsn: 3.085 ± 0.081
0.819PhePro: 0.819 ± 0.035
0.709PheGln: 0.709 ± 0.034
1.031PheArg: 1.031 ± 0.039
2.273PheSer: 2.273 ± 0.061
2.803PheThr: 2.803 ± 0.092
2.128PheVal: 2.128 ± 0.052
0.202PheTrp: 0.202 ± 0.02
1.5PheTyr: 1.5 ± 0.048
0.0PheXaa: 0.0 ± 0.0
Gly
3.911GlyAla: 3.911 ± 0.16
0.795GlyCys: 0.795 ± 0.042
3.261GlyAsp: 3.261 ± 0.072
3.33GlyGlu: 3.33 ± 0.116
2.182GlyPhe: 2.182 ± 0.071
4.319GlyGly: 4.319 ± 0.155
0.986GlyHis: 0.986 ± 0.038
5.113GlyIle: 5.113 ± 0.122
4.328GlyLys: 4.328 ± 0.094
4.356GlyLeu: 4.356 ± 0.11
1.302GlyMet: 1.302 ± 0.046
4.12GlyAsn: 4.12 ± 0.184
1.21GlyPro: 1.21 ± 0.04
1.888GlyGln: 1.888 ± 0.086
1.736GlyArg: 1.736 ± 0.054
3.94GlySer: 3.94 ± 0.094
3.92GlyThr: 3.92 ± 0.18
4.148GlyVal: 4.148 ± 0.093
0.441GlyTrp: 0.441 ± 0.028
2.743GlyTyr: 2.743 ± 0.075
0.0GlyXaa: 0.0 ± 0.0
His
0.879HisAla: 0.879 ± 0.034
0.224HisCys: 0.224 ± 0.018
1.091HisAsp: 1.091 ± 0.036
1.164HisGlu: 1.164 ± 0.041
0.524HisPhe: 0.524 ± 0.027
1.005HisGly: 1.005 ± 0.04
0.378HisHis: 0.378 ± 0.031
1.403HisIle: 1.403 ± 0.044
1.128HisLys: 1.128 ± 0.044
1.208HisLeu: 1.208 ± 0.05
0.429HisMet: 0.429 ± 0.024
1.128HisAsn: 1.128 ± 0.045
0.67HisPro: 0.67 ± 0.03
0.394HisGln: 0.394 ± 0.022
0.494HisArg: 0.494 ± 0.024
0.901HisSer: 0.901 ± 0.038
0.877HisThr: 0.877 ± 0.029
1.018HisVal: 1.018 ± 0.04
0.077HisTrp: 0.077 ± 0.011
0.743HisTyr: 0.743 ± 0.04
0.0HisXaa: 0.0 ± 0.0
Ile
5.087IleAla: 5.087 ± 0.106
1.076IleCys: 1.076 ± 0.046
6.316IleAsp: 6.316 ± 0.099
6.33IleGlu: 6.33 ± 0.136
3.46IlePhe: 3.46 ± 0.086
4.738IleGly: 4.738 ± 0.097
1.562IleHis: 1.562 ± 0.052
9.353IleIle: 9.353 ± 0.185
7.165IleLys: 7.165 ± 0.13
7.657IleLeu: 7.657 ± 0.174
1.913IleMet: 1.913 ± 0.062
7.834IleAsn: 7.834 ± 0.184
3.358IlePro: 3.358 ± 0.085
2.514IleGln: 2.514 ± 0.069
3.023IleArg: 3.023 ± 0.086
6.132IleSer: 6.132 ± 0.12
8.52IleThr: 8.52 ± 0.276
5.429IleVal: 5.429 ± 0.095
0.414IleTrp: 0.414 ± 0.026
4.249IleTyr: 4.249 ± 0.104
0.0IleXaa: 0.0 ± 0.0
Lys
3.781LysAla: 3.781 ± 0.097
0.754LysCys: 0.754 ± 0.041
4.906LysAsp: 4.906 ± 0.084
5.529LysGlu: 5.529 ± 0.127
2.255LysPhe: 2.255 ± 0.074
3.094LysGly: 3.094 ± 0.077
1.2LysHis: 1.2 ± 0.043
7.313LysIle: 7.313 ± 0.127
5.85LysLys: 5.85 ± 0.179
6.573LysLeu: 6.573 ± 0.139
1.554LysMet: 1.554 ± 0.058
5.513LysAsn: 5.513 ± 0.121
1.731LysPro: 1.731 ± 0.06
2.524LysGln: 2.524 ± 0.064
2.501LysArg: 2.501 ± 0.073
4.243LysSer: 4.243 ± 0.104
5.032LysThr: 5.032 ± 0.1
4.657LysVal: 4.657 ± 0.086
0.427LysTrp: 0.427 ± 0.024
3.83LysTyr: 3.83 ± 0.096
0.0LysXaa: 0.0 ± 0.0
Leu
4.311LeuAla: 4.311 ± 0.107
0.801LeuCys: 0.801 ± 0.041
5.602LeuAsp: 5.602 ± 0.107
5.768LeuGlu: 5.768 ± 0.13
3.354LeuPhe: 3.354 ± 0.115
4.148LeuGly: 4.148 ± 0.099
1.164LeuHis: 1.164 ± 0.047
7.374LeuIle: 7.374 ± 0.17
6.681LeuLys: 6.681 ± 0.161
7.066LeuLeu: 7.066 ± 0.179
1.843LeuMet: 1.843 ± 0.066
6.601LeuAsn: 6.601 ± 0.155
2.617LeuPro: 2.617 ± 0.071
1.808LeuGln: 1.808 ± 0.057
2.456LeuArg: 2.456 ± 0.08
5.957LeuSer: 5.957 ± 0.121
6.199LeuThr: 6.199 ± 0.142
4.423LeuVal: 4.423 ± 0.079
0.428LeuTrp: 0.428 ± 0.025
3.167LeuTyr: 3.167 ± 0.088
0.0LeuXaa: 0.0 ± 0.0
Met
1.362MetAla: 1.362 ± 0.052
0.217MetCys: 0.217 ± 0.017
1.865MetAsp: 1.865 ± 0.061
1.428MetGlu: 1.428 ± 0.053
0.74MetPhe: 0.74 ± 0.04
1.545MetGly: 1.545 ± 0.049
0.319MetHis: 0.319 ± 0.023
2.068MetIle: 2.068 ± 0.063
1.764MetLys: 1.764 ± 0.055
1.748MetLeu: 1.748 ± 0.063
0.548MetMet: 0.548 ± 0.03
1.51MetAsn: 1.51 ± 0.052
0.731MetPro: 0.731 ± 0.035
0.624MetGln: 0.624 ± 0.031
0.553MetArg: 0.553 ± 0.031
1.318MetSer: 1.318 ± 0.054
1.432MetThr: 1.432 ± 0.047
1.284MetVal: 1.284 ± 0.05
0.105MetTrp: 0.105 ± 0.012
0.715MetTyr: 0.715 ± 0.036
0.0MetXaa: 0.0 ± 0.0
Asn
3.708AsnAla: 3.708 ± 0.124
0.969AsnCys: 0.969 ± 0.051
4.487AsnAsp: 4.487 ± 0.102
4.261AsnGlu: 4.261 ± 0.087
2.165AsnPhe: 2.165 ± 0.073
5.191AsnGly: 5.191 ± 0.242
1.251AsnHis: 1.251 ± 0.048
8.282AsnIle: 8.282 ± 0.197
5.547AsnLys: 5.547 ± 0.106
5.711AsnLeu: 5.711 ± 0.112
1.781AsnMet: 1.781 ± 0.05
8.804AsnAsn: 8.804 ± 0.369
2.805AsnPro: 2.805 ± 0.075
2.666AsnGln: 2.666 ± 0.084
2.172AsnArg: 2.172 ± 0.057
5.101AsnSer: 5.101 ± 0.148
6.092AsnThr: 6.092 ± 0.221
5.096AsnVal: 5.096 ± 0.22
0.423AsnTrp: 0.423 ± 0.032
3.716AsnTyr: 3.716 ± 0.121
0.0AsnXaa: 0.0 ± 0.0
Pro
1.471ProAla: 1.471 ± 0.046
0.298ProCys: 0.298 ± 0.023
2.043ProAsp: 2.043 ± 0.058
2.619ProGlu: 2.619 ± 0.079
1.079ProPhe: 1.079 ± 0.042
1.618ProGly: 1.618 ± 0.053
0.551ProHis: 0.551 ± 0.029
2.552ProIle: 2.552 ± 0.068
1.77ProLys: 1.77 ± 0.065
2.405ProLeu: 2.405 ± 0.061
0.52ProMet: 0.52 ± 0.031
1.444ProAsn: 1.444 ± 0.049
0.649ProPro: 0.649 ± 0.03
0.925ProGln: 0.925 ± 0.039
0.873ProArg: 0.873 ± 0.039
1.864ProSer: 1.864 ± 0.056
2.029ProThr: 2.029 ± 0.07
2.433ProVal: 2.433 ± 0.143
0.185ProTrp: 0.185 ± 0.016
1.229ProTyr: 1.229 ± 0.045
0.0ProXaa: 0.0 ± 0.0
Gln
1.558GlnAla: 1.558 ± 0.047
0.228GlnCys: 0.228 ± 0.019
1.444GlnAsp: 1.444 ± 0.045
1.973GlnGlu: 1.973 ± 0.066
0.957GlnPhe: 0.957 ± 0.046
1.194GlnGly: 1.194 ± 0.042
0.432GlnHis: 0.432 ± 0.027
2.441GlnIle: 2.441 ± 0.065
2.119GlnLys: 2.119 ± 0.067
2.712GlnLeu: 2.712 ± 0.076
0.722GlnMet: 0.722 ± 0.038
2.0GlnAsn: 2.0 ± 0.087
0.679GlnPro: 0.679 ± 0.03
0.948GlnGln: 0.948 ± 0.042
0.901GlnArg: 0.901 ± 0.034
1.736GlnSer: 1.736 ± 0.065
1.967GlnThr: 1.967 ± 0.068
1.632GlnVal: 1.632 ± 0.046
0.192GlnTrp: 0.192 ± 0.016
1.394GlnTyr: 1.394 ± 0.048
0.0GlnXaa: 0.0 ± 0.0
Arg
1.353ArgAla: 1.353 ± 0.048
0.412ArgCys: 0.412 ± 0.027
1.852ArgAsp: 1.852 ± 0.051
2.013ArgGlu: 2.013 ± 0.066
1.128ArgPhe: 1.128 ± 0.047
1.715ArgGly: 1.715 ± 0.057
0.501ArgHis: 0.501 ± 0.031
2.939ArgIle: 2.939 ± 0.075
2.392ArgLys: 2.392 ± 0.081
2.708ArgLeu: 2.708 ± 0.08
0.73ArgMet: 0.73 ± 0.038
2.086ArgAsn: 2.086 ± 0.06
0.87ArgPro: 0.87 ± 0.044
0.961ArgGln: 0.961 ± 0.04
1.236ArgArg: 1.236 ± 0.054
1.522ArgSer: 1.522 ± 0.044
1.526ArgThr: 1.526 ± 0.047
1.731ArgVal: 1.731 ± 0.048
0.205ArgTrp: 0.205 ± 0.016
1.485ArgTyr: 1.485 ± 0.042
0.0ArgXaa: 0.0 ± 0.0
Ser
3.017SerAla: 3.017 ± 0.088
0.715SerCys: 0.715 ± 0.034
4.167SerAsp: 4.167 ± 0.085
3.467SerGlu: 3.467 ± 0.079
2.669SerPhe: 2.669 ± 0.072
4.135SerGly: 4.135 ± 0.09
1.086SerHis: 1.086 ± 0.045
6.229SerIle: 6.229 ± 0.098
4.508SerLys: 4.508 ± 0.111
5.068SerLeu: 5.068 ± 0.128
1.397SerMet: 1.397 ± 0.05
5.303SerAsn: 5.303 ± 0.155
1.636SerPro: 1.636 ± 0.046
2.052SerGln: 2.052 ± 0.064
1.95SerArg: 1.95 ± 0.055
4.771SerSer: 4.771 ± 0.145
4.603SerThr: 4.603 ± 0.105
3.886SerVal: 3.886 ± 0.097
0.462SerTrp: 0.462 ± 0.026
2.937SerTyr: 2.937 ± 0.072
0.0SerXaa: 0.0 ± 0.0
Thr
4.629ThrAla: 4.629 ± 0.182
0.589ThrCys: 0.589 ± 0.031
4.576ThrAsp: 4.576 ± 0.162
3.376ThrGlu: 3.376 ± 0.09
2.912ThrPhe: 2.912 ± 0.093
4.438ThrGly: 4.438 ± 0.125
1.121ThrHis: 1.121 ± 0.042
8.083ThrIle: 8.083 ± 0.311
3.911ThrLys: 3.911 ± 0.093
6.56ThrLeu: 6.56 ± 0.175
1.135ThrMet: 1.135 ± 0.04
4.99ThrAsn: 4.99 ± 0.21
2.708ThrPro: 2.708 ± 0.092
2.107ThrGln: 2.107 ± 0.095
1.745ThrArg: 1.745 ± 0.048
4.317ThrSer: 4.317 ± 0.103
5.364ThrThr: 5.364 ± 0.235
5.686ThrVal: 5.686 ± 0.357
0.345ThrTrp: 0.345 ± 0.024
2.887ThrTyr: 2.887 ± 0.094
0.0ThrXaa: 0.0 ± 0.0
Val
3.491ValAla: 3.491 ± 0.098
0.709ValCys: 0.709 ± 0.037
4.618ValAsp: 4.618 ± 0.109
4.488ValGlu: 4.488 ± 0.134
2.535ValPhe: 2.535 ± 0.069
3.642ValGly: 3.642 ± 0.094
0.873ValHis: 0.873 ± 0.033
6.046ValIle: 6.046 ± 0.102
4.638ValLys: 4.638 ± 0.07
5.352ValLeu: 5.352 ± 0.09
1.349ValMet: 1.349 ± 0.051
4.99ValAsn: 4.99 ± 0.196
1.86ValPro: 1.86 ± 0.069
1.363ValGln: 1.363 ± 0.04
1.628ValArg: 1.628 ± 0.061
4.23ValSer: 4.23 ± 0.095
5.381ValThr: 5.381 ± 0.423
4.366ValVal: 4.366 ± 0.164
0.26ValTrp: 0.26 ± 0.019
2.99ValTyr: 2.99 ± 0.092
0.0ValXaa: 0.0 ± 0.0
Trp
0.337TrpAla: 0.337 ± 0.025
0.057TrpCys: 0.057 ± 0.009
0.438TrpAsp: 0.438 ± 0.027
0.308TrpGlu: 0.308 ± 0.023
0.212TrpPhe: 0.212 ± 0.018
0.377TrpGly: 0.377 ± 0.03
0.101TrpHis: 0.101 ± 0.013
0.545TrpIle: 0.545 ± 0.033
0.369TrpLys: 0.369 ± 0.023
0.433TrpLeu: 0.433 ± 0.026
0.143TrpMet: 0.143 ± 0.014
0.453TrpAsn: 0.453 ± 0.027
0.118TrpPro: 0.118 ± 0.012
0.153TrpGln: 0.153 ± 0.013
0.159TrpArg: 0.159 ± 0.015
0.32TrpSer: 0.32 ± 0.022
0.385TrpThr: 0.385 ± 0.024
0.308TrpVal: 0.308 ± 0.023
0.075TrpTrp: 0.075 ± 0.011
0.268TrpTyr: 0.268 ± 0.019
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.296TyrAla: 2.296 ± 0.053
0.49TyrCys: 0.49 ± 0.028
3.556TyrAsp: 3.556 ± 0.079
3.008TyrGlu: 3.008 ± 0.077
1.705TyrPhe: 1.705 ± 0.056
2.977TyrGly: 2.977 ± 0.083
0.827TyrHis: 0.827 ± 0.039
3.542TyrIle: 3.542 ± 0.093
2.752TyrLys: 2.752 ± 0.077
3.726TyrLeu: 3.726 ± 0.086
1.009TyrMet: 1.009 ± 0.042
4.673TyrAsn: 4.673 ± 0.172
1.556TyrPro: 1.556 ± 0.049
1.268TyrGln: 1.268 ± 0.048
1.262TyrArg: 1.262 ± 0.041
3.043TyrSer: 3.043 ± 0.073
3.414TyrThr: 3.414 ± 0.118
2.656TyrVal: 2.656 ± 0.076
0.268TyrTrp: 0.268 ± 0.022
2.368TyrTyr: 2.368 ± 0.065
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2101 proteins (768877 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski