Amino acid dipeptide frequency for Pseudomonas phage PhiPA3 (Pseudomonas aeruginosa phage PhiPA3)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
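As a rough illustration of how such frequencies could be computed, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter alphabet, the mapping of any non-standard letter to `X` (Xaa), and the normalisation by the total number of pairs are all assumptions made for this example; the exact procedure used by the database is not described on this page.

```python
from collections import Counter
from itertools import product

# Hypothetical alphabet: the 20 standard residues in the table's column
# order (Ala, Cys, Asp, ...), plus X standing in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy input, not taken from the PhiPA3 proteome.
freqs = dipeptide_per_mille(["MAAK", "AAL"])
print(len(freqs), freqs["AA"])
```

The toy input contains five pairs in total (MA, AA, AK, AA, AL), so AlaAla accounts for two out of five.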

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each one would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red and all under-represented ones in blue in the table.
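The arithmetic behind this expectation can be sketched as follows. The helper names `expected_per_mille` and `classify` are hypothetical, and the example assumes that the colouring threshold is simply the uniform expectation, which the page implies but does not state precisely.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / n_residue_types ** 2


def classify(observed_per_mille, n_residue_types=20):
    """Label a value relative to the uniform expectation (assumed threshold)."""
    expected = expected_per_mille(n_residue_types)
    if observed_per_mille > expected:
        return "over-represented"
    if observed_per_mille < expected:
        return "under-represented"
    return "as expected"


print(expected_per_mille(20))   # 400 possible dipeptides
print(expected_per_mille(21))   # 441 possible dipeptides when Xaa is included
print(classify(6.802))          # AlaAla value from the table
print(classify(0.176))          # CysCys value from the table
```

With 20 residue types the expectation is 2.5‰, so AlaAla (6.802‰) lies above it and CysCys (0.176‰) below it.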

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (for example, 6.802‰ corresponds to 0.006802).

For more information, see the sequence space article on Wikipedia.

Values are listed by first residue (row label), then by second residue in the column order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.802 ± 0.489
AlaCys: 0.549 ± 0.071
AlaAsp: 4.373 ± 0.198
AlaGlu: 5.296 ± 0.383
AlaPhe: 2.824 ± 0.14
AlaGly: 4.868 ± 0.334
AlaHis: 1.385 ± 0.118
AlaIle: 4.879 ± 0.168
AlaLys: 3.934 ± 0.231
AlaLeu: 6.483 ± 0.286
AlaMet: 2.648 ± 0.163
AlaAsn: 3.934 ± 0.216
AlaPro: 2.637 ± 0.225
AlaGln: 2.758 ± 0.173
AlaArg: 3.835 ± 0.175
AlaSer: 4.066 ± 0.219
AlaThr: 4.483 ± 0.249
AlaVal: 5.362 ± 0.246
AlaTrp: 0.934 ± 0.113
AlaTyr: 2.989 ± 0.204
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.604 ± 0.084
CysCys: 0.176 ± 0.044
CysAsp: 0.626 ± 0.085
CysGlu: 0.505 ± 0.075
CysPhe: 0.319 ± 0.06
CysGly: 0.538 ± 0.084
CysHis: 0.22 ± 0.054
CysIle: 0.462 ± 0.072
CysLys: 0.582 ± 0.091
CysLeu: 0.725 ± 0.102
CysMet: 0.297 ± 0.063
CysAsn: 0.418 ± 0.069
CysPro: 0.418 ± 0.063
CysGln: 0.451 ± 0.077
CysArg: 0.604 ± 0.089
CysSer: 0.483 ± 0.079
CysThr: 0.527 ± 0.072
CysVal: 0.637 ± 0.091
CysTrp: 0.176 ± 0.046
CysTyr: 0.363 ± 0.066
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.417 ± 0.222
AspCys: 0.505 ± 0.078
AspAsp: 3.791 ± 0.228
AspGlu: 4.351 ± 0.266
AspPhe: 2.604 ± 0.174
AspGly: 4.395 ± 0.224
AspHis: 1.297 ± 0.112
AspIle: 4.011 ± 0.223
AspLys: 3.824 ± 0.279
AspLeu: 6.109 ± 0.24
AspMet: 1.736 ± 0.149
AspAsn: 2.901 ± 0.153
AspPro: 3.34 ± 0.215
AspGln: 2.912 ± 0.172
AspArg: 3.253 ± 0.195
AspSer: 3.154 ± 0.185
AspThr: 3.769 ± 0.172
AspVal: 4.571 ± 0.288
AspTrp: 0.956 ± 0.116
AspTyr: 2.703 ± 0.177
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.362 ± 0.393
GluCys: 0.593 ± 0.083
GluAsp: 3.879 ± 0.239
GluGlu: 4.659 ± 0.331
GluPhe: 2.912 ± 0.167
GluGly: 3.395 ± 0.221
GluHis: 1.33 ± 0.115
GluIle: 4.176 ± 0.238
GluLys: 3.154 ± 0.176
GluLeu: 7.142 ± 0.303
GluMet: 1.813 ± 0.139
GluAsn: 2.637 ± 0.178
GluPro: 1.989 ± 0.164
GluGln: 3.176 ± 0.218
GluArg: 3.703 ± 0.218
GluSer: 3.275 ± 0.192
GluThr: 3.714 ± 0.227
GluVal: 4.307 ± 0.26
GluTrp: 1.165 ± 0.102
GluTyr: 2.846 ± 0.226
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.637 ± 0.166
PheCys: 0.472 ± 0.067
PheAsp: 3.033 ± 0.164
PheGlu: 2.308 ± 0.151
PhePhe: 1.802 ± 0.137
PheGly: 2.725 ± 0.168
PheHis: 0.956 ± 0.1
PheIle: 2.308 ± 0.154
PheLys: 2.319 ± 0.16
PheLeu: 2.725 ± 0.181
PheMet: 1.11 ± 0.13
PheAsn: 2.527 ± 0.168
PhePro: 1.659 ± 0.155
PheGln: 1.604 ± 0.138
PheArg: 2.231 ± 0.156
PheSer: 2.264 ± 0.151
PheThr: 2.593 ± 0.159
PheVal: 2.802 ± 0.158
PheTrp: 0.494 ± 0.074
PheTyr: 1.67 ± 0.15
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.659 ± 0.252
GlyCys: 0.472 ± 0.075
GlyAsp: 4.681 ± 0.438
GlyGlu: 4.143 ± 0.233
GlyPhe: 2.582 ± 0.156
GlyGly: 4.703 ± 0.312
GlyHis: 1.33 ± 0.129
GlyIle: 3.934 ± 0.203
GlyLys: 3.736 ± 0.215
GlyLeu: 5.307 ± 0.263
GlyMet: 1.89 ± 0.161
GlyAsn: 3.549 ± 0.227
GlyPro: 1.791 ± 0.192
GlyGln: 2.22 ± 0.15
GlyArg: 3.077 ± 0.241
GlySer: 3.846 ± 0.274
GlyThr: 4.176 ± 0.262
GlyVal: 4.582 ± 0.252
GlyTrp: 1.0 ± 0.107
GlyTyr: 3.0 ± 0.227
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.264 ± 0.134
HisCys: 0.198 ± 0.054
HisAsp: 1.264 ± 0.114
HisGlu: 1.231 ± 0.097
HisPhe: 1.044 ± 0.114
HisGly: 1.56 ± 0.139
HisHis: 0.56 ± 0.088
HisIle: 1.341 ± 0.118
HisLys: 0.967 ± 0.102
HisLeu: 1.934 ± 0.145
HisMet: 0.582 ± 0.083
HisAsn: 1.143 ± 0.109
HisPro: 1.165 ± 0.145
HisGln: 0.989 ± 0.116
HisArg: 1.286 ± 0.108
HisSer: 1.055 ± 0.113
HisThr: 1.088 ± 0.114
HisVal: 1.527 ± 0.127
HisTrp: 0.396 ± 0.072
HisTyr: 1.165 ± 0.125
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.923 ± 0.233
IleCys: 0.527 ± 0.094
IleAsp: 4.692 ± 0.274
IleGlu: 4.121 ± 0.211
IlePhe: 1.582 ± 0.149
IleGly: 3.439 ± 0.229
IleHis: 1.483 ± 0.144
IleIle: 2.89 ± 0.185
IleLys: 3.285 ± 0.192
IleLeu: 4.121 ± 0.208
IleMet: 1.626 ± 0.137
IleAsn: 3.505 ± 0.19
IlePro: 3.198 ± 0.175
IleGln: 2.659 ± 0.17
IleArg: 3.615 ± 0.218
IleSer: 3.549 ± 0.205
IleThr: 4.373 ± 0.203
IleVal: 3.868 ± 0.196
IleTrp: 0.725 ± 0.096
IleTyr: 2.044 ± 0.145
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.835 ± 0.256
LysCys: 0.407 ± 0.083
LysAsp: 3.406 ± 0.209
LysGlu: 3.439 ± 0.223
LysPhe: 2.209 ± 0.175
LysGly: 3.945 ± 0.418
LysHis: 1.407 ± 0.131
LysIle: 2.956 ± 0.174
LysLys: 2.571 ± 0.241
LysLeu: 5.241 ± 0.262
LysMet: 1.286 ± 0.12
LysAsn: 2.275 ± 0.164
LysPro: 2.264 ± 0.15
LysGln: 2.099 ± 0.139
LysArg: 3.11 ± 0.188
LysSer: 2.747 ± 0.194
LysThr: 2.802 ± 0.175
LysVal: 4.165 ± 0.196
LysTrp: 0.681 ± 0.092
LysTyr: 1.901 ± 0.138
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.626 ± 0.271
LeuCys: 0.879 ± 0.106
LeuAsp: 5.813 ± 0.24
LeuGlu: 5.945 ± 0.29
LeuPhe: 3.395 ± 0.19
LeuGly: 5.033 ± 0.244
LeuHis: 1.857 ± 0.149
LeuIle: 4.901 ± 0.224
LeuLys: 5.077 ± 0.272
LeuLeu: 6.483 ± 0.292
LeuMet: 2.264 ± 0.151
LeuAsn: 4.439 ± 0.234
LeuPro: 4.0 ± 0.193
LeuGln: 3.417 ± 0.193
LeuArg: 5.011 ± 0.238
LeuSer: 4.56 ± 0.205
LeuThr: 5.186 ± 0.23
LeuVal: 5.681 ± 0.254
LeuTrp: 0.846 ± 0.096
LeuTyr: 3.242 ± 0.18
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.428 ± 0.163
MetCys: 0.209 ± 0.048
MetAsp: 1.791 ± 0.133
MetGlu: 1.747 ± 0.135
MetPhe: 1.154 ± 0.118
MetGly: 1.923 ± 0.163
MetHis: 0.648 ± 0.093
MetIle: 1.615 ± 0.141
MetLys: 1.22 ± 0.101
MetLeu: 1.989 ± 0.126
MetMet: 0.648 ± 0.097
MetAsn: 1.352 ± 0.125
MetPro: 1.099 ± 0.13
MetGln: 1.297 ± 0.104
MetArg: 1.659 ± 0.14
MetSer: 2.472 ± 0.155
MetThr: 1.89 ± 0.123
MetVal: 2.209 ± 0.135
MetTrp: 0.231 ± 0.053
MetTyr: 1.033 ± 0.128
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.45 ± 0.195
AsnCys: 0.418 ± 0.059
AsnAsp: 2.89 ± 0.167
AsnGlu: 3.022 ± 0.176
AsnPhe: 1.78 ± 0.131
AsnGly: 4.033 ± 0.232
AsnHis: 1.198 ± 0.132
AsnIle: 3.022 ± 0.167
AsnLys: 2.549 ± 0.138
AsnLeu: 4.121 ± 0.21
AsnMet: 1.505 ± 0.148
AsnAsn: 3.0 ± 0.214
AsnPro: 2.769 ± 0.194
AsnGln: 1.989 ± 0.126
AsnArg: 2.945 ± 0.177
AsnSer: 2.538 ± 0.154
AsnThr: 2.901 ± 0.176
AsnVal: 3.516 ± 0.195
AsnTrp: 0.824 ± 0.093
AsnTyr: 1.934 ± 0.152
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.066 ± 0.173
ProCys: 0.253 ± 0.052
ProAsp: 2.846 ± 0.174
ProGlu: 3.406 ± 0.197
ProPhe: 1.923 ± 0.143
ProGly: 2.494 ± 0.15
ProHis: 0.912 ± 0.104
ProIle: 2.341 ± 0.172
ProLys: 2.165 ± 0.181
ProLeu: 2.956 ± 0.18
ProMet: 1.352 ± 0.131
ProAsn: 1.835 ± 0.149
ProPro: 1.483 ± 0.147
ProGln: 1.615 ± 0.119
ProArg: 1.78 ± 0.143
ProSer: 2.538 ± 0.163
ProThr: 3.406 ± 0.193
ProVal: 3.516 ± 0.181
ProTrp: 0.549 ± 0.085
ProTyr: 1.692 ± 0.122
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.428 ± 0.202
GlnCys: 0.472 ± 0.068
GlnAsp: 1.989 ± 0.139
GlnGlu: 2.703 ± 0.185
GlnPhe: 1.78 ± 0.151
GlnGly: 2.297 ± 0.198
GlnHis: 0.978 ± 0.112
GlnIle: 2.308 ± 0.155
GlnLys: 1.846 ± 0.153
GlnLeu: 3.912 ± 0.226
GlnMet: 1.187 ± 0.112
GlnAsn: 1.868 ± 0.18
GlnPro: 1.637 ± 0.153
GlnGln: 2.077 ± 0.161
GlnArg: 2.494 ± 0.147
GlnSer: 2.22 ± 0.166
GlnThr: 2.373 ± 0.171
GlnVal: 2.736 ± 0.16
GlnTrp: 0.791 ± 0.089
GlnTyr: 1.615 ± 0.128
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.472 ± 0.227
ArgCys: 0.516 ± 0.078
ArgAsp: 3.571 ± 0.205
ArgGlu: 3.329 ± 0.198
ArgPhe: 2.67 ± 0.157
ArgGly: 2.769 ± 0.18
ArgHis: 1.308 ± 0.138
ArgIle: 3.637 ± 0.208
ArgLys: 3.088 ± 0.176
ArgLeu: 5.571 ± 0.259
ArgMet: 1.681 ± 0.147
ArgAsn: 2.417 ± 0.18
ArgPro: 1.879 ± 0.146
ArgGln: 2.165 ± 0.181
ArgArg: 3.549 ± 0.211
ArgSer: 3.022 ± 0.155
ArgThr: 2.989 ± 0.15
ArgVal: 3.956 ± 0.209
ArgTrp: 1.0 ± 0.094
ArgTyr: 2.494 ± 0.157
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.055 ± 0.273
SerCys: 0.472 ± 0.073
SerAsp: 3.296 ± 0.181
SerGlu: 3.176 ± 0.199
SerPhe: 2.494 ± 0.151
SerGly: 4.187 ± 0.204
SerHis: 1.0 ± 0.103
SerIle: 3.846 ± 0.217
SerLys: 3.253 ± 0.214
SerLeu: 4.659 ± 0.197
SerMet: 1.67 ± 0.111
SerAsn: 2.912 ± 0.187
SerPro: 2.275 ± 0.152
SerGln: 2.066 ± 0.172
SerArg: 2.659 ± 0.176
SerSer: 3.527 ± 0.207
SerThr: 3.549 ± 0.191
SerVal: 3.923 ± 0.209
SerTrp: 0.857 ± 0.104
SerTyr: 2.121 ± 0.146
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.044 ± 0.303
ThrCys: 0.549 ± 0.082
ThrAsp: 3.879 ± 0.213
ThrGlu: 3.857 ± 0.245
ThrPhe: 2.472 ± 0.169
ThrGly: 4.428 ± 0.236
ThrHis: 1.275 ± 0.121
ThrIle: 3.714 ± 0.185
ThrLys: 2.945 ± 0.173
ThrLeu: 5.241 ± 0.278
ThrMet: 1.319 ± 0.125
ThrAsn: 2.736 ± 0.211
ThrPro: 3.121 ± 0.198
ThrGln: 2.472 ± 0.162
ThrArg: 2.824 ± 0.161
ThrSer: 3.472 ± 0.201
ThrThr: 4.077 ± 0.258
ThrVal: 4.901 ± 0.23
ThrTrp: 0.956 ± 0.094
ThrTyr: 2.56 ± 0.175
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.384 ± 0.229
ValCys: 0.802 ± 0.112
ValAsp: 5.011 ± 0.243
ValGlu: 5.011 ± 0.302
ValPhe: 2.593 ± 0.171
ValGly: 3.934 ± 0.186
ValHis: 1.088 ± 0.12
ValIle: 4.538 ± 0.223
ValLys: 4.318 ± 0.205
ValLeu: 5.066 ± 0.218
ValMet: 2.165 ± 0.148
ValAsn: 3.67 ± 0.211
ValPro: 3.318 ± 0.186
ValGln: 2.659 ± 0.174
ValArg: 4.077 ± 0.192
ValSer: 4.154 ± 0.24
ValThr: 4.659 ± 0.242
ValVal: 5.428 ± 0.302
ValTrp: 1.044 ± 0.107
ValTyr: 2.901 ± 0.192
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.901 ± 0.098
TrpCys: 0.154 ± 0.043
TrpAsp: 0.868 ± 0.097
TrpGlu: 0.868 ± 0.081
TrpPhe: 0.582 ± 0.082
TrpGly: 0.703 ± 0.089
TrpHis: 0.275 ± 0.058
TrpIle: 0.824 ± 0.1
TrpLys: 0.791 ± 0.09
TrpLeu: 1.461 ± 0.113
TrpMet: 0.549 ± 0.087
TrpAsn: 0.758 ± 0.095
TrpPro: 0.385 ± 0.056
TrpGln: 0.516 ± 0.064
TrpArg: 1.011 ± 0.102
TrpSer: 0.791 ± 0.093
TrpThr: 0.89 ± 0.109
TrpVal: 1.352 ± 0.17
TrpTrp: 0.22 ± 0.043
TrpTyr: 0.593 ± 0.081
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.681 ± 0.185
TyrCys: 0.505 ± 0.073
TyrAsp: 2.901 ± 0.219
TyrGlu: 2.11 ± 0.154
TyrPhe: 1.516 ± 0.14
TyrGly: 2.319 ± 0.136
TyrHis: 1.253 ± 0.129
TyrIle: 2.527 ± 0.188
TyrLys: 2.121 ± 0.16
TyrLeu: 3.406 ± 0.18
TyrMet: 1.275 ± 0.118
TyrAsn: 2.527 ± 0.152
TyrPro: 1.703 ± 0.138
TyrGln: 1.571 ± 0.125
TyrArg: 2.362 ± 0.153
TyrSer: 2.264 ± 0.142
TyrThr: 2.395 ± 0.188
TyrVal: 2.78 ± 0.173
TyrTrp: 0.648 ± 0.09
TyrTyr: 1.747 ± 0.152
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 375 proteins (91007 amino acids)

Note: The errors were estimated by bootstrapping (100 replicates) at the protein level.
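A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. The function names, the fixed seed, and the choice of the standard deviation of the replicates as the reported error are assumptions made for illustration; the page does not state which statistic is used.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, spread) of a dipeptide frequency in per mille.

    Proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the standard deviation across replicates.
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences, not taken from the PhiPA3 proteome.
mean, err = bootstrap_error(["MAAK", "AAL", "MKLV", "AAAA"], "AA")
print(mean, err)
```

Resampling whole proteins keeps the pairs within each protein together, so the estimate reflects variation between proteins rather than between individual positions.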

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski