Amino acid dipepetide frequency for Ostreococcus tauri virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.975AlaAla: 3.975 ± 0.345
1.121AlaCys: 1.121 ± 0.169
3.362AlaAsp: 3.362 ± 0.271
3.135AlaGlu: 3.135 ± 0.281
2.277AlaPhe: 2.277 ± 0.228
4.063AlaGly: 4.063 ± 0.538
1.103AlaHis: 1.103 ± 0.135
3.905AlaIle: 3.905 ± 0.297
4.185AlaLys: 4.185 ± 0.478
5.148AlaLeu: 5.148 ± 0.362
1.629AlaMet: 1.629 ± 0.19
3.555AlaAsn: 3.555 ± 0.552
1.699AlaPro: 1.699 ± 0.205
2.452AlaGln: 2.452 ± 0.289
3.012AlaArg: 3.012 ± 0.437
3.572AlaSer: 3.572 ± 0.308
4.045AlaThr: 4.045 ± 0.442
3.642AlaVal: 3.642 ± 0.262
0.525AlaTrp: 0.525 ± 0.125
2.119AlaTyr: 2.119 ± 0.196
0.0AlaXaa: 0.0 ± 0.0
Cys
0.841CysAla: 0.841 ± 0.145
0.455CysCys: 0.455 ± 0.089
1.191CysAsp: 1.191 ± 0.161
1.594CysGlu: 1.594 ± 0.174
0.63CysPhe: 0.63 ± 0.1
1.173CysGly: 1.173 ± 0.19
0.438CysHis: 0.438 ± 0.087
0.806CysIle: 0.806 ± 0.139
1.121CysLys: 1.121 ± 0.16
1.296CysLeu: 1.296 ± 0.15
0.473CysMet: 0.473 ± 0.086
0.788CysAsn: 0.788 ± 0.143
1.296CysPro: 1.296 ± 0.235
0.385CysGln: 0.385 ± 0.086
0.911CysArg: 0.911 ± 0.129
1.156CysSer: 1.156 ± 0.163
0.823CysThr: 0.823 ± 0.125
1.191CysVal: 1.191 ± 0.143
0.105CysTrp: 0.105 ± 0.043
0.735CysTyr: 0.735 ± 0.103
0.0CysXaa: 0.0 ± 0.0
Asp
3.45AspAla: 3.45 ± 0.248
1.033AspCys: 1.033 ± 0.148
3.835AspAsp: 3.835 ± 0.287
4.798AspGlu: 4.798 ± 0.332
2.557AspPhe: 2.557 ± 0.211
3.747AspGly: 3.747 ± 0.332
1.243AspHis: 1.243 ± 0.166
4.413AspIle: 4.413 ± 0.339
3.082AspLys: 3.082 ± 0.233
4.868AspLeu: 4.868 ± 0.232
1.751AspMet: 1.751 ± 0.193
2.259AspAsn: 2.259 ± 0.172
2.171AspPro: 2.171 ± 0.215
1.576AspGln: 1.576 ± 0.152
2.942AspArg: 2.942 ± 0.189
3.065AspSer: 3.065 ± 0.368
4.01AspThr: 4.01 ± 0.383
3.975AspVal: 3.975 ± 0.365
0.806AspTrp: 0.806 ± 0.128
2.872AspTyr: 2.872 ± 0.257
0.0AspXaa: 0.0 ± 0.0
Glu
4.22GluAla: 4.22 ± 0.468
1.121GluCys: 1.121 ± 0.142
4.168GluAsp: 4.168 ± 0.383
5.901GluGlu: 5.901 ± 0.475
2.889GluPhe: 2.889 ± 0.195
3.397GluGly: 3.397 ± 0.25
1.681GluHis: 1.681 ± 0.177
4.43GluIle: 4.43 ± 0.333
5.481GluLys: 5.481 ± 0.49
5.551GluLeu: 5.551 ± 0.377
2.154GluMet: 2.154 ± 0.177
3.818GluAsn: 3.818 ± 0.255
2.364GluPro: 2.364 ± 0.253
1.751GluGln: 1.751 ± 0.191
3.66GluArg: 3.66 ± 0.35
3.485GluSer: 3.485 ± 0.267
3.66GluThr: 3.66 ± 0.243
3.818GluVal: 3.818 ± 0.278
0.823GluTrp: 0.823 ± 0.134
2.994GluTyr: 2.994 ± 0.206
0.0GluXaa: 0.0 ± 0.0
Phe
2.627PheAla: 2.627 ± 0.235
0.893PheCys: 0.893 ± 0.119
2.907PheAsp: 2.907 ± 0.241
2.627PheGlu: 2.627 ± 0.235
2.014PhePhe: 2.014 ± 0.218
2.329PheGly: 2.329 ± 0.227
1.138PheHis: 1.138 ± 0.139
2.627PheIle: 2.627 ± 0.228
3.03PheLys: 3.03 ± 0.258
3.205PheLeu: 3.205 ± 0.27
1.559PheMet: 1.559 ± 0.192
2.312PheAsn: 2.312 ± 0.232
1.576PhePro: 1.576 ± 0.176
1.068PheGln: 1.068 ± 0.16
1.891PheArg: 1.891 ± 0.173
3.065PheSer: 3.065 ± 0.253
2.206PheThr: 2.206 ± 0.189
3.38PheVal: 3.38 ± 0.285
0.42PheTrp: 0.42 ± 0.086
1.751PheTyr: 1.751 ± 0.189
0.0PheXaa: 0.0 ± 0.0
Gly
4.045GlyAla: 4.045 ± 0.435
0.998GlyCys: 0.998 ± 0.153
4.045GlyAsp: 4.045 ± 0.419
3.432GlyGlu: 3.432 ± 0.251
2.942GlyPhe: 2.942 ± 0.246
5.131GlyGly: 5.131 ± 0.655
1.488GlyHis: 1.488 ± 0.159
3.923GlyIle: 3.923 ± 0.313
4.325GlyLys: 4.325 ± 0.244
5.061GlyLeu: 5.061 ± 0.355
1.629GlyMet: 1.629 ± 0.184
3.485GlyAsn: 3.485 ± 0.396
2.277GlyPro: 2.277 ± 0.226
2.049GlyGln: 2.049 ± 0.189
2.837GlyArg: 2.837 ± 0.278
3.712GlySer: 3.712 ± 0.336
3.905GlyThr: 3.905 ± 0.417
4.08GlyVal: 4.08 ± 0.368
0.735GlyTrp: 0.735 ± 0.117
2.469GlyTyr: 2.469 ± 0.221
0.0GlyXaa: 0.0 ± 0.0
His
1.418HisAla: 1.418 ± 0.335
0.455HisCys: 0.455 ± 0.088
1.051HisAsp: 1.051 ± 0.145
1.401HisGlu: 1.401 ± 0.14
0.735HisPhe: 0.735 ± 0.126
1.594HisGly: 1.594 ± 0.145
0.683HisHis: 0.683 ± 0.119
1.769HisIle: 1.769 ± 0.177
1.296HisLys: 1.296 ± 0.156
1.856HisLeu: 1.856 ± 0.207
0.49HisMet: 0.49 ± 0.088
1.086HisAsn: 1.086 ± 0.152
1.103HisPro: 1.103 ± 0.142
0.578HisGln: 0.578 ± 0.093
1.173HisArg: 1.173 ± 0.141
1.121HisSer: 1.121 ± 0.136
1.541HisThr: 1.541 ± 0.197
1.453HisVal: 1.453 ± 0.189
0.368HisTrp: 0.368 ± 0.069
0.911HisTyr: 0.911 ± 0.121
0.0HisXaa: 0.0 ± 0.0
Ile
3.485IleAla: 3.485 ± 0.228
0.841IleCys: 0.841 ± 0.091
4.203IleAsp: 4.203 ± 0.348
4.763IleGlu: 4.763 ± 0.306
2.539IlePhe: 2.539 ± 0.234
3.765IleGly: 3.765 ± 0.272
1.944IleHis: 1.944 ± 0.239
3.712IleIle: 3.712 ± 0.285
5.148IleLys: 5.148 ± 0.31
5.429IleLeu: 5.429 ± 0.309
1.734IleMet: 1.734 ± 0.164
3.818IleAsn: 3.818 ± 0.27
3.135IlePro: 3.135 ± 0.241
2.679IleGln: 2.679 ± 0.175
2.679IleArg: 2.679 ± 0.183
3.59IleSer: 3.59 ± 0.231
4.01IleThr: 4.01 ± 0.439
3.94IleVal: 3.94 ± 0.338
0.508IleTrp: 0.508 ± 0.097
2.434IleTyr: 2.434 ± 0.211
0.018IleXaa: 0.018 ± 0.017
Lys
3.712LysAla: 3.712 ± 0.387
1.471LysCys: 1.471 ± 0.207
3.888LysAsp: 3.888 ± 0.346
4.851LysGlu: 4.851 ± 0.404
3.397LysPhe: 3.397 ± 0.302
3.572LysGly: 3.572 ± 0.294
1.594LysHis: 1.594 ± 0.157
5.148LysIle: 5.148 ± 0.33
7.18LysLys: 7.18 ± 0.602
5.884LysLeu: 5.884 ± 0.439
2.627LysMet: 2.627 ± 0.256
5.131LysAsn: 5.131 ± 0.578
3.1LysPro: 3.1 ± 0.265
2.959LysGln: 2.959 ± 0.201
4.325LysArg: 4.325 ± 0.507
4.483LysSer: 4.483 ± 0.345
4.921LysThr: 4.921 ± 0.361
4.623LysVal: 4.623 ± 0.291
0.683LysTrp: 0.683 ± 0.104
3.17LysTyr: 3.17 ± 0.227
0.018LysXaa: 0.018 ± 0.017
Leu
4.571LeuAla: 4.571 ± 0.324
1.383LeuCys: 1.383 ± 0.185
4.553LeuAsp: 4.553 ± 0.304
5.831LeuGlu: 5.831 ± 0.3
2.907LeuPhe: 2.907 ± 0.24
4.36LeuGly: 4.36 ± 0.315
1.366LeuHis: 1.366 ± 0.139
5.008LeuIle: 5.008 ± 0.253
7.04LeuLys: 7.04 ± 0.503
6.269LeuLeu: 6.269 ± 0.374
2.277LeuMet: 2.277 ± 0.215
5.131LeuAsn: 5.131 ± 0.769
3.082LeuPro: 3.082 ± 0.248
2.452LeuGln: 2.452 ± 0.291
4.676LeuArg: 4.676 ± 0.304
4.851LeuSer: 4.851 ± 0.309
4.483LeuThr: 4.483 ± 0.278
5.516LeuVal: 5.516 ± 0.351
0.788LeuTrp: 0.788 ± 0.13
3.065LeuTyr: 3.065 ± 0.254
0.0LeuXaa: 0.0 ± 0.0
Met
1.576MetAla: 1.576 ± 0.16
0.665MetCys: 0.665 ± 0.103
1.471MetAsp: 1.471 ± 0.191
1.804MetGlu: 1.804 ± 0.193
1.436MetPhe: 1.436 ± 0.158
1.646MetGly: 1.646 ± 0.169
0.578MetHis: 0.578 ± 0.1
1.629MetIle: 1.629 ± 0.174
2.627MetLys: 2.627 ± 0.269
1.996MetLeu: 1.996 ± 0.24
0.911MetMet: 0.911 ± 0.15
2.066MetAsn: 2.066 ± 0.268
0.963MetPro: 0.963 ± 0.128
0.7MetGln: 0.7 ± 0.1
1.401MetArg: 1.401 ± 0.149
2.522MetSer: 2.522 ± 0.222
1.453MetThr: 1.453 ± 0.174
1.786MetVal: 1.786 ± 0.18
0.385MetTrp: 0.385 ± 0.086
1.348MetTyr: 1.348 ± 0.169
0.0MetXaa: 0.0 ± 0.0
Asn
4.098AsnAla: 4.098 ± 0.786
0.648AsnCys: 0.648 ± 0.107
2.819AsnAsp: 2.819 ± 0.201
3.397AsnGlu: 3.397 ± 0.297
2.767AsnPhe: 2.767 ± 0.259
3.345AsnGly: 3.345 ± 0.236
1.156AsnHis: 1.156 ± 0.148
3.958AsnIle: 3.958 ± 0.412
4.08AsnLys: 4.08 ± 0.625
5.183AsnLeu: 5.183 ± 0.404
1.751AsnMet: 1.751 ± 0.205
3.45AsnAsn: 3.45 ± 0.437
2.154AsnPro: 2.154 ± 0.21
2.031AsnGln: 2.031 ± 0.281
2.732AsnArg: 2.732 ± 0.478
2.889AsnSer: 2.889 ± 0.253
4.115AsnThr: 4.115 ± 0.582
4.378AsnVal: 4.378 ± 0.493
0.735AsnTrp: 0.735 ± 0.134
2.487AsnTyr: 2.487 ± 0.246
0.018AsnXaa: 0.018 ± 0.017
Pro
1.926ProAla: 1.926 ± 0.236
0.735ProCys: 0.735 ± 0.15
2.206ProAsp: 2.206 ± 0.186
3.38ProGlu: 3.38 ± 0.279
1.646ProPhe: 1.646 ± 0.172
2.609ProGly: 2.609 ± 0.27
0.788ProHis: 0.788 ± 0.127
2.224ProIle: 2.224 ± 0.205
3.397ProLys: 3.397 ± 0.33
2.627ProLeu: 2.627 ± 0.226
1.103ProMet: 1.103 ± 0.128
2.189ProAsn: 2.189 ± 0.207
2.119ProPro: 2.119 ± 0.255
2.031ProGln: 2.031 ± 0.183
1.734ProArg: 1.734 ± 0.201
2.819ProSer: 2.819 ± 0.342
2.924ProThr: 2.924 ± 0.298
2.592ProVal: 2.592 ± 0.231
0.368ProTrp: 0.368 ± 0.071
1.436ProTyr: 1.436 ± 0.195
0.018ProXaa: 0.018 ± 0.017
Gln
1.559GlnAla: 1.559 ± 0.165
0.613GlnCys: 0.613 ± 0.095
2.084GlnAsp: 2.084 ± 0.193
2.101GlnGlu: 2.101 ± 0.224
1.506GlnPhe: 1.506 ± 0.171
1.629GlnGly: 1.629 ± 0.193
0.718GlnHis: 0.718 ± 0.106
2.382GlnIle: 2.382 ± 0.198
2.959GlnLys: 2.959 ± 0.388
2.959GlnLeu: 2.959 ± 0.228
1.051GlnMet: 1.051 ± 0.16
1.856GlnAsn: 1.856 ± 0.204
1.769GlnPro: 1.769 ± 0.189
1.786GlnGln: 1.786 ± 0.193
1.926GlnArg: 1.926 ± 0.215
1.944GlnSer: 1.944 ± 0.163
1.996GlnThr: 1.996 ± 0.195
1.804GlnVal: 1.804 ± 0.204
0.333GlnTrp: 0.333 ± 0.082
1.121GlnTyr: 1.121 ± 0.148
0.0GlnXaa: 0.0 ± 0.0
Arg
2.959ArgAla: 2.959 ± 0.294
0.648ArgCys: 0.648 ± 0.103
3.187ArgAsp: 3.187 ± 0.233
3.572ArgGlu: 3.572 ± 0.391
2.154ArgPhe: 2.154 ± 0.215
3.03ArgGly: 3.03 ± 0.251
1.138ArgHis: 1.138 ± 0.157
3.31ArgIle: 3.31 ± 0.272
3.66ArgLys: 3.66 ± 0.464
3.765ArgLeu: 3.765 ± 0.257
1.453ArgMet: 1.453 ± 0.201
2.732ArgAsn: 2.732 ± 0.349
1.839ArgPro: 1.839 ± 0.182
1.821ArgGln: 1.821 ± 0.171
2.889ArgArg: 2.889 ± 0.27
2.784ArgSer: 2.784 ± 0.223
2.557ArgThr: 2.557 ± 0.241
3.607ArgVal: 3.607 ± 0.303
0.665ArgTrp: 0.665 ± 0.102
1.996ArgTyr: 1.996 ± 0.199
0.018ArgXaa: 0.018 ± 0.017
Ser
3.257SerAla: 3.257 ± 0.285
0.858SerCys: 0.858 ± 0.139
3.117SerAsp: 3.117 ± 0.267
4.028SerGlu: 4.028 ± 0.33
2.767SerPhe: 2.767 ± 0.223
4.991SerGly: 4.991 ± 0.527
1.156SerHis: 1.156 ± 0.154
3.8SerIle: 3.8 ± 0.29
4.203SerLys: 4.203 ± 0.33
4.921SerLeu: 4.921 ± 0.288
1.488SerMet: 1.488 ± 0.171
4.483SerAsn: 4.483 ± 0.565
2.066SerPro: 2.066 ± 0.183
2.294SerGln: 2.294 ± 0.178
2.679SerArg: 2.679 ± 0.183
4.518SerSer: 4.518 ± 0.552
4.098SerThr: 4.098 ± 0.391
3.975SerVal: 3.975 ± 0.28
0.595SerTrp: 0.595 ± 0.107
2.119SerTyr: 2.119 ± 0.187
0.0SerXaa: 0.0 ± 0.0
Thr
3.607ThrAla: 3.607 ± 0.377
0.928ThrCys: 0.928 ± 0.143
3.677ThrAsp: 3.677 ± 0.465
3.222ThrGlu: 3.222 ± 0.23
2.452ThrPhe: 2.452 ± 0.21
4.781ThrGly: 4.781 ± 0.538
1.524ThrHis: 1.524 ± 0.232
3.888ThrIle: 3.888 ± 0.316
4.641ThrLys: 4.641 ± 0.275
4.588ThrLeu: 4.588 ± 0.281
1.559ThrMet: 1.559 ± 0.184
3.712ThrAsn: 3.712 ± 0.421
3.205ThrPro: 3.205 ± 0.367
2.031ThrGln: 2.031 ± 0.217
3.065ThrArg: 3.065 ± 0.258
4.168ThrSer: 4.168 ± 0.535
4.168ThrThr: 4.168 ± 0.719
3.502ThrVal: 3.502 ± 0.268
0.665ThrTrp: 0.665 ± 0.121
2.277ThrTyr: 2.277 ± 0.228
0.0ThrXaa: 0.0 ± 0.0
Val
4.203ValAla: 4.203 ± 0.507
1.629ValCys: 1.629 ± 0.162
3.467ValAsp: 3.467 ± 0.271
4.343ValGlu: 4.343 ± 0.277
2.837ValPhe: 2.837 ± 0.278
3.94ValGly: 3.94 ± 0.401
1.366ValHis: 1.366 ± 0.197
3.642ValIle: 3.642 ± 0.281
5.376ValLys: 5.376 ± 0.336
4.343ValLeu: 4.343 ± 0.288
1.786ValMet: 1.786 ± 0.184
3.467ValAsn: 3.467 ± 0.431
3.257ValPro: 3.257 ± 0.315
2.084ValGln: 2.084 ± 0.18
3.012ValArg: 3.012 ± 0.279
4.133ValSer: 4.133 ± 0.289
3.783ValThr: 3.783 ± 0.293
4.465ValVal: 4.465 ± 0.312
0.788ValTrp: 0.788 ± 0.122
2.504ValTyr: 2.504 ± 0.266
0.0ValXaa: 0.0 ± 0.0
Trp
0.438TrpAla: 0.438 ± 0.094
0.35TrpCys: 0.35 ± 0.087
0.578TrpAsp: 0.578 ± 0.115
0.718TrpGlu: 0.718 ± 0.124
0.473TrpPhe: 0.473 ± 0.086
0.823TrpGly: 0.823 ± 0.112
0.175TrpHis: 0.175 ± 0.049
1.016TrpIle: 1.016 ± 0.151
0.911TrpLys: 0.911 ± 0.132
0.858TrpLeu: 0.858 ± 0.138
0.123TrpMet: 0.123 ± 0.045
0.735TrpAsn: 0.735 ± 0.098
0.368TrpPro: 0.368 ± 0.088
0.28TrpGln: 0.28 ± 0.084
0.403TrpArg: 0.403 ± 0.094
0.876TrpSer: 0.876 ± 0.126
0.49TrpThr: 0.49 ± 0.096
0.683TrpVal: 0.683 ± 0.137
0.21TrpTrp: 0.21 ± 0.062
0.35TrpTyr: 0.35 ± 0.075
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.557TyrAla: 2.557 ± 0.214
0.543TyrCys: 0.543 ± 0.098
2.627TyrAsp: 2.627 ± 0.202
2.487TyrGlu: 2.487 ± 0.273
1.699TyrPhe: 1.699 ± 0.192
2.644TyrGly: 2.644 ± 0.247
0.806TyrHis: 0.806 ± 0.13
2.784TyrIle: 2.784 ± 0.245
2.994TyrLys: 2.994 ± 0.267
3.8TyrLeu: 3.8 ± 0.293
1.401TyrMet: 1.401 ± 0.158
2.119TyrAsn: 2.119 ± 0.226
1.313TyrPro: 1.313 ± 0.171
1.086TyrGln: 1.086 ± 0.117
1.821TyrArg: 1.821 ± 0.193
2.644TyrSer: 2.644 ± 0.266
2.469TyrThr: 2.469 ± 0.294
2.014TyrVal: 2.014 ± 0.186
0.35TyrTrp: 0.35 ± 0.08
1.734TyrTyr: 1.734 ± 0.181
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.018XaaCys: 0.018 ± 0.017
0.018XaaAsp: 0.018 ± 0.017
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.018XaaPro: 0.018 ± 0.017
0.0XaaGln: 0.0 ± 0.0
0.018XaaArg: 0.018 ± 0.017
0.018XaaSer: 0.018 ± 0.017
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.018XaaXaa: 0.018 ± 0.017
Statistics based on 237 proteins (57106 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski