Amino acid dipepetide frequency for Glypta fumiferanae ichnovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.091AlaAla: 2.091 ± 0.303
0.972AlaCys: 0.972 ± 0.261
2.383AlaAsp: 2.383 ± 0.363
3.647AlaGlu: 3.647 ± 0.394
1.799AlaPhe: 1.799 ± 0.269
2.091AlaGly: 2.091 ± 0.379
0.729AlaHis: 0.729 ± 0.185
3.793AlaIle: 3.793 ± 0.494
2.869AlaLys: 2.869 ± 0.43
3.404AlaLeu: 3.404 ± 0.36
1.41AlaMet: 1.41 ± 0.258
2.917AlaAsn: 2.917 ± 0.321
1.799AlaPro: 1.799 ± 0.319
1.313AlaGln: 1.313 ± 0.223
2.528AlaArg: 2.528 ± 0.365
2.966AlaSer: 2.966 ± 0.315
2.674AlaThr: 2.674 ± 0.344
3.695AlaVal: 3.695 ± 0.342
0.34AlaTrp: 0.34 ± 0.136
0.972AlaTyr: 0.972 ± 0.237
0.0AlaXaa: 0.0 ± 0.0
Cys
2.237CysAla: 2.237 ± 0.312
0.438CysCys: 0.438 ± 0.133
1.361CysAsp: 1.361 ± 0.244
1.653CysGlu: 1.653 ± 0.245
1.021CysPhe: 1.021 ± 0.262
1.118CysGly: 1.118 ± 0.276
0.438CysHis: 0.438 ± 0.188
1.459CysIle: 1.459 ± 0.315
1.361CysLys: 1.361 ± 0.262
2.334CysLeu: 2.334 ± 0.548
0.729CysMet: 0.729 ± 0.196
1.507CysAsn: 1.507 ± 0.276
1.361CysPro: 1.361 ± 0.246
0.778CysGln: 0.778 ± 0.186
1.41CysArg: 1.41 ± 0.261
1.507CysSer: 1.507 ± 0.263
0.972CysThr: 0.972 ± 0.199
5.446CysVal: 5.446 ± 2.244
0.583CysTrp: 0.583 ± 0.143
0.827CysTyr: 0.827 ± 0.181
0.0CysXaa: 0.0 ± 0.0
Asp
1.556AspAla: 1.556 ± 0.201
0.827AspCys: 0.827 ± 0.187
3.452AspAsp: 3.452 ± 0.656
4.182AspGlu: 4.182 ± 0.442
3.306AspPhe: 3.306 ± 0.432
2.188AspGly: 2.188 ± 0.285
1.799AspHis: 1.799 ± 0.335
4.668AspIle: 4.668 ± 0.43
2.237AspLys: 2.237 ± 0.371
4.328AspLeu: 4.328 ± 0.512
1.702AspMet: 1.702 ± 0.215
3.744AspAsn: 3.744 ± 0.566
1.556AspPro: 1.556 ± 0.249
1.264AspGln: 1.264 ± 0.257
2.334AspArg: 2.334 ± 0.426
4.376AspSer: 4.376 ± 0.611
2.966AspThr: 2.966 ± 0.343
3.258AspVal: 3.258 ± 0.422
0.681AspTrp: 0.681 ± 0.184
2.285AspTyr: 2.285 ± 0.257
0.0AspXaa: 0.0 ± 0.0
Glu
2.674GluAla: 2.674 ± 0.323
0.924GluCys: 0.924 ± 0.197
2.917GluAsp: 2.917 ± 0.497
4.182GluGlu: 4.182 ± 0.431
2.674GluPhe: 2.674 ± 0.422
1.313GluGly: 1.313 ± 0.316
1.216GluHis: 1.216 ± 0.258
5.057GluIle: 5.057 ± 0.471
5.883GluLys: 5.883 ± 0.493
4.425GluLeu: 4.425 ± 0.504
1.75GluMet: 1.75 ± 0.318
5.349GluAsn: 5.349 ± 0.665
1.605GluPro: 1.605 ± 0.264
2.577GluGln: 2.577 ± 0.311
2.917GluArg: 2.917 ± 0.355
3.306GluSer: 3.306 ± 0.355
3.744GluThr: 3.744 ± 0.347
2.577GluVal: 2.577 ± 0.369
0.924GluTrp: 0.924 ± 0.176
2.383GluTyr: 2.383 ± 0.321
0.0GluXaa: 0.0 ± 0.0
Phe
1.896PheAla: 1.896 ± 0.345
2.139PheCys: 2.139 ± 0.359
2.82PheAsp: 2.82 ± 0.293
3.209PheGlu: 3.209 ± 0.408
3.258PhePhe: 3.258 ± 0.548
1.556PheGly: 1.556 ± 0.234
1.945PheHis: 1.945 ± 0.288
6.37PheIle: 6.37 ± 0.554
1.896PheLys: 1.896 ± 0.29
5.057PheLeu: 5.057 ± 0.422
0.875PheMet: 0.875 ± 0.247
2.237PheAsn: 2.237 ± 0.355
1.799PhePro: 1.799 ± 0.3
1.118PheGln: 1.118 ± 0.233
1.896PheArg: 1.896 ± 0.266
2.772PheSer: 2.772 ± 0.318
2.042PheThr: 2.042 ± 0.278
3.355PheVal: 3.355 ± 0.615
0.632PheTrp: 0.632 ± 0.17
2.188PheTyr: 2.188 ± 0.347
0.0PheXaa: 0.0 ± 0.0
Gly
1.167GlyAla: 1.167 ± 0.265
0.972GlyCys: 0.972 ± 0.207
1.507GlyAsp: 1.507 ± 0.257
1.848GlyGlu: 1.848 ± 0.309
2.042GlyPhe: 2.042 ± 0.28
1.361GlyGly: 1.361 ± 0.336
0.924GlyHis: 0.924 ± 0.193
2.772GlyIle: 2.772 ± 0.414
2.869GlyLys: 2.869 ± 0.388
2.674GlyLeu: 2.674 ± 0.383
1.021GlyMet: 1.021 ± 0.208
2.869GlyAsn: 2.869 ± 0.436
1.167GlyPro: 1.167 ± 0.246
0.729GlyGln: 0.729 ± 0.176
2.383GlyArg: 2.383 ± 0.37
2.577GlySer: 2.577 ± 0.351
1.994GlyThr: 1.994 ± 0.261
1.994GlyVal: 1.994 ± 0.3
0.486GlyTrp: 0.486 ± 0.136
1.41GlyTyr: 1.41 ± 0.227
0.0GlyXaa: 0.0 ± 0.0
His
2.042HisAla: 2.042 ± 0.404
0.827HisCys: 0.827 ± 0.175
1.216HisAsp: 1.216 ± 0.188
0.924HisGlu: 0.924 ± 0.262
1.216HisPhe: 1.216 ± 0.282
1.799HisGly: 1.799 ± 0.261
0.827HisHis: 0.827 ± 0.192
1.507HisIle: 1.507 ± 0.294
1.507HisLys: 1.507 ± 0.226
2.383HisLeu: 2.383 ± 0.372
0.729HisMet: 0.729 ± 0.188
1.507HisAsn: 1.507 ± 0.237
0.972HisPro: 0.972 ± 0.227
0.729HisGln: 0.729 ± 0.155
1.459HisArg: 1.459 ± 0.311
2.528HisSer: 2.528 ± 0.336
0.729HisThr: 0.729 ± 0.204
1.653HisVal: 1.653 ± 0.291
0.292HisTrp: 0.292 ± 0.112
2.091HisTyr: 2.091 ± 0.341
0.0HisXaa: 0.0 ± 0.0
Ile
4.862IleAla: 4.862 ± 0.425
1.799IleCys: 1.799 ± 0.27
4.862IleAsp: 4.862 ± 0.518
4.522IleGlu: 4.522 ± 0.54
5.106IlePhe: 5.106 ± 0.479
1.945IleGly: 1.945 ± 0.32
2.772IleHis: 2.772 ± 0.301
6.564IleIle: 6.564 ± 0.594
4.328IleLys: 4.328 ± 0.492
6.467IleLeu: 6.467 ± 0.502
2.042IleMet: 2.042 ± 0.238
5.106IleAsn: 5.106 ± 0.433
3.598IlePro: 3.598 ± 0.371
2.626IleGln: 2.626 ± 0.415
4.279IleArg: 4.279 ± 0.417
4.717IleSer: 4.717 ± 0.449
3.55IleThr: 3.55 ± 0.383
6.905IleVal: 6.905 ± 0.571
1.361IleTrp: 1.361 ± 0.247
2.674IleTyr: 2.674 ± 0.401
0.0IleXaa: 0.0 ± 0.0
Lys
2.285LysAla: 2.285 ± 0.378
1.945LysCys: 1.945 ± 0.255
1.653LysAsp: 1.653 ± 0.348
3.987LysGlu: 3.987 ± 0.476
4.571LysPhe: 4.571 ± 0.425
1.75LysGly: 1.75 ± 0.275
1.459LysHis: 1.459 ± 0.267
5.981LysIle: 5.981 ± 0.53
9.725LysLys: 9.725 ± 0.948
6.467LysLeu: 6.467 ± 0.61
2.869LysMet: 2.869 ± 0.297
5.008LysAsn: 5.008 ± 0.481
1.945LysPro: 1.945 ± 0.312
1.653LysGln: 1.653 ± 0.263
3.744LysArg: 3.744 ± 0.484
4.668LysSer: 4.668 ± 0.598
4.279LysThr: 4.279 ± 0.553
2.528LysVal: 2.528 ± 0.344
0.535LysTrp: 0.535 ± 0.188
4.279LysTyr: 4.279 ± 0.344
0.0LysXaa: 0.0 ± 0.0
Leu
3.987LeuAla: 3.987 ± 0.421
2.48LeuCys: 2.48 ± 0.553
4.473LeuAsp: 4.473 ± 0.482
5.154LeuGlu: 5.154 ± 0.518
3.89LeuPhe: 3.89 ± 0.376
2.723LeuGly: 2.723 ± 0.474
1.459LeuHis: 1.459 ± 0.245
5.835LeuIle: 5.835 ± 0.502
6.272LeuLys: 6.272 ± 0.539
7.634LeuLeu: 7.634 ± 0.591
2.237LeuMet: 2.237 ± 0.385
5.543LeuAsn: 5.543 ± 0.528
3.015LeuPro: 3.015 ± 0.429
4.182LeuGln: 4.182 ± 0.498
5.883LeuArg: 5.883 ± 0.523
7.537LeuSer: 7.537 ± 0.671
5.3LeuThr: 5.3 ± 0.601
3.939LeuVal: 3.939 ± 0.488
1.507LeuTrp: 1.507 ± 0.254
2.82LeuTyr: 2.82 ± 0.326
0.0LeuXaa: 0.0 ± 0.0
Met
1.41MetAla: 1.41 ± 0.254
0.438MetCys: 0.438 ± 0.136
2.237MetAsp: 2.237 ± 0.306
1.848MetGlu: 1.848 ± 0.303
0.875MetPhe: 0.875 ± 0.204
1.313MetGly: 1.313 ± 0.262
0.778MetHis: 0.778 ± 0.201
3.015MetIle: 3.015 ± 0.311
3.355MetLys: 3.355 ± 0.354
2.237MetLeu: 2.237 ± 0.298
1.021MetMet: 1.021 ± 0.243
1.945MetAsn: 1.945 ± 0.324
0.729MetPro: 0.729 ± 0.168
0.535MetGln: 0.535 ± 0.171
1.896MetArg: 1.896 ± 0.264
2.383MetSer: 2.383 ± 0.315
1.264MetThr: 1.264 ± 0.242
1.653MetVal: 1.653 ± 0.28
0.389MetTrp: 0.389 ± 0.14
1.167MetTyr: 1.167 ± 0.229
0.0MetXaa: 0.0 ± 0.0
Asn
2.772AsnAla: 2.772 ± 0.437
1.264AsnCys: 1.264 ± 0.234
3.355AsnAsp: 3.355 ± 0.347
4.862AsnGlu: 4.862 ± 0.475
3.112AsnPhe: 3.112 ± 0.324
3.258AsnGly: 3.258 ± 0.317
1.264AsnHis: 1.264 ± 0.308
4.376AsnIle: 4.376 ± 0.382
4.182AsnLys: 4.182 ± 0.506
4.23AsnLeu: 4.23 ± 0.44
1.896AsnMet: 1.896 ± 0.388
4.328AsnAsn: 4.328 ± 0.526
1.653AsnPro: 1.653 ± 0.295
1.507AsnGln: 1.507 ± 0.265
4.084AsnArg: 4.084 ± 0.436
4.619AsnSer: 4.619 ± 0.497
3.987AsnThr: 3.987 ± 0.397
4.765AsnVal: 4.765 ± 0.471
1.118AsnTrp: 1.118 ± 0.239
2.528AsnTyr: 2.528 ± 0.312
0.0AsnXaa: 0.0 ± 0.0
Pro
1.507ProAla: 1.507 ± 0.264
1.605ProCys: 1.605 ± 0.259
1.75ProAsp: 1.75 ± 0.271
1.75ProGlu: 1.75 ± 0.315
1.07ProPhe: 1.07 ± 0.236
1.167ProGly: 1.167 ± 0.233
0.681ProHis: 0.681 ± 0.175
3.695ProIle: 3.695 ± 0.3
2.383ProLys: 2.383 ± 0.276
2.383ProLeu: 2.383 ± 0.354
1.507ProMet: 1.507 ± 0.214
1.896ProAsn: 1.896 ± 0.315
1.556ProPro: 1.556 ± 0.262
1.07ProGln: 1.07 ± 0.249
1.556ProArg: 1.556 ± 0.2
3.063ProSer: 3.063 ± 0.346
1.994ProThr: 1.994 ± 0.32
1.702ProVal: 1.702 ± 0.274
0.389ProTrp: 0.389 ± 0.161
1.507ProTyr: 1.507 ± 0.297
0.0ProXaa: 0.0 ± 0.0
Gln
2.042GlnAla: 2.042 ± 0.298
0.875GlnCys: 0.875 ± 0.181
1.118GlnAsp: 1.118 ± 0.243
1.702GlnGlu: 1.702 ± 0.273
1.361GlnPhe: 1.361 ± 0.251
1.216GlnGly: 1.216 ± 0.238
0.924GlnHis: 0.924 ± 0.205
1.896GlnIle: 1.896 ± 0.256
2.626GlnLys: 2.626 ± 0.342
3.452GlnLeu: 3.452 ± 0.404
1.167GlnMet: 1.167 ± 0.188
2.383GlnAsn: 2.383 ± 0.258
0.924GlnPro: 0.924 ± 0.195
1.021GlnGln: 1.021 ± 0.216
2.285GlnArg: 2.285 ± 0.327
2.577GlnSer: 2.577 ± 0.333
1.507GlnThr: 1.507 ± 0.262
1.118GlnVal: 1.118 ± 0.264
0.486GlnTrp: 0.486 ± 0.169
1.459GlnTyr: 1.459 ± 0.273
0.0GlnXaa: 0.0 ± 0.0
Arg
2.431ArgAla: 2.431 ± 0.301
1.605ArgCys: 1.605 ± 0.309
3.063ArgAsp: 3.063 ± 0.363
2.188ArgGlu: 2.188 ± 0.279
1.799ArgPhe: 1.799 ± 0.249
1.945ArgGly: 1.945 ± 0.258
2.237ArgHis: 2.237 ± 0.309
3.841ArgIle: 3.841 ± 0.396
4.133ArgLys: 4.133 ± 0.372
5.106ArgLeu: 5.106 ± 0.555
1.848ArgMet: 1.848 ± 0.315
3.258ArgAsn: 3.258 ± 0.436
1.75ArgPro: 1.75 ± 0.343
2.674ArgGln: 2.674 ± 0.433
4.717ArgArg: 4.717 ± 0.429
5.446ArgSer: 5.446 ± 0.58
3.209ArgThr: 3.209 ± 0.385
3.161ArgVal: 3.161 ± 0.364
0.486ArgTrp: 0.486 ± 0.132
2.82ArgTyr: 2.82 ± 0.324
0.0ArgXaa: 0.0 ± 0.0
Ser
3.258SerAla: 3.258 ± 0.394
1.216SerCys: 1.216 ± 0.27
4.522SerAsp: 4.522 ± 0.512
4.084SerGlu: 4.084 ± 0.468
2.917SerPhe: 2.917 ± 0.43
2.917SerGly: 2.917 ± 0.338
1.507SerHis: 1.507 ± 0.277
5.495SerIle: 5.495 ± 0.494
4.668SerLys: 4.668 ± 0.554
7.196SerLeu: 7.196 ± 0.606
1.896SerMet: 1.896 ± 0.272
4.279SerAsn: 4.279 ± 0.457
1.75SerPro: 1.75 ± 0.274
2.626SerGln: 2.626 ± 0.383
4.522SerArg: 4.522 ± 0.454
6.71SerSer: 6.71 ± 0.643
5.251SerThr: 5.251 ± 0.619
6.029SerVal: 6.029 ± 0.664
0.875SerTrp: 0.875 ± 0.199
2.334SerTyr: 2.334 ± 0.376
0.0SerXaa: 0.0 ± 0.0
Thr
1.945ThrAla: 1.945 ± 0.331
1.264ThrCys: 1.264 ± 0.232
3.55ThrAsp: 3.55 ± 0.409
3.404ThrGlu: 3.404 ± 0.35
2.869ThrPhe: 2.869 ± 0.344
1.945ThrGly: 1.945 ± 0.278
1.313ThrHis: 1.313 ± 0.264
4.814ThrIle: 4.814 ± 0.45
3.939ThrLys: 3.939 ± 0.409
4.522ThrLeu: 4.522 ± 0.459
2.48ThrMet: 2.48 ± 0.348
4.133ThrAsn: 4.133 ± 0.394
1.216ThrPro: 1.216 ± 0.283
1.945ThrGln: 1.945 ± 0.315
3.063ThrArg: 3.063 ± 0.439
3.306ThrSer: 3.306 ± 0.39
3.452ThrThr: 3.452 ± 0.465
3.647ThrVal: 3.647 ± 0.396
0.34ThrTrp: 0.34 ± 0.129
2.285ThrTyr: 2.285 ± 0.412
0.0ThrXaa: 0.0 ± 0.0
Val
1.945ValAla: 1.945 ± 0.372
5.057ValCys: 5.057 ± 2.304
3.404ValAsp: 3.404 ± 0.435
2.528ValGlu: 2.528 ± 0.349
3.161ValPhe: 3.161 ± 0.431
1.75ValGly: 1.75 ± 0.305
2.431ValHis: 2.431 ± 0.257
4.23ValIle: 4.23 ± 0.482
4.862ValLys: 4.862 ± 0.412
7.05ValLeu: 7.05 ± 0.566
1.41ValMet: 1.41 ± 0.27
3.063ValAsn: 3.063 ± 0.404
3.306ValPro: 3.306 ± 0.404
2.139ValGln: 2.139 ± 0.248
3.355ValArg: 3.355 ± 0.426
5.057ValSer: 5.057 ± 0.669
2.966ValThr: 2.966 ± 0.401
6.418ValVal: 6.418 ± 0.664
0.827ValTrp: 0.827 ± 0.186
1.896ValTyr: 1.896 ± 0.332
0.0ValXaa: 0.0 ± 0.0
Trp
0.681TrpAla: 0.681 ± 0.164
0.243TrpCys: 0.243 ± 0.116
0.827TrpAsp: 0.827 ± 0.174
0.583TrpGlu: 0.583 ± 0.167
0.778TrpPhe: 0.778 ± 0.192
0.292TrpGly: 0.292 ± 0.121
0.438TrpHis: 0.438 ± 0.149
0.827TrpIle: 0.827 ± 0.194
0.632TrpLys: 0.632 ± 0.158
1.021TrpLeu: 1.021 ± 0.224
0.292TrpMet: 0.292 ± 0.144
0.583TrpAsn: 0.583 ± 0.162
1.118TrpPro: 1.118 ± 0.298
0.729TrpGln: 0.729 ± 0.162
1.507TrpArg: 1.507 ± 0.221
1.605TrpSer: 1.605 ± 0.294
0.827TrpThr: 0.827 ± 0.241
0.194TrpVal: 0.194 ± 0.096
0.0TrpTrp: 0.0 ± 0.0
0.146TrpTyr: 0.146 ± 0.08
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.605TyrAla: 1.605 ± 0.286
1.653TyrCys: 1.653 ± 0.295
2.237TyrAsp: 2.237 ± 0.366
2.188TyrGlu: 2.188 ± 0.294
2.237TyrPhe: 2.237 ± 0.3
1.118TyrGly: 1.118 ± 0.226
1.799TyrHis: 1.799 ± 0.307
3.89TyrIle: 3.89 ± 0.44
1.605TyrLys: 1.605 ± 0.267
3.501TyrLeu: 3.501 ± 0.436
1.556TyrMet: 1.556 ± 0.26
1.507TyrAsn: 1.507 ± 0.263
1.41TyrPro: 1.41 ± 0.216
0.924TyrGln: 0.924 ± 0.266
1.896TyrArg: 1.896 ± 0.354
2.334TyrSer: 2.334 ± 0.32
2.723TyrThr: 2.723 ± 0.416
2.82TyrVal: 2.82 ± 0.298
1.07TyrTrp: 1.07 ± 0.195
2.431TyrTyr: 2.431 ± 0.313
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 101 proteins (20567 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski