Amino acid dipepetide frequency for Micromonas pusilla reovirus (isolate Netherlands/2005) (MpRV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.122AlaAla: 6.122 ± 1.406
0.5AlaCys: 0.5 ± 0.477
4.873AlaAsp: 4.873 ± 0.559
3.873AlaGlu: 3.873 ± 1.155
2.624AlaPhe: 2.624 ± 0.868
5.247AlaGly: 5.247 ± 1.062
1.124AlaHis: 1.124 ± 0.303
4.373AlaIle: 4.373 ± 0.631
3.623AlaLys: 3.623 ± 0.891
6.872AlaLeu: 6.872 ± 0.829
1.999AlaMet: 1.999 ± 0.515
3.498AlaAsn: 3.498 ± 0.699
4.623AlaPro: 4.623 ± 0.897
2.124AlaGln: 2.124 ± 0.525
4.998AlaArg: 4.998 ± 0.83
5.872AlaSer: 5.872 ± 1.799
4.998AlaThr: 4.998 ± 0.932
4.248AlaVal: 4.248 ± 0.624
0.5AlaTrp: 0.5 ± 0.282
3.498AlaTyr: 3.498 ± 0.767
0.0AlaXaa: 0.0 ± 0.0
Cys
1.0CysAla: 1.0 ± 0.491
0.125CysCys: 0.125 ± 0.119
0.625CysAsp: 0.625 ± 0.37
0.375CysGlu: 0.375 ± 0.241
0.25CysPhe: 0.25 ± 0.163
0.25CysGly: 0.25 ± 0.172
0.125CysHis: 0.125 ± 0.12
0.25CysIle: 0.25 ± 0.165
0.375CysLys: 0.375 ± 0.192
1.0CysLeu: 1.0 ± 0.376
0.125CysMet: 0.125 ± 0.119
0.375CysAsn: 0.375 ± 0.17
0.0CysPro: 0.0 ± 0.0
0.125CysGln: 0.125 ± 0.102
0.5CysArg: 0.5 ± 0.251
0.875CysSer: 0.875 ± 0.259
0.375CysThr: 0.375 ± 0.194
1.124CysVal: 1.124 ± 0.34
0.25CysTrp: 0.25 ± 0.188
0.375CysTyr: 0.375 ± 0.196
0.0CysXaa: 0.0 ± 0.0
Asp
5.997AspAla: 5.997 ± 0.991
0.25AspCys: 0.25 ± 0.155
5.497AspAsp: 5.497 ± 0.584
4.873AspGlu: 4.873 ± 0.583
2.499AspPhe: 2.499 ± 0.795
4.373AspGly: 4.373 ± 1.164
1.124AspHis: 1.124 ± 0.21
4.623AspIle: 4.623 ± 0.711
2.749AspLys: 2.749 ± 0.521
5.622AspLeu: 5.622 ± 0.748
1.999AspMet: 1.999 ± 0.668
2.499AspAsn: 2.499 ± 0.595
3.623AspPro: 3.623 ± 0.836
1.374AspGln: 1.374 ± 0.526
3.123AspArg: 3.123 ± 1.017
5.247AspSer: 5.247 ± 0.631
4.873AspThr: 4.873 ± 0.97
3.373AspVal: 3.373 ± 0.472
0.5AspTrp: 0.5 ± 0.132
2.499AspTyr: 2.499 ± 0.438
0.0AspXaa: 0.0 ± 0.0
Glu
3.873GluAla: 3.873 ± 0.964
0.75GluCys: 0.75 ± 0.224
3.373GluAsp: 3.373 ± 0.903
2.999GluGlu: 2.999 ± 1.103
2.374GluPhe: 2.374 ± 0.738
4.123GluGly: 4.123 ± 0.541
1.249GluHis: 1.249 ± 0.589
3.373GluIle: 3.373 ± 0.644
2.999GluLys: 2.999 ± 0.652
5.247GluLeu: 5.247 ± 0.741
1.0GluMet: 1.0 ± 0.453
2.749GluAsn: 2.749 ± 0.401
1.499GluPro: 1.499 ± 0.574
1.624GluGln: 1.624 ± 0.474
2.999GluArg: 2.999 ± 0.281
4.623GluSer: 4.623 ± 0.503
3.748GluThr: 3.748 ± 0.453
3.748GluVal: 3.748 ± 0.452
0.625GluTrp: 0.625 ± 0.383
2.999GluTyr: 2.999 ± 0.527
0.0GluXaa: 0.0 ± 0.0
Phe
2.124PheAla: 2.124 ± 0.536
0.625PheCys: 0.625 ± 0.28
2.124PheAsp: 2.124 ± 0.473
2.374PheGlu: 2.374 ± 0.442
1.124PhePhe: 1.124 ± 0.312
2.124PheGly: 2.124 ± 0.491
0.375PheHis: 0.375 ± 0.167
2.124PheIle: 2.124 ± 0.626
1.499PheLys: 1.499 ± 0.529
2.249PheLeu: 2.249 ± 0.594
0.625PheMet: 0.625 ± 0.25
2.874PheAsn: 2.874 ± 0.57
1.499PhePro: 1.499 ± 0.374
0.625PheGln: 0.625 ± 0.457
1.499PheArg: 1.499 ± 0.329
2.124PheSer: 2.124 ± 0.359
3.498PheThr: 3.498 ± 0.63
2.874PheVal: 2.874 ± 0.606
0.375PheTrp: 0.375 ± 0.207
0.875PheTyr: 0.875 ± 0.327
0.0PheXaa: 0.0 ± 0.0
Gly
5.872GlyAla: 5.872 ± 0.984
0.125GlyCys: 0.125 ± 0.141
4.623GlyAsp: 4.623 ± 0.392
2.124GlyGlu: 2.124 ± 0.497
1.999GlyPhe: 1.999 ± 0.478
4.498GlyGly: 4.498 ± 0.694
1.374GlyHis: 1.374 ± 0.233
3.998GlyIle: 3.998 ± 0.648
2.749GlyLys: 2.749 ± 0.436
4.873GlyLeu: 4.873 ± 0.915
0.875GlyMet: 0.875 ± 0.361
4.873GlyAsn: 4.873 ± 1.081
1.749GlyPro: 1.749 ± 0.463
1.999GlyGln: 1.999 ± 0.55
3.998GlyArg: 3.998 ± 0.792
4.623GlySer: 4.623 ± 1.136
5.622GlyThr: 5.622 ± 1.799
5.122GlyVal: 5.122 ± 0.843
0.375GlyTrp: 0.375 ± 0.27
2.124GlyTyr: 2.124 ± 0.618
0.0GlyXaa: 0.0 ± 0.0
His
1.999HisAla: 1.999 ± 0.566
0.5HisCys: 0.5 ± 0.262
1.374HisAsp: 1.374 ± 0.363
1.249HisGlu: 1.249 ± 0.432
0.75HisPhe: 0.75 ± 0.265
2.124HisGly: 2.124 ± 0.457
0.5HisHis: 0.5 ± 0.22
1.374HisIle: 1.374 ± 0.455
0.625HisLys: 0.625 ± 0.231
1.374HisLeu: 1.374 ± 0.743
0.75HisMet: 0.75 ± 0.222
1.624HisAsn: 1.624 ± 0.626
0.5HisPro: 0.5 ± 0.166
0.5HisGln: 0.5 ± 0.234
1.374HisArg: 1.374 ± 0.367
1.124HisSer: 1.124 ± 0.354
1.249HisThr: 1.249 ± 0.476
2.249HisVal: 2.249 ± 0.611
0.25HisTrp: 0.25 ± 0.163
0.25HisTyr: 0.25 ± 0.174
0.0HisXaa: 0.0 ± 0.0
Ile
4.748IleAla: 4.748 ± 0.653
0.375IleCys: 0.375 ± 0.194
3.748IleAsp: 3.748 ± 1.049
3.373IleGlu: 3.373 ± 0.5
1.499IlePhe: 1.499 ± 0.548
2.374IleGly: 2.374 ± 0.244
1.749IleHis: 1.749 ± 0.376
3.748IleIle: 3.748 ± 0.734
3.748IleLys: 3.748 ± 0.678
3.623IleLeu: 3.623 ± 0.599
1.999IleMet: 1.999 ± 0.432
3.623IleAsn: 3.623 ± 0.397
4.248IlePro: 4.248 ± 1.09
1.624IleGln: 1.624 ± 0.347
2.999IleArg: 2.999 ± 0.685
4.998IleSer: 4.998 ± 1.118
4.248IleThr: 4.248 ± 1.056
4.623IleVal: 4.623 ± 1.066
0.5IleTrp: 0.5 ± 0.3
1.999IleTyr: 1.999 ± 0.454
0.0IleXaa: 0.0 ± 0.0
Lys
3.373LysAla: 3.373 ± 0.443
0.5LysCys: 0.5 ± 0.364
2.999LysAsp: 2.999 ± 0.649
2.999LysGlu: 2.999 ± 0.615
2.749LysPhe: 2.749 ± 0.346
2.249LysGly: 2.249 ± 0.414
1.749LysHis: 1.749 ± 0.961
2.374LysIle: 2.374 ± 0.33
2.499LysLys: 2.499 ± 0.504
6.122LysLeu: 6.122 ± 0.93
1.124LysMet: 1.124 ± 0.391
2.124LysAsn: 2.124 ± 0.326
1.749LysPro: 1.749 ± 0.59
1.499LysGln: 1.499 ± 0.553
1.874LysArg: 1.874 ± 0.583
4.123LysSer: 4.123 ± 0.942
2.624LysThr: 2.624 ± 0.675
4.123LysVal: 4.123 ± 0.678
0.375LysTrp: 0.375 ± 0.252
2.374LysTyr: 2.374 ± 0.506
0.0LysXaa: 0.0 ± 0.0
Leu
5.247LeuAla: 5.247 ± 0.981
0.5LeuCys: 0.5 ± 0.22
6.247LeuAsp: 6.247 ± 0.923
6.372LeuGlu: 6.372 ± 0.6
1.749LeuPhe: 1.749 ± 0.412
5.122LeuGly: 5.122 ± 0.81
1.874LeuHis: 1.874 ± 0.454
4.373LeuIle: 4.373 ± 0.458
5.122LeuLys: 5.122 ± 0.664
6.997LeuLeu: 6.997 ± 1.293
2.374LeuMet: 2.374 ± 0.795
3.748LeuAsn: 3.748 ± 0.532
2.999LeuPro: 2.999 ± 1.056
2.874LeuGln: 2.874 ± 0.274
4.873LeuArg: 4.873 ± 1.092
7.496LeuSer: 7.496 ± 1.204
7.496LeuThr: 7.496 ± 1.024
3.498LeuVal: 3.498 ± 0.794
0.75LeuTrp: 0.75 ± 0.379
3.498LeuTyr: 3.498 ± 0.604
0.0LeuXaa: 0.0 ± 0.0
Met
1.249MetAla: 1.249 ± 0.335
0.625MetCys: 0.625 ± 0.242
3.123MetAsp: 3.123 ± 0.58
1.0MetGlu: 1.0 ± 0.514
1.124MetPhe: 1.124 ± 0.443
1.124MetGly: 1.124 ± 0.603
1.374MetHis: 1.374 ± 0.464
0.875MetIle: 0.875 ± 0.273
1.624MetLys: 1.624 ± 0.791
2.499MetLeu: 2.499 ± 0.501
0.5MetMet: 0.5 ± 0.213
1.624MetAsn: 1.624 ± 0.422
1.249MetPro: 1.249 ± 0.302
0.875MetGln: 0.875 ± 0.378
1.499MetArg: 1.499 ± 0.487
2.999MetSer: 2.999 ± 0.752
1.0MetThr: 1.0 ± 0.548
1.374MetVal: 1.374 ± 0.395
0.25MetTrp: 0.25 ± 0.191
0.75MetTyr: 0.75 ± 0.479
0.0MetXaa: 0.0 ± 0.0
Asn
4.248AsnAla: 4.248 ± 1.034
0.375AsnCys: 0.375 ± 0.13
4.248AsnAsp: 4.248 ± 0.799
3.498AsnGlu: 3.498 ± 0.716
1.749AsnPhe: 1.749 ± 0.567
3.123AsnGly: 3.123 ± 0.512
0.375AsnHis: 0.375 ± 0.166
3.248AsnIle: 3.248 ± 0.908
2.749AsnLys: 2.749 ± 0.42
4.123AsnLeu: 4.123 ± 0.995
1.499AsnMet: 1.499 ± 0.4
2.374AsnAsn: 2.374 ± 0.571
3.248AsnPro: 3.248 ± 0.446
1.0AsnGln: 1.0 ± 0.396
1.874AsnArg: 1.874 ± 0.556
4.373AsnSer: 4.373 ± 0.517
2.999AsnThr: 2.999 ± 0.834
4.623AsnVal: 4.623 ± 0.577
0.75AsnTrp: 0.75 ± 0.275
1.999AsnTyr: 1.999 ± 0.802
0.0AsnXaa: 0.0 ± 0.0
Pro
3.623ProAla: 3.623 ± 0.817
0.125ProCys: 0.125 ± 0.121
2.499ProAsp: 2.499 ± 0.629
3.373ProGlu: 3.373 ± 0.921
2.499ProPhe: 2.499 ± 0.852
2.874ProGly: 2.874 ± 0.579
0.875ProHis: 0.875 ± 0.354
3.623ProIle: 3.623 ± 0.697
2.124ProLys: 2.124 ± 0.486
2.624ProLeu: 2.624 ± 0.606
1.374ProMet: 1.374 ± 0.597
1.749ProAsn: 1.749 ± 0.365
2.124ProPro: 2.124 ± 0.297
1.999ProGln: 1.999 ± 0.33
2.249ProArg: 2.249 ± 0.682
2.499ProSer: 2.499 ± 0.318
2.874ProThr: 2.874 ± 0.897
2.499ProVal: 2.499 ± 0.844
0.375ProTrp: 0.375 ± 0.211
1.249ProTyr: 1.249 ± 0.417
0.0ProXaa: 0.0 ± 0.0
Gln
2.374GlnAla: 2.374 ± 0.539
0.5GlnCys: 0.5 ± 0.276
1.624GlnAsp: 1.624 ± 0.341
0.875GlnGlu: 0.875 ± 0.282
1.0GlnPhe: 1.0 ± 0.489
1.499GlnGly: 1.499 ± 0.345
0.5GlnHis: 0.5 ± 0.212
1.249GlnIle: 1.249 ± 0.232
1.124GlnLys: 1.124 ± 0.431
3.248GlnLeu: 3.248 ± 0.543
0.5GlnMet: 0.5 ± 0.268
0.875GlnAsn: 0.875 ± 0.508
1.374GlnPro: 1.374 ± 0.57
0.875GlnGln: 0.875 ± 0.269
1.874GlnArg: 1.874 ± 0.468
2.499GlnSer: 2.499 ± 0.852
2.374GlnThr: 2.374 ± 0.401
2.499GlnVal: 2.499 ± 0.466
0.25GlnTrp: 0.25 ± 0.172
1.624GlnTyr: 1.624 ± 0.295
0.0GlnXaa: 0.0 ± 0.0
Arg
4.623ArgAla: 4.623 ± 0.65
0.25ArgCys: 0.25 ± 0.181
2.124ArgAsp: 2.124 ± 0.446
2.124ArgGlu: 2.124 ± 0.55
1.874ArgPhe: 1.874 ± 0.293
3.248ArgGly: 3.248 ± 0.889
1.0ArgHis: 1.0 ± 0.315
4.498ArgIle: 4.498 ± 0.716
2.624ArgLys: 2.624 ± 0.719
3.498ArgLeu: 3.498 ± 0.689
1.499ArgMet: 1.499 ± 0.612
3.123ArgAsn: 3.123 ± 0.653
1.749ArgPro: 1.749 ± 0.35
1.124ArgGln: 1.124 ± 0.306
2.624ArgArg: 2.624 ± 1.168
4.498ArgSer: 4.498 ± 1.009
4.373ArgThr: 4.373 ± 0.549
3.248ArgVal: 3.248 ± 0.655
0.5ArgTrp: 0.5 ± 0.277
2.749ArgTyr: 2.749 ± 0.49
0.0ArgXaa: 0.0 ± 0.0
Ser
5.497SerAla: 5.497 ± 0.661
0.625SerCys: 0.625 ± 0.211
5.247SerAsp: 5.247 ± 1.017
3.748SerGlu: 3.748 ± 0.759
1.999SerPhe: 1.999 ± 0.506
7.871SerGly: 7.871 ± 1.76
1.749SerHis: 1.749 ± 0.36
4.998SerIle: 4.998 ± 0.817
4.123SerLys: 4.123 ± 0.783
6.997SerLeu: 6.997 ± 1.21
2.249SerMet: 2.249 ± 0.576
5.122SerAsn: 5.122 ± 1.485
3.248SerPro: 3.248 ± 0.603
2.374SerGln: 2.374 ± 0.361
4.498SerArg: 4.498 ± 0.838
8.121SerSer: 8.121 ± 1.323
7.496SerThr: 7.496 ± 1.411
6.872SerVal: 6.872 ± 1.444
0.625SerTrp: 0.625 ± 0.335
2.999SerTyr: 2.999 ± 0.514
0.0SerXaa: 0.0 ± 0.0
Thr
5.122ThrAla: 5.122 ± 1.053
0.625ThrCys: 0.625 ± 0.211
4.373ThrAsp: 4.373 ± 0.419
3.623ThrGlu: 3.623 ± 0.815
1.749ThrPhe: 1.749 ± 0.49
3.873ThrGly: 3.873 ± 0.953
1.874ThrHis: 1.874 ± 0.373
4.373ThrIle: 4.373 ± 0.995
3.873ThrLys: 3.873 ± 0.805
7.246ThrLeu: 7.246 ± 1.118
1.374ThrMet: 1.374 ± 0.525
3.873ThrAsn: 3.873 ± 0.589
3.748ThrPro: 3.748 ± 0.707
2.624ThrGln: 2.624 ± 0.472
4.498ThrArg: 4.498 ± 0.588
9.12ThrSer: 9.12 ± 3.022
5.747ThrThr: 5.747 ± 2.634
4.873ThrVal: 4.873 ± 0.743
0.125ThrTrp: 0.125 ± 0.119
2.624ThrTyr: 2.624 ± 0.683
0.0ThrXaa: 0.0 ± 0.0
Val
5.747ValAla: 5.747 ± 0.987
0.625ValCys: 0.625 ± 0.218
4.248ValAsp: 4.248 ± 0.577
3.873ValGlu: 3.873 ± 0.783
2.374ValPhe: 2.374 ± 0.738
3.998ValGly: 3.998 ± 0.57
1.874ValHis: 1.874 ± 0.56
4.248ValIle: 4.248 ± 0.489
3.248ValLys: 3.248 ± 0.528
5.497ValLeu: 5.497 ± 0.72
3.123ValMet: 3.123 ± 0.741
3.373ValAsn: 3.373 ± 0.359
2.874ValPro: 2.874 ± 0.556
1.749ValGln: 1.749 ± 0.501
1.999ValArg: 1.999 ± 0.326
8.121ValSer: 8.121 ± 1.754
5.372ValThr: 5.372 ± 0.688
5.747ValVal: 5.747 ± 1.152
0.5ValTrp: 0.5 ± 0.185
2.249ValTyr: 2.249 ± 0.618
0.0ValXaa: 0.0 ± 0.0
Trp
0.375TrpAla: 0.375 ± 0.266
0.375TrpCys: 0.375 ± 0.212
0.375TrpAsp: 0.375 ± 0.184
0.625TrpGlu: 0.625 ± 0.394
0.625TrpPhe: 0.625 ± 0.325
0.75TrpGly: 0.75 ± 0.343
0.25TrpHis: 0.25 ± 0.161
0.125TrpIle: 0.125 ± 0.119
0.25TrpLys: 0.25 ± 0.241
0.625TrpLeu: 0.625 ± 0.24
0.5TrpMet: 0.5 ± 0.226
0.5TrpAsn: 0.5 ± 0.241
0.25TrpPro: 0.25 ± 0.178
0.0TrpGln: 0.0 ± 0.0
0.375TrpArg: 0.375 ± 0.202
0.875TrpSer: 0.875 ± 0.606
0.75TrpThr: 0.75 ± 0.38
0.25TrpVal: 0.25 ± 0.179
0.0TrpTrp: 0.0 ± 0.0
0.375TrpTyr: 0.375 ± 0.275
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.374TyrAla: 2.374 ± 0.466
0.125TyrCys: 0.125 ± 0.131
3.373TyrAsp: 3.373 ± 0.935
2.624TyrGlu: 2.624 ± 0.39
1.0TyrPhe: 1.0 ± 0.32
2.749TyrGly: 2.749 ± 0.598
0.625TyrHis: 0.625 ± 0.214
1.999TyrIle: 1.999 ± 0.42
1.874TyrLys: 1.874 ± 0.672
2.874TyrLeu: 2.874 ± 0.848
1.124TyrMet: 1.124 ± 0.353
1.874TyrAsn: 1.874 ± 0.428
0.875TyrPro: 0.875 ± 0.352
1.749TyrGln: 1.749 ± 0.465
1.624TyrArg: 1.624 ± 0.564
2.249TyrSer: 2.249 ± 0.559
3.623TyrThr: 3.623 ± 1.193
3.998TyrVal: 3.998 ± 0.486
0.375TyrTrp: 0.375 ± 0.231
1.124TyrTyr: 1.124 ± 0.354
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (8005 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski