Amino acid dipeptide frequency for Rotavirus A human/Vanderbilt/VU08-09-12/2008/G3P[8]

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
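As a minimal sketch of how such frequencies could be computed (not necessarily the procedure used by Proteome-pI), the Python snippet below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the choice to normalise by the total number of dipeptides are assumptions made purely for illustration.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Pairs are counted inside each protein only (never across protein
    boundaries). Any letter outside STANDARD is mapped to 'X' (Xaa).
    Normalising by the total dipeptide count is an assumption.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation: 1 of 400 standard dipeptides = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(STANDARD) ** 2

# Hypothetical toy sequences, not the rotavirus proteome.
toy = ["MKLLA", "MKV"]
freqs = dipeptide_per_mille(toy)
```

With real data, `proteins` would hold the 12 protein sequences of the proteome, and each resulting value could be compared against `EXPECTED_PER_MILLE`.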

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
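As a worked example, the AlaAla entry of 3.225‰ corresponds to a fraction of 3.225 × 10^-3 = 0.003225, which is above the 2.5‰ expected under a uniform distribution. The short Python sketch below performs this conversion; the helper names are hypothetical, the simple above/below rule is only an assumption about how the red/blue marking could be reproduced, and the implied count assumes that frequencies were normalised by the 5892 residues reported in the statistics line (the page does not state the denominator).

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 0.25 % expressed in per mille = 2.5


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def representation(value, expected=UNIFORM_PER_MILLE):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


ala_ala = 3.225                            # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)  # about 0.003225

# Assumption: the denominator is the total residue count of the proteome.
implied_count = round(fraction * 5892)     # about 19 occurrences
```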

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is the frequency in ‰ ± its estimated error.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 3.225 ± 0.663 | 1.528 ± 0.622 | 3.225 ± 0.796 | 1.698 ± 0.576 | 2.037 ± 0.475 | 2.037 ± 0.697 | 0.34 ± 0.207 | 3.565 ± 0.934 | 3.225 ± 1.119 | 4.753 ± 0.675 | 0.849 ± 0.292 | 4.753 ± 0.82 | 1.188 ± 0.382 | 1.698 ± 0.367 | 1.528 ± 1.038 | 3.565 ± 0.695 | 3.565 ± 0.859 | 3.904 ± 0.928 | 0.34 ± 0.207 | 1.188 ± 0.396 | 0.0 ± 0.0 |
| Cys | 0.17 ± 0.201 | 0.509 ± 0.286 | 0.679 ± 0.286 | 0.679 ± 0.377 | 0.679 ± 0.326 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.849 ± 0.325 | 1.358 ± 0.567 | 1.019 ± 0.481 | 0.679 ± 0.295 | 1.188 ± 0.378 | 0.17 ± 0.192 | 0.509 ± 0.325 | 0.849 ± 0.52 | 1.188 ± 0.618 | 0.849 ± 0.456 | 0.679 ± 0.254 | 0.0 ± 0.0 | 0.509 ± 0.233 | 0.0 ± 0.0 |
| Asp | 2.886 ± 0.912 | 0.509 ± 0.198 | 4.244 ± 0.898 | 3.565 ± 0.326 | 3.904 ± 0.825 | 2.377 ± 0.33 | 1.019 ± 0.352 | 4.923 ± 0.703 | 3.395 ± 0.63 | 4.753 ± 0.638 | 2.377 ± 0.577 | 3.565 ± 0.595 | 1.698 ± 0.384 | 2.546 ± 0.836 | 2.207 ± 0.475 | 4.923 ± 1.296 | 2.716 ± 0.751 | 5.432 ± 0.817 | 0.849 ± 0.361 | 3.395 ± 0.693 | 0.0 ± 0.0 |
| Glu | 2.716 ± 0.624 | 0.17 ± 0.146 | 2.716 ± 0.662 | 2.037 ± 0.585 | 1.867 ± 0.521 | 1.019 ± 0.362 | 0.34 ± 0.207 | 4.414 ± 0.731 | 3.904 ± 0.918 | 5.772 ± 1.104 | 2.716 ± 0.629 | 3.395 ± 0.696 | 1.867 ± 0.667 | 2.207 ± 0.645 | 2.716 ± 0.798 | 3.056 ± 0.694 | 2.207 ± 0.575 | 3.735 ± 0.587 | 1.019 ± 0.321 | 4.583 ± 0.872 | 0.0 ± 0.0 |
| Phe | 1.698 ± 0.745 | 0.34 ± 0.21 | 3.225 ± 0.513 | 1.867 ± 0.518 | 1.019 ± 0.381 | 1.698 ± 0.34 | 1.358 ± 0.337 | 4.244 ± 0.865 | 3.225 ± 0.611 | 3.735 ± 0.947 | 0.34 ± 0.277 | 3.395 ± 0.582 | 1.698 ± 0.744 | 1.867 ± 0.701 | 2.546 ± 0.566 | 4.074 ± 1.268 | 2.886 ± 0.482 | 1.528 ± 0.481 | 0.34 ± 0.207 | 1.867 ± 0.479 | 0.0 ± 0.0 |
| Gly | 1.358 ± 0.308 | 0.849 ± 0.286 | 1.019 ± 0.429 | 1.867 ± 0.608 | 0.849 ± 0.438 | 1.698 ± 0.864 | 0.849 ± 0.351 | 4.074 ± 0.575 | 3.565 ± 0.804 | 2.037 ± 0.522 | 1.188 ± 0.337 | 1.528 ± 0.478 | 1.019 ± 0.446 | 1.358 ± 0.472 | 1.528 ± 0.57 | 2.546 ± 0.956 | 1.698 ± 0.779 | 3.225 ± 0.404 | 0.509 ± 0.244 | 1.019 ± 0.349 | 0.0 ± 0.0 |
| His | 1.188 ± 0.385 | 0.17 ± 0.146 | 1.358 ± 0.442 | 0.34 ± 0.194 | 0.509 ± 0.219 | 0.679 ± 0.382 | 0.509 ± 0.298 | 0.679 ± 0.236 | 1.698 ± 0.635 | 1.528 ± 0.54 | 0.34 ± 0.201 | 1.019 ± 0.448 | 0.34 ± 0.201 | 0.849 ± 0.375 | 0.17 ± 0.184 | 1.698 ± 0.436 | 0.679 ± 0.241 | 1.188 ± 0.247 | 0.17 ± 0.147 | 1.019 ± 0.367 | 0.0 ± 0.0 |
| Ile | 5.602 ± 1.116 | 0.34 ± 0.226 | 4.583 ± 1.146 | 5.432 ± 0.76 | 2.716 ± 0.796 | 2.546 ± 0.552 | 1.019 ± 0.464 | 6.111 ± 0.645 | 4.923 ± 0.618 | 6.79 ± 1.192 | 1.358 ± 0.396 | 7.469 ± 0.764 | 3.735 ± 0.659 | 3.904 ± 0.85 | 3.395 ± 0.572 | 5.941 ± 1.523 | 6.79 ± 0.907 | 4.414 ± 0.905 | 0.34 ± 0.245 | 4.244 ± 1.528 | 0.0 ± 0.0 |
| Lys | 2.037 ± 0.465 | 1.698 ± 0.572 | 3.225 ± 0.558 | 4.244 ± 1.34 | 2.716 ± 1.013 | 2.377 ± 0.741 | 0.849 ± 0.36 | 4.753 ± 0.98 | 3.904 ± 0.812 | 7.809 ± 1.118 | 2.207 ± 0.605 | 4.244 ± 0.647 | 2.207 ± 0.71 | 3.225 ± 0.877 | 3.735 ± 0.344 | 3.565 ± 0.744 | 3.735 ± 0.709 | 4.074 ± 1.174 | 1.019 ± 0.492 | 3.395 ± 1.023 | 0.0 ± 0.0 |
| Leu | 3.565 ± 0.541 | 0.679 ± 0.392 | 5.772 ± 0.956 | 4.753 ± 1.216 | 3.565 ± 0.891 | 2.377 ± 0.439 | 2.037 ± 0.658 | 7.469 ± 0.345 | 6.62 ± 1.213 | 8.318 ± 1.543 | 3.395 ± 0.924 | 6.96 ± 1.384 | 3.904 ± 0.724 | 3.904 ± 1.162 | 5.941 ± 0.596 | 8.488 ± 1.31 | 6.111 ± 0.686 | 4.753 ± 0.951 | 0.679 ± 0.232 | 3.735 ± 0.694 | 0.0 ± 0.0 |
| Met | 1.698 ± 0.515 | 0.0 ± 0.0 | 2.546 ± 0.664 | 1.358 ± 0.678 | 1.528 ± 0.559 | 1.019 ± 0.35 | 0.679 ± 0.321 | 1.358 ± 0.506 | 2.377 ± 0.371 | 3.056 ± 0.729 | 0.34 ± 0.233 | 2.377 ± 0.538 | 1.019 ± 0.326 | 1.698 ± 0.738 | 1.698 ± 0.616 | 2.886 ± 0.666 | 1.528 ± 0.415 | 0.849 ± 0.311 | 0.679 ± 0.279 | 1.358 ± 0.385 | 0.0 ± 0.0 |
| Asn | 3.225 ± 0.849 | 0.849 ± 0.48 | 4.753 ± 0.935 | 4.583 ± 0.399 | 2.886 ± 0.735 | 3.565 ± 0.792 | 2.037 ± 0.664 | 3.056 ± 0.951 | 3.904 ± 0.7 | 7.809 ± 0.624 | 2.886 ± 0.728 | 5.602 ± 1.196 | 1.698 ± 0.764 | 1.867 ± 0.474 | 2.716 ± 0.779 | 6.451 ± 1.688 | 4.244 ± 0.828 | 5.602 ± 1.058 | 1.867 ± 0.745 | 4.923 ± 0.675 | 0.0 ± 0.0 |
| Pro | 1.358 ± 0.439 | 0.0 ± 0.0 | 2.207 ± 0.506 | 0.509 ± 0.322 | 2.037 ± 0.447 | 1.188 ± 0.301 | 0.849 ± 0.351 | 3.904 ± 0.483 | 1.019 ± 0.317 | 1.698 ± 0.469 | 0.849 ± 0.361 | 1.528 ± 0.589 | 1.528 ± 0.54 | 1.698 ± 0.781 | 1.698 ± 0.325 | 2.716 ± 0.598 | 2.886 ± 0.765 | 2.886 ± 0.848 | 0.0 ± 0.0 | 1.698 ± 0.446 | 0.0 ± 0.0 |
| Gln | 1.358 ± 0.582 | 0.509 ± 0.221 | 2.377 ± 0.554 | 1.867 ± 0.355 | 1.698 ± 0.394 | 0.509 ± 0.311 | 1.358 ± 0.455 | 3.904 ± 0.995 | 2.377 ± 0.771 | 5.602 ± 1.172 | 1.528 ± 0.575 | 2.546 ± 0.827 | 1.188 ± 0.372 | 2.886 ± 1.333 | 2.037 ± 0.841 | 2.377 ± 0.593 | 3.735 ± 0.601 | 2.207 ± 0.787 | 0.679 ± 0.388 | 2.716 ± 0.578 | 0.0 ± 0.0 |
| Arg | 1.698 ± 0.423 | 0.679 ± 0.334 | 2.207 ± 0.51 | 2.207 ± 0.544 | 2.716 ± 0.545 | 1.358 ± 0.275 | 1.019 ± 0.445 | 4.244 ± 0.524 | 3.056 ± 0.614 | 3.395 ± 1.389 | 2.207 ± 0.605 | 4.074 ± 1.091 | 1.188 ± 0.478 | 2.886 ± 0.751 | 2.037 ± 0.532 | 3.904 ± 0.97 | 2.377 ± 0.497 | 2.716 ± 0.964 | 0.679 ± 0.269 | 2.037 ± 0.406 | 0.0 ± 0.0 |
| Ser | 3.565 ± 0.68 | 0.509 ± 0.357 | 3.904 ± 0.862 | 4.583 ± 0.852 | 3.395 ± 0.623 | 2.546 ± 0.598 | 0.849 ± 0.341 | 7.978 ± 1.083 | 4.753 ± 0.638 | 7.639 ± 1.639 | 2.207 ± 0.451 | 6.111 ± 1.349 | 2.377 ± 0.522 | 3.565 ± 0.693 | 3.735 ± 0.51 | 5.941 ± 1.634 | 5.262 ± 0.913 | 5.093 ± 1.245 | 0.34 ± 0.217 | 3.904 ± 1.152 | 0.0 ± 0.0 |
| Thr | 3.225 ± 0.755 | 0.509 ± 0.295 | 4.583 ± 0.865 | 3.565 ± 0.616 | 3.904 ± 0.781 | 1.698 ± 0.458 | 0.34 ± 0.227 | 6.111 ± 0.836 | 2.546 ± 0.633 | 7.978 ± 1.65 | 2.037 ± 0.647 | 3.395 ± 1.306 | 1.698 ± 0.474 | 2.716 ± 0.737 | 3.395 ± 0.916 | 5.262 ± 0.614 | 4.583 ± 1.029 | 4.244 ± 0.843 | 0.849 ± 0.282 | 2.037 ± 0.851 | 0.0 ± 0.0 |
| Val | 4.074 ± 1.092 | 1.528 ± 0.631 | 4.074 ± 0.83 | 3.904 ± 0.804 | 2.716 ± 0.709 | 2.546 ± 0.457 | 0.34 ± 0.293 | 4.923 ± 0.835 | 3.395 ± 0.798 | 5.602 ± 0.903 | 1.358 ± 0.586 | 6.281 ± 1.243 | 2.377 ± 0.65 | 2.037 ± 0.601 | 1.867 ± 0.641 | 4.753 ± 0.95 | 5.093 ± 0.875 | 2.207 ± 0.535 | 0.34 ± 0.205 | 2.546 ± 0.62 | 0.0 ± 0.0 |
| Trp | 0.17 ± 0.138 | 0.509 ± 0.439 | 1.188 ± 0.319 | 0.17 ± 0.161 | 0.34 ± 0.233 | 0.17 ± 0.186 | 0.0 ± 0.0 | 1.528 ± 0.471 | 1.867 ± 0.415 | 1.188 ± 0.346 | 0.17 ± 0.188 | 0.679 ± 0.339 | 0.34 ± 0.261 | 0.509 ± 0.422 | 0.34 ± 0.205 | 0.34 ± 0.207 | 1.019 ± 0.596 | 0.17 ± 0.188 | 0.17 ± 0.192 | 0.679 ± 0.326 | 0.0 ± 0.0 |
| Tyr | 3.395 ± 1.112 | 0.679 ± 0.355 | 3.395 ± 0.924 | 3.225 ± 0.587 | 1.867 ± 0.427 | 2.377 ± 0.658 | 0.34 ± 0.293 | 3.565 ± 0.873 | 3.904 ± 0.722 | 2.716 ± 0.852 | 0.849 ± 0.372 | 4.414 ± 0.792 | 1.019 ± 0.356 | 1.528 ± 0.503 | 2.546 ± 0.649 | 4.583 ± 1.309 | 2.716 ± 0.965 | 3.056 ± 0.722 | 0.679 ± 0.352 | 2.716 ± 0.933 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 12 proteins (5892 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
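A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is taken as the error. The Python snippet below is only an illustration under stated assumptions: the function names, the toy sequences, the fixed random seed, and the use of the sample standard deviation as the error measure are not taken from the source.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_sd(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over n_boot protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not the rotavirus proteome.
toy = ["MKLLA", "MKV", "LLKMA"]
sd = bootstrap_sd(toy, "LL")
```

Resampling at the protein level, rather than at the level of individual residues, keeps each protein's internal composition intact, so the error reflects variation between proteins.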

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski