Amino acid dipepetide frequency for Sweetwater Branch virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.717AlaAla: 0.717 ± 0.366
0.956AlaCys: 0.956 ± 0.384
3.822AlaAsp: 3.822 ± 0.909
0.478AlaGlu: 0.478 ± 0.551
0.956AlaPhe: 0.956 ± 0.483
1.194AlaGly: 1.194 ± 0.466
1.433AlaHis: 1.433 ± 0.478
3.106AlaIle: 3.106 ± 0.423
2.389AlaLys: 2.389 ± 1.118
2.867AlaLeu: 2.867 ± 0.348
1.672AlaMet: 1.672 ± 0.556
2.389AlaAsn: 2.389 ± 0.353
0.717AlaPro: 0.717 ± 0.59
0.239AlaGln: 0.239 ± 0.149
2.389AlaArg: 2.389 ± 0.933
6.211AlaSer: 6.211 ± 1.659
0.717AlaThr: 0.717 ± 0.267
1.194AlaVal: 1.194 ± 0.593
0.478AlaTrp: 0.478 ± 0.298
1.672AlaTyr: 1.672 ± 0.891
0.0AlaXaa: 0.0 ± 0.0
Cys
0.478CysAla: 0.478 ± 0.298
0.478CysCys: 0.478 ± 0.298
0.717CysAsp: 0.717 ± 0.385
1.194CysGlu: 1.194 ± 0.391
1.911CysPhe: 1.911 ± 0.826
0.478CysGly: 0.478 ± 0.234
0.956CysHis: 0.956 ± 0.48
1.194CysIle: 1.194 ± 0.841
2.15CysLys: 2.15 ± 0.881
2.15CysLeu: 2.15 ± 0.877
0.0CysMet: 0.0 ± 0.0
1.672CysAsn: 1.672 ± 0.476
0.478CysPro: 0.478 ± 0.564
0.717CysGln: 0.717 ± 0.378
0.717CysArg: 0.717 ± 0.318
2.15CysSer: 2.15 ± 0.695
0.956CysThr: 0.956 ± 0.258
0.717CysVal: 0.717 ± 0.448
0.478CysTrp: 0.478 ± 0.365
0.956CysTyr: 0.956 ± 0.461
0.0CysXaa: 0.0 ± 0.0
Asp
3.106AspAla: 3.106 ± 1.157
1.672AspCys: 1.672 ± 1.176
5.256AspAsp: 5.256 ± 1.727
3.583AspGlu: 3.583 ± 0.902
2.389AspPhe: 2.389 ± 0.53
3.106AspGly: 3.106 ± 0.952
2.15AspHis: 2.15 ± 0.415
2.628AspIle: 2.628 ± 0.74
2.867AspLys: 2.867 ± 0.748
9.556AspLeu: 9.556 ± 1.876
1.672AspMet: 1.672 ± 0.439
3.583AspAsn: 3.583 ± 1.189
2.867AspPro: 2.867 ± 0.74
2.628AspGln: 2.628 ± 0.608
1.672AspArg: 1.672 ± 0.511
3.583AspSer: 3.583 ± 1.303
4.061AspThr: 4.061 ± 0.808
3.344AspVal: 3.344 ± 0.926
1.672AspTrp: 1.672 ± 0.532
3.344AspTyr: 3.344 ± 0.771
0.0AspXaa: 0.0 ± 0.0
Glu
2.389GluAla: 2.389 ± 0.416
1.433GluCys: 1.433 ± 0.553
5.256GluAsp: 5.256 ± 0.975
2.628GluGlu: 2.628 ± 0.49
1.433GluPhe: 1.433 ± 0.566
2.15GluGly: 2.15 ± 0.569
1.194GluHis: 1.194 ± 0.426
3.822GluIle: 3.822 ± 0.77
3.344GluLys: 3.344 ± 0.674
6.928GluLeu: 6.928 ± 1.143
1.433GluMet: 1.433 ± 0.346
2.628GluAsn: 2.628 ± 0.643
1.194GluPro: 1.194 ± 0.476
0.956GluGln: 0.956 ± 0.34
1.194GluArg: 1.194 ± 0.497
5.733GluSer: 5.733 ± 0.822
3.106GluThr: 3.106 ± 0.718
4.061GluVal: 4.061 ± 1.178
0.239GluTrp: 0.239 ± 0.282
2.389GluTyr: 2.389 ± 0.709
0.0GluXaa: 0.0 ± 0.0
Phe
1.433PheAla: 1.433 ± 0.494
0.956PheCys: 0.956 ± 0.461
1.433PheAsp: 1.433 ± 0.58
0.956PheGlu: 0.956 ± 0.346
3.106PhePhe: 3.106 ± 0.556
2.867PheGly: 2.867 ± 0.59
1.194PheHis: 1.194 ± 0.505
2.867PheIle: 2.867 ± 1.012
5.017PheLys: 5.017 ± 0.905
3.583PheLeu: 3.583 ± 0.642
0.239PheMet: 0.239 ± 0.149
2.867PheAsn: 2.867 ± 1.16
2.389PhePro: 2.389 ± 1.443
1.433PheGln: 1.433 ± 0.712
1.672PheArg: 1.672 ± 0.793
2.867PheSer: 2.867 ± 0.269
1.911PheThr: 1.911 ± 0.491
2.628PheVal: 2.628 ± 0.504
0.239PheTrp: 0.239 ± 0.149
2.628PheTyr: 2.628 ± 0.6
0.0PheXaa: 0.0 ± 0.0
Gly
1.911GlyAla: 1.911 ± 0.387
0.478GlyCys: 0.478 ± 0.298
3.344GlyAsp: 3.344 ± 0.735
2.389GlyGlu: 2.389 ± 0.734
2.389GlyPhe: 2.389 ± 0.818
3.106GlyGly: 3.106 ± 0.61
1.672GlyHis: 1.672 ± 0.683
3.106GlyIle: 3.106 ± 0.757
3.106GlyLys: 3.106 ± 0.592
6.689GlyLeu: 6.689 ± 1.216
0.717GlyMet: 0.717 ± 0.476
1.433GlyAsn: 1.433 ± 0.309
1.194GlyPro: 1.194 ± 0.523
2.15GlyGln: 2.15 ± 0.402
1.911GlyArg: 1.911 ± 0.61
5.972GlySer: 5.972 ± 0.961
3.344GlyThr: 3.344 ± 0.978
2.628GlyVal: 2.628 ± 0.663
1.433GlyTrp: 1.433 ± 0.382
1.672GlyTyr: 1.672 ± 0.576
0.0GlyXaa: 0.0 ± 0.0
His
0.239HisAla: 0.239 ± 0.312
0.478HisCys: 0.478 ± 0.485
1.911HisAsp: 1.911 ± 0.45
1.672HisGlu: 1.672 ± 0.667
1.433HisPhe: 1.433 ± 0.532
1.433HisGly: 1.433 ± 0.906
0.717HisHis: 0.717 ± 0.447
1.911HisIle: 1.911 ± 0.732
1.433HisLys: 1.433 ± 0.384
3.344HisLeu: 3.344 ± 0.375
0.717HisMet: 0.717 ± 0.255
1.672HisAsn: 1.672 ± 0.793
1.911HisPro: 1.911 ± 1.01
0.956HisGln: 0.956 ± 0.543
1.433HisArg: 1.433 ± 0.421
1.911HisSer: 1.911 ± 0.545
0.717HisThr: 0.717 ± 0.447
2.389HisVal: 2.389 ± 0.653
0.717HisTrp: 0.717 ± 0.385
0.478HisTyr: 0.478 ± 0.298
0.0HisXaa: 0.0 ± 0.0
Ile
2.15IleAla: 2.15 ± 0.832
2.389IleCys: 2.389 ± 0.81
4.539IleAsp: 4.539 ± 1.246
4.061IleGlu: 4.061 ± 0.704
3.583IlePhe: 3.583 ± 0.745
4.778IleGly: 4.778 ± 1.014
1.433IleHis: 1.433 ± 0.421
5.256IleIle: 5.256 ± 1.767
8.361IleLys: 8.361 ± 0.974
8.6IleLeu: 8.6 ± 2.283
1.433IleMet: 1.433 ± 0.346
3.822IleAsn: 3.822 ± 0.852
4.061IlePro: 4.061 ± 1.135
3.106IleGln: 3.106 ± 0.933
4.778IleArg: 4.778 ± 1.881
7.167IleSer: 7.167 ± 1.203
3.106IleThr: 3.106 ± 1.346
3.583IleVal: 3.583 ± 0.93
1.433IleTrp: 1.433 ± 0.838
3.106IleTyr: 3.106 ± 0.516
0.0IleXaa: 0.0 ± 0.0
Lys
2.628LysAla: 2.628 ± 1.294
2.15LysCys: 2.15 ± 0.926
5.733LysAsp: 5.733 ± 1.386
3.822LysGlu: 3.822 ± 0.908
3.583LysPhe: 3.583 ± 1.174
4.778LysGly: 4.778 ± 1.395
1.911LysHis: 1.911 ± 0.934
8.6LysIle: 8.6 ± 1.368
5.972LysLys: 5.972 ± 1.212
7.883LysLeu: 7.883 ± 1.275
1.194LysMet: 1.194 ± 1.223
3.106LysAsn: 3.106 ± 0.881
2.867LysPro: 2.867 ± 0.771
2.389LysGln: 2.389 ± 0.835
2.628LysArg: 2.628 ± 0.685
5.256LysSer: 5.256 ± 1.572
2.867LysThr: 2.867 ± 0.845
3.106LysVal: 3.106 ± 1.385
1.672LysTrp: 1.672 ± 0.998
3.106LysTyr: 3.106 ± 1.186
0.0LysXaa: 0.0 ± 0.0
Leu
3.822LeuAla: 3.822 ± 0.413
1.911LeuCys: 1.911 ± 0.737
8.6LeuAsp: 8.6 ± 1.81
6.211LeuGlu: 6.211 ± 0.93
3.583LeuPhe: 3.583 ± 1.441
4.539LeuGly: 4.539 ± 1.612
1.194LeuHis: 1.194 ± 0.746
9.317LeuIle: 9.317 ± 1.796
7.406LeuLys: 7.406 ± 0.886
6.689LeuLeu: 6.689 ± 1.726
3.822LeuMet: 3.822 ± 1.042
8.361LeuAsn: 8.361 ± 0.956
3.822LeuPro: 3.822 ± 0.629
3.822LeuGln: 3.822 ± 0.89
5.733LeuArg: 5.733 ± 1.849
7.883LeuSer: 7.883 ± 1.412
5.256LeuThr: 5.256 ± 0.94
4.061LeuVal: 4.061 ± 1.246
0.956LeuTrp: 0.956 ± 0.34
4.061LeuTyr: 4.061 ± 0.739
0.0LeuXaa: 0.0 ± 0.0
Met
0.956MetAla: 0.956 ± 0.358
0.239MetCys: 0.239 ± 0.282
1.433MetAsp: 1.433 ± 0.584
1.194MetGlu: 1.194 ± 0.435
0.717MetPhe: 0.717 ± 0.255
0.478MetGly: 0.478 ± 0.231
0.717MetHis: 0.717 ± 0.515
3.344MetIle: 3.344 ± 1.533
0.717MetLys: 0.717 ± 0.684
1.194MetLeu: 1.194 ± 0.796
0.717MetMet: 0.717 ± 0.364
1.433MetAsn: 1.433 ± 0.364
0.717MetPro: 0.717 ± 0.372
0.478MetGln: 0.478 ± 0.273
1.672MetArg: 1.672 ± 1.212
2.15MetSer: 2.15 ± 0.62
1.911MetThr: 1.911 ± 0.349
0.956MetVal: 0.956 ± 0.537
0.239MetTrp: 0.239 ± 0.149
0.478MetTyr: 0.478 ± 0.35
0.0MetXaa: 0.0 ± 0.0
Asn
2.15AsnAla: 2.15 ± 1.028
0.717AsnCys: 0.717 ± 0.385
2.867AsnAsp: 2.867 ± 0.648
3.106AsnGlu: 3.106 ± 1.025
2.628AsnPhe: 2.628 ± 0.883
4.3AsnGly: 4.3 ± 1.048
1.672AsnHis: 1.672 ± 0.676
5.733AsnIle: 5.733 ± 1.146
4.061AsnLys: 4.061 ± 0.639
7.645AsnLeu: 7.645 ± 0.979
1.194AsnMet: 1.194 ± 0.476
4.539AsnAsn: 4.539 ± 0.534
2.867AsnPro: 2.867 ± 0.622
1.911AsnGln: 1.911 ± 0.688
1.911AsnArg: 1.911 ± 0.487
4.3AsnSer: 4.3 ± 0.709
2.15AsnThr: 2.15 ± 0.369
3.583AsnVal: 3.583 ± 1.675
1.672AsnTrp: 1.672 ± 0.503
3.822AsnTyr: 3.822 ± 0.461
0.0AsnXaa: 0.0 ± 0.0
Pro
0.956ProAla: 0.956 ± 0.366
0.239ProCys: 0.239 ± 0.383
3.106ProAsp: 3.106 ± 1.468
1.672ProGlu: 1.672 ± 0.503
1.194ProPhe: 1.194 ± 0.717
1.194ProGly: 1.194 ± 0.803
1.433ProHis: 1.433 ± 0.654
4.061ProIle: 4.061 ± 0.666
3.106ProLys: 3.106 ± 0.741
3.344ProLeu: 3.344 ± 0.879
0.478ProMet: 0.478 ± 0.322
1.911ProAsn: 1.911 ± 0.588
1.433ProPro: 1.433 ± 0.74
1.433ProGln: 1.433 ± 0.643
0.239ProArg: 0.239 ± 0.149
4.539ProSer: 4.539 ± 0.414
3.583ProThr: 3.583 ± 1.339
1.433ProVal: 1.433 ± 0.408
0.717ProTrp: 0.717 ± 0.378
2.389ProTyr: 2.389 ± 0.459
0.0ProXaa: 0.0 ± 0.0
Gln
0.717GlnAla: 0.717 ± 0.448
1.194GlnCys: 1.194 ± 0.468
2.867GlnAsp: 2.867 ± 0.9
2.15GlnGlu: 2.15 ± 0.825
1.194GlnPhe: 1.194 ± 0.523
1.194GlnGly: 1.194 ± 0.41
1.672GlnHis: 1.672 ± 0.974
2.628GlnIle: 2.628 ± 1.112
3.106GlnLys: 3.106 ± 0.709
2.389GlnLeu: 2.389 ± 0.444
0.478GlnMet: 0.478 ± 0.231
1.433GlnAsn: 1.433 ± 0.51
0.239GlnPro: 0.239 ± 0.149
0.956GlnGln: 0.956 ± 0.366
1.672GlnArg: 1.672 ± 0.511
1.672GlnSer: 1.672 ± 0.928
2.628GlnThr: 2.628 ± 0.593
1.911GlnVal: 1.911 ± 0.911
0.239GlnTrp: 0.239 ± 0.149
0.956GlnTyr: 0.956 ± 0.554
0.0GlnXaa: 0.0 ± 0.0
Arg
3.583ArgAla: 3.583 ± 0.85
1.194ArgCys: 1.194 ± 0.491
0.956ArgAsp: 0.956 ± 0.484
2.15ArgGlu: 2.15 ± 0.499
2.389ArgPhe: 2.389 ± 0.673
1.911ArgGly: 1.911 ± 0.385
0.478ArgHis: 0.478 ± 0.298
1.911ArgIle: 1.911 ± 0.941
2.389ArgLys: 2.389 ± 0.995
3.106ArgLeu: 3.106 ± 0.671
0.478ArgMet: 0.478 ± 0.597
4.3ArgAsn: 4.3 ± 1.229
2.15ArgPro: 2.15 ± 0.853
0.956ArgGln: 0.956 ± 0.437
0.717ArgArg: 0.717 ± 0.372
4.778ArgSer: 4.778 ± 0.724
3.583ArgThr: 3.583 ± 0.814
2.15ArgVal: 2.15 ± 0.55
0.717ArgTrp: 0.717 ± 0.429
0.0ArgTyr: 0.0 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
3.106SerAla: 3.106 ± 0.879
2.389SerCys: 2.389 ± 0.844
4.3SerAsp: 4.3 ± 0.748
6.211SerGlu: 6.211 ± 1.135
4.061SerPhe: 4.061 ± 0.668
4.3SerGly: 4.3 ± 0.819
2.15SerHis: 2.15 ± 0.863
7.645SerIle: 7.645 ± 1.454
5.972SerLys: 5.972 ± 2.758
8.122SerLeu: 8.122 ± 2.14
1.433SerMet: 1.433 ± 0.481
5.495SerAsn: 5.495 ± 1.554
3.344SerPro: 3.344 ± 0.795
1.911SerGln: 1.911 ± 0.595
3.344SerArg: 3.344 ± 0.456
7.167SerSer: 7.167 ± 1.902
4.539SerThr: 4.539 ± 0.544
3.822SerVal: 3.822 ± 1.559
1.911SerTrp: 1.911 ± 0.475
4.539SerTyr: 4.539 ± 1.346
0.0SerXaa: 0.0 ± 0.0
Thr
1.911ThrAla: 1.911 ± 0.923
0.478ThrCys: 0.478 ± 0.365
3.344ThrAsp: 3.344 ± 0.502
2.867ThrGlu: 2.867 ± 0.571
1.672ThrPhe: 1.672 ± 0.475
2.628ThrGly: 2.628 ± 0.621
2.15ThrHis: 2.15 ± 0.856
5.495ThrIle: 5.495 ± 1.101
3.822ThrLys: 3.822 ± 0.74
5.495ThrLeu: 5.495 ± 1.315
0.956ThrMet: 0.956 ± 0.356
3.822ThrAsn: 3.822 ± 0.927
1.911ThrPro: 1.911 ± 0.647
1.672ThrGln: 1.672 ± 0.843
1.672ThrArg: 1.672 ± 0.463
4.778ThrSer: 4.778 ± 0.97
1.911ThrThr: 1.911 ± 0.743
3.106ThrVal: 3.106 ± 0.748
1.911ThrTrp: 1.911 ± 0.631
2.867ThrTyr: 2.867 ± 1.507
0.0ThrXaa: 0.0 ± 0.0
Val
1.672ValAla: 1.672 ± 0.667
0.478ValCys: 0.478 ± 0.298
1.911ValAsp: 1.911 ± 0.395
1.911ValGlu: 1.911 ± 1.141
1.911ValPhe: 1.911 ± 0.857
3.106ValGly: 3.106 ± 0.921
2.389ValHis: 2.389 ± 0.641
4.778ValIle: 4.778 ± 0.653
4.539ValLys: 4.539 ± 1.67
4.061ValLeu: 4.061 ± 1.864
1.433ValMet: 1.433 ± 0.48
4.539ValAsn: 4.539 ± 1.043
1.911ValPro: 1.911 ± 0.729
1.911ValGln: 1.911 ± 0.423
1.672ValArg: 1.672 ± 0.964
3.106ValSer: 3.106 ± 1.023
5.256ValThr: 5.256 ± 1.054
1.672ValVal: 1.672 ± 0.782
0.478ValTrp: 0.478 ± 0.231
1.911ValTyr: 1.911 ± 0.665
0.0ValXaa: 0.0 ± 0.0
Trp
0.717TrpAla: 0.717 ± 0.597
0.0TrpCys: 0.0 ± 0.0
0.478TrpAsp: 0.478 ± 0.231
1.911TrpGlu: 1.911 ± 0.475
0.478TrpPhe: 0.478 ± 0.234
1.194TrpGly: 1.194 ± 0.746
0.478TrpHis: 0.478 ± 0.359
1.194TrpIle: 1.194 ± 0.657
1.194TrpLys: 1.194 ± 0.442
2.389TrpLeu: 2.389 ± 0.558
0.956TrpMet: 0.956 ± 0.602
0.956TrpAsn: 0.956 ± 0.478
0.717TrpPro: 0.717 ± 0.448
0.478TrpGln: 0.478 ± 0.365
0.478TrpArg: 0.478 ± 0.231
1.194TrpSer: 1.194 ± 0.523
0.478TrpThr: 0.478 ± 0.231
1.672TrpVal: 1.672 ± 1.08
0.239TrpTrp: 0.239 ± 0.282
0.717TrpTyr: 0.717 ± 0.267
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.956TyrAla: 0.956 ± 0.423
0.717TyrCys: 0.717 ± 0.255
2.867TyrAsp: 2.867 ± 1.074
3.344TyrGlu: 3.344 ± 0.592
1.672TyrPhe: 1.672 ± 0.499
1.911TyrGly: 1.911 ± 0.704
0.478TyrHis: 0.478 ± 0.298
2.15TyrIle: 2.15 ± 0.624
4.3TyrLys: 4.3 ± 1.055
5.017TyrLeu: 5.017 ± 1.39
0.478TyrMet: 0.478 ± 0.359
2.867TyrAsn: 2.867 ± 1.452
1.672TyrPro: 1.672 ± 0.422
1.194TyrGln: 1.194 ± 0.523
2.389TyrArg: 2.389 ± 0.554
3.344TyrSer: 3.344 ± 0.729
2.389TyrThr: 2.389 ± 0.55
2.867TyrVal: 2.867 ± 1.173
0.478TyrTrp: 0.478 ± 0.421
4.061TyrTyr: 4.061 ± 0.955
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (4187 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski