Amino acid dipepetide frequency for St Croix River virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.312AlaAla: 6.312 ± 1.348
0.0AlaCys: 0.0 ± 0.0
2.559AlaAsp: 2.559 ± 0.517
3.924AlaGlu: 3.924 ± 0.789
3.924AlaPhe: 3.924 ± 0.981
3.582AlaGly: 3.582 ± 0.774
1.535AlaHis: 1.535 ± 0.358
5.288AlaIle: 5.288 ± 0.54
2.218AlaLys: 2.218 ± 0.859
9.894AlaLeu: 9.894 ± 1.651
2.218AlaMet: 2.218 ± 0.962
3.241AlaAsn: 3.241 ± 1.14
3.924AlaPro: 3.924 ± 0.94
2.559AlaGln: 2.559 ± 0.404
5.288AlaArg: 5.288 ± 0.744
6.994AlaSer: 6.994 ± 1.426
4.947AlaThr: 4.947 ± 0.327
5.118AlaVal: 5.118 ± 1.27
1.365AlaTrp: 1.365 ± 0.399
3.582AlaTyr: 3.582 ± 0.876
0.0AlaXaa: 0.0 ± 0.0
Cys
1.194CysAla: 1.194 ± 0.544
0.0CysCys: 0.0 ± 0.0
0.682CysAsp: 0.682 ± 0.255
0.341CysGlu: 0.341 ± 0.181
0.682CysPhe: 0.682 ± 0.472
0.512CysGly: 0.512 ± 0.271
0.512CysHis: 0.512 ± 0.225
0.0CysIle: 0.0 ± 0.0
0.171CysLys: 0.171 ± 0.224
1.365CysLeu: 1.365 ± 0.361
0.0CysMet: 0.0 ± 0.0
0.341CysAsn: 0.341 ± 0.23
0.341CysPro: 0.341 ± 0.177
0.682CysGln: 0.682 ± 0.394
0.682CysArg: 0.682 ± 0.649
0.512CysSer: 0.512 ± 0.225
0.853CysThr: 0.853 ± 0.376
0.682CysVal: 0.682 ± 0.454
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.412AspAla: 3.412 ± 0.975
0.171AspCys: 0.171 ± 0.188
2.729AspAsp: 2.729 ± 0.602
3.753AspGlu: 3.753 ± 1.013
2.388AspPhe: 2.388 ± 0.668
5.118AspGly: 5.118 ± 0.661
1.876AspHis: 1.876 ± 0.492
3.241AspIle: 3.241 ± 0.796
2.047AspLys: 2.047 ± 0.351
6.141AspLeu: 6.141 ± 1.165
1.706AspMet: 1.706 ± 0.585
1.024AspAsn: 1.024 ± 0.34
4.777AspPro: 4.777 ± 0.78
1.365AspGln: 1.365 ± 0.432
3.412AspArg: 3.412 ± 0.924
3.412AspSer: 3.412 ± 0.323
2.047AspThr: 2.047 ± 0.53
6.141AspVal: 6.141 ± 1.082
0.0AspTrp: 0.0 ± 0.0
1.365AspTyr: 1.365 ± 0.3
0.0AspXaa: 0.0 ± 0.0
Glu
5.288GluAla: 5.288 ± 0.921
0.341GluCys: 0.341 ± 0.325
1.876GluAsp: 1.876 ± 0.737
3.582GluGlu: 3.582 ± 0.85
2.218GluPhe: 2.218 ± 0.425
3.241GluGly: 3.241 ± 0.681
2.388GluHis: 2.388 ± 0.622
3.582GluIle: 3.582 ± 0.856
2.729GluLys: 2.729 ± 1.281
3.582GluLeu: 3.582 ± 0.279
2.218GluMet: 2.218 ± 0.45
2.388GluAsn: 2.388 ± 0.605
1.194GluPro: 1.194 ± 0.52
0.853GluGln: 0.853 ± 0.394
4.947GluArg: 4.947 ± 0.876
3.412GluSer: 3.412 ± 0.606
1.706GluThr: 1.706 ± 0.443
3.071GluVal: 3.071 ± 0.682
0.853GluTrp: 0.853 ± 0.33
2.047GluTyr: 2.047 ± 0.473
0.0GluXaa: 0.0 ± 0.0
Phe
2.218PheAla: 2.218 ± 0.57
0.512PheCys: 0.512 ± 0.487
2.047PheAsp: 2.047 ± 0.626
1.876PheGlu: 1.876 ± 0.721
1.876PhePhe: 1.876 ± 0.471
3.924PheGly: 3.924 ± 0.609
1.876PheHis: 1.876 ± 0.577
1.876PheIle: 1.876 ± 0.319
1.876PheLys: 1.876 ± 0.39
5.459PheLeu: 5.459 ± 0.841
0.171PheMet: 0.171 ± 0.267
1.365PheAsn: 1.365 ± 0.462
2.9PhePro: 2.9 ± 0.757
2.047PheGln: 2.047 ± 0.776
3.582PheArg: 3.582 ± 0.547
6.141PheSer: 6.141 ± 1.228
3.753PheThr: 3.753 ± 0.983
2.559PheVal: 2.559 ± 0.658
0.171PheTrp: 0.171 ± 0.224
2.047PheTyr: 2.047 ± 0.325
0.0PheXaa: 0.0 ± 0.0
Gly
5.118GlyAla: 5.118 ± 1.22
0.512GlyCys: 0.512 ± 0.286
5.971GlyAsp: 5.971 ± 1.012
3.582GlyGlu: 3.582 ± 0.765
2.559GlyPhe: 2.559 ± 0.619
4.606GlyGly: 4.606 ± 1.245
1.194GlyHis: 1.194 ± 0.378
2.559GlyIle: 2.559 ± 0.602
2.218GlyLys: 2.218 ± 0.434
3.412GlyLeu: 3.412 ± 0.556
2.047GlyMet: 2.047 ± 0.563
1.194GlyAsn: 1.194 ± 0.416
5.288GlyPro: 5.288 ± 0.889
1.706GlyGln: 1.706 ± 0.307
3.412GlyArg: 3.412 ± 0.801
4.435GlySer: 4.435 ± 0.435
2.388GlyThr: 2.388 ± 0.444
4.435GlyVal: 4.435 ± 1.024
0.853GlyTrp: 0.853 ± 0.458
2.388GlyTyr: 2.388 ± 0.656
0.0GlyXaa: 0.0 ± 0.0
His
2.559HisAla: 2.559 ± 0.496
0.171HisCys: 0.171 ± 0.137
0.853HisAsp: 0.853 ± 0.251
0.341HisGlu: 0.341 ± 0.181
1.365HisPhe: 1.365 ± 0.317
1.365HisGly: 1.365 ± 0.314
0.512HisHis: 0.512 ± 0.28
2.388HisIle: 2.388 ± 0.376
0.341HisLys: 0.341 ± 0.282
4.947HisLeu: 4.947 ± 1.09
0.853HisMet: 0.853 ± 0.298
0.512HisAsn: 0.512 ± 0.271
3.241HisPro: 3.241 ± 1.299
1.024HisGln: 1.024 ± 0.271
1.706HisArg: 1.706 ± 0.57
3.241HisSer: 3.241 ± 0.812
1.194HisThr: 1.194 ± 0.493
2.729HisVal: 2.729 ± 0.754
0.341HisTrp: 0.341 ± 0.22
1.194HisTyr: 1.194 ± 0.388
0.0HisXaa: 0.0 ± 0.0
Ile
3.924IleAla: 3.924 ± 0.654
0.853IleCys: 0.853 ± 0.36
3.241IleAsp: 3.241 ± 0.968
2.559IleGlu: 2.559 ± 0.589
2.218IlePhe: 2.218 ± 0.6
3.582IleGly: 3.582 ± 0.844
2.218IleHis: 2.218 ± 0.373
2.218IleIle: 2.218 ± 0.532
1.876IleLys: 1.876 ± 0.336
5.629IleLeu: 5.629 ± 0.637
0.853IleMet: 0.853 ± 0.453
1.194IleAsn: 1.194 ± 0.53
4.094IlePro: 4.094 ± 0.786
1.876IleGln: 1.876 ± 0.618
3.241IleArg: 3.241 ± 0.839
5.288IleSer: 5.288 ± 1.082
3.412IleThr: 3.412 ± 0.861
2.9IleVal: 2.9 ± 0.502
0.682IleTrp: 0.682 ± 0.42
1.876IleTyr: 1.876 ± 0.633
0.0IleXaa: 0.0 ± 0.0
Lys
2.218LysAla: 2.218 ± 0.558
0.512LysCys: 0.512 ± 0.312
2.559LysAsp: 2.559 ± 0.647
2.218LysGlu: 2.218 ± 0.971
1.365LysPhe: 1.365 ± 0.433
2.388LysGly: 2.388 ± 0.447
1.365LysHis: 1.365 ± 0.418
1.535LysIle: 1.535 ± 0.369
1.535LysLys: 1.535 ± 0.742
2.559LysLeu: 2.559 ± 0.391
0.682LysMet: 0.682 ± 0.271
1.194LysAsn: 1.194 ± 0.279
1.365LysPro: 1.365 ± 0.336
1.535LysGln: 1.535 ± 0.588
2.388LysArg: 2.388 ± 0.888
1.706LysSer: 1.706 ± 0.364
1.706LysThr: 1.706 ± 0.475
2.388LysVal: 2.388 ± 0.53
0.341LysTrp: 0.341 ± 0.239
1.194LysTyr: 1.194 ± 0.382
0.0LysXaa: 0.0 ± 0.0
Leu
8.018LeuAla: 8.018 ± 1.365
1.706LeuCys: 1.706 ± 0.542
5.8LeuAsp: 5.8 ± 1.101
3.924LeuGlu: 3.924 ± 1.259
6.141LeuPhe: 6.141 ± 1.835
5.459LeuGly: 5.459 ± 0.918
2.9LeuHis: 2.9 ± 0.553
5.459LeuIle: 5.459 ± 0.848
3.924LeuLys: 3.924 ± 0.452
9.724LeuLeu: 9.724 ± 1.909
3.412LeuMet: 3.412 ± 0.822
3.753LeuAsn: 3.753 ± 0.95
5.971LeuPro: 5.971 ± 0.776
3.924LeuGln: 3.924 ± 0.965
8.871LeuArg: 8.871 ± 1.195
9.041LeuSer: 9.041 ± 1.258
6.824LeuThr: 6.824 ± 0.799
5.8LeuVal: 5.8 ± 0.638
1.024LeuTrp: 1.024 ± 0.588
2.218LeuTyr: 2.218 ± 0.541
0.0LeuXaa: 0.0 ± 0.0
Met
2.9MetAla: 2.9 ± 0.837
0.341MetCys: 0.341 ± 0.21
1.365MetAsp: 1.365 ± 0.463
1.706MetGlu: 1.706 ± 0.506
1.024MetPhe: 1.024 ± 0.368
1.194MetGly: 1.194 ± 0.772
1.024MetHis: 1.024 ± 0.382
0.512MetIle: 0.512 ± 0.31
0.341MetLys: 0.341 ± 0.314
3.412MetLeu: 3.412 ± 0.709
1.194MetMet: 1.194 ± 0.314
0.853MetAsn: 0.853 ± 0.26
1.365MetPro: 1.365 ± 0.537
0.512MetGln: 0.512 ± 0.329
1.365MetArg: 1.365 ± 0.359
1.365MetSer: 1.365 ± 0.397
0.682MetThr: 0.682 ± 0.306
0.853MetVal: 0.853 ± 0.347
0.512MetTrp: 0.512 ± 0.208
1.024MetTyr: 1.024 ± 0.31
0.0MetXaa: 0.0 ± 0.0
Asn
2.388AsnAla: 2.388 ± 1.052
0.0AsnCys: 0.0 ± 0.0
1.706AsnAsp: 1.706 ± 0.612
1.365AsnGlu: 1.365 ± 0.331
1.876AsnPhe: 1.876 ± 0.464
1.706AsnGly: 1.706 ± 0.774
0.682AsnHis: 0.682 ± 0.37
1.706AsnIle: 1.706 ± 0.972
0.171AsnLys: 0.171 ± 0.192
4.094AsnLeu: 4.094 ± 0.894
0.853AsnMet: 0.853 ± 0.275
0.512AsnAsn: 0.512 ± 0.236
1.535AsnPro: 1.535 ± 0.349
0.512AsnGln: 0.512 ± 0.26
2.047AsnArg: 2.047 ± 0.743
2.218AsnSer: 2.218 ± 0.491
1.194AsnThr: 1.194 ± 0.492
1.706AsnVal: 1.706 ± 0.377
0.171AsnTrp: 0.171 ± 0.159
0.853AsnTyr: 0.853 ± 0.218
0.0AsnXaa: 0.0 ± 0.0
Pro
3.582ProAla: 3.582 ± 0.834
0.512ProCys: 0.512 ± 0.257
3.582ProAsp: 3.582 ± 0.992
3.071ProGlu: 3.071 ± 0.714
3.924ProPhe: 3.924 ± 0.328
2.559ProGly: 2.559 ± 0.558
2.388ProHis: 2.388 ± 0.519
3.241ProIle: 3.241 ± 0.633
1.365ProLys: 1.365 ± 0.351
6.653ProLeu: 6.653 ± 1.002
0.341ProMet: 0.341 ± 0.181
2.047ProAsn: 2.047 ± 0.422
7.677ProPro: 7.677 ± 1.084
2.559ProGln: 2.559 ± 0.622
3.071ProArg: 3.071 ± 0.885
8.018ProSer: 8.018 ± 1.595
4.265ProThr: 4.265 ± 0.607
4.094ProVal: 4.094 ± 0.884
0.512ProTrp: 0.512 ± 0.34
2.047ProTyr: 2.047 ± 0.404
0.0ProXaa: 0.0 ± 0.0
Gln
2.559GlnAla: 2.559 ± 0.426
0.171GlnCys: 0.171 ± 0.157
2.9GlnAsp: 2.9 ± 0.858
2.047GlnGlu: 2.047 ± 0.739
0.682GlnPhe: 0.682 ± 0.393
2.047GlnGly: 2.047 ± 0.698
1.194GlnHis: 1.194 ± 0.662
2.218GlnIle: 2.218 ± 0.509
1.194GlnLys: 1.194 ± 0.314
3.924GlnLeu: 3.924 ± 0.477
0.512GlnMet: 0.512 ± 0.256
0.341GlnAsn: 0.341 ± 0.175
2.047GlnPro: 2.047 ± 0.361
1.194GlnGln: 1.194 ± 0.388
2.559GlnArg: 2.559 ± 0.778
2.559GlnSer: 2.559 ± 0.644
1.876GlnThr: 1.876 ± 0.742
2.559GlnVal: 2.559 ± 0.518
0.682GlnTrp: 0.682 ± 0.523
0.512GlnTyr: 0.512 ± 0.279
0.0GlnXaa: 0.0 ± 0.0
Arg
5.8ArgAla: 5.8 ± 0.916
0.853ArgCys: 0.853 ± 0.594
3.753ArgAsp: 3.753 ± 0.767
4.435ArgGlu: 4.435 ± 0.832
3.412ArgPhe: 3.412 ± 0.603
5.459ArgGly: 5.459 ± 0.878
1.876ArgHis: 1.876 ± 0.283
3.753ArgIle: 3.753 ± 0.544
3.753ArgLys: 3.753 ± 1.112
5.288ArgLeu: 5.288 ± 0.997
2.218ArgMet: 2.218 ± 0.47
1.876ArgAsn: 1.876 ± 0.657
2.729ArgPro: 2.729 ± 0.783
2.218ArgGln: 2.218 ± 0.742
6.312ArgArg: 6.312 ± 1.268
6.482ArgSer: 6.482 ± 1.099
4.606ArgThr: 4.606 ± 0.647
5.629ArgVal: 5.629 ± 0.572
0.171ArgTrp: 0.171 ± 0.176
2.047ArgTyr: 2.047 ± 0.587
0.0ArgXaa: 0.0 ± 0.0
Ser
6.994SerAla: 6.994 ± 1.557
1.365SerCys: 1.365 ± 0.456
6.653SerAsp: 6.653 ± 0.906
3.924SerGlu: 3.924 ± 0.739
4.435SerPhe: 4.435 ± 1.069
4.094SerGly: 4.094 ± 0.966
2.729SerHis: 2.729 ± 0.638
4.435SerIle: 4.435 ± 0.89
2.047SerLys: 2.047 ± 0.355
11.6SerLeu: 11.6 ± 2.314
1.365SerMet: 1.365 ± 0.306
2.218SerAsn: 2.218 ± 0.503
6.141SerPro: 6.141 ± 0.764
2.729SerGln: 2.729 ± 0.49
7.677SerArg: 7.677 ± 0.827
9.041SerSer: 9.041 ± 1.485
4.777SerThr: 4.777 ± 0.894
5.629SerVal: 5.629 ± 1.148
0.853SerTrp: 0.853 ± 0.35
1.365SerTyr: 1.365 ± 0.433
0.0SerXaa: 0.0 ± 0.0
Thr
4.606ThrAla: 4.606 ± 1.069
0.341ThrCys: 0.341 ± 0.211
2.559ThrAsp: 2.559 ± 0.53
2.218ThrGlu: 2.218 ± 0.522
3.241ThrPhe: 3.241 ± 0.64
2.559ThrGly: 2.559 ± 0.726
1.365ThrHis: 1.365 ± 0.626
3.071ThrIle: 3.071 ± 0.462
2.388ThrLys: 2.388 ± 0.334
6.482ThrLeu: 6.482 ± 0.969
1.194ThrMet: 1.194 ± 0.606
1.535ThrAsn: 1.535 ± 0.518
4.265ThrPro: 4.265 ± 0.808
1.876ThrGln: 1.876 ± 0.586
2.729ThrArg: 2.729 ± 0.371
7.335ThrSer: 7.335 ± 1.108
2.559ThrThr: 2.559 ± 0.714
2.559ThrVal: 2.559 ± 0.63
0.682ThrTrp: 0.682 ± 0.271
1.706ThrTyr: 1.706 ± 0.593
0.0ThrXaa: 0.0 ± 0.0
Val
5.459ValAla: 5.459 ± 1.402
0.853ValCys: 0.853 ± 0.413
3.071ValAsp: 3.071 ± 0.433
4.094ValGlu: 4.094 ± 0.494
3.924ValPhe: 3.924 ± 0.88
4.265ValGly: 4.265 ± 0.904
2.218ValHis: 2.218 ± 0.455
4.435ValIle: 4.435 ± 0.864
1.706ValLys: 1.706 ± 0.476
4.947ValLeu: 4.947 ± 0.813
1.535ValMet: 1.535 ± 0.469
0.853ValAsn: 0.853 ± 0.272
3.924ValPro: 3.924 ± 0.712
3.241ValGln: 3.241 ± 0.804
5.971ValArg: 5.971 ± 0.872
5.118ValSer: 5.118 ± 0.652
4.606ValThr: 4.606 ± 1.25
4.094ValVal: 4.094 ± 0.762
0.341ValTrp: 0.341 ± 0.251
2.047ValTyr: 2.047 ± 0.325
0.0ValXaa: 0.0 ± 0.0
Trp
0.682TrpAla: 0.682 ± 0.405
0.0TrpCys: 0.0 ± 0.0
0.682TrpAsp: 0.682 ± 0.308
1.194TrpGlu: 1.194 ± 0.516
0.512TrpPhe: 0.512 ± 0.412
0.0TrpGly: 0.0 ± 0.0
0.341TrpHis: 0.341 ± 0.215
0.853TrpIle: 0.853 ± 0.44
0.341TrpLys: 0.341 ± 0.217
1.024TrpLeu: 1.024 ± 0.488
0.0TrpMet: 0.0 ± 0.0
0.341TrpAsn: 0.341 ± 0.22
0.341TrpPro: 0.341 ± 0.236
0.171TrpGln: 0.171 ± 0.162
1.024TrpArg: 1.024 ± 0.313
0.512TrpSer: 0.512 ± 0.226
0.341TrpThr: 0.341 ± 0.376
1.024TrpVal: 1.024 ± 0.508
0.171TrpTrp: 0.171 ± 0.162
0.171TrpTyr: 0.171 ± 0.176
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.241TyrAla: 3.241 ± 0.76
0.341TyrCys: 0.341 ± 0.23
1.535TyrAsp: 1.535 ± 0.399
1.365TyrGlu: 1.365 ± 0.385
0.682TyrPhe: 0.682 ± 0.361
2.218TyrGly: 2.218 ± 0.451
0.682TyrHis: 0.682 ± 0.196
1.365TyrIle: 1.365 ± 0.323
0.512TyrLys: 0.512 ± 0.358
3.753TyrLeu: 3.753 ± 0.816
0.0TyrMet: 0.0 ± 0.0
0.512TyrAsn: 0.512 ± 0.271
2.388TyrPro: 2.388 ± 0.377
1.194TyrGln: 1.194 ± 0.412
2.388TyrArg: 2.388 ± 0.761
3.241TyrSer: 3.241 ± 0.633
1.535TyrThr: 1.535 ± 0.492
2.729TyrVal: 2.729 ± 0.594
0.0TyrTrp: 0.0 ± 0.0
0.682TyrTyr: 0.682 ± 0.222
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (5863 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski