Amino acid dipepetide frequency for Perinet vesiculovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.471AlaAla: 3.471 ± 1.832
1.157AlaCys: 1.157 ± 0.52
2.603AlaAsp: 2.603 ± 0.926
1.736AlaGlu: 1.736 ± 0.741
1.157AlaPhe: 1.157 ± 0.411
2.893AlaGly: 2.893 ± 0.441
1.446AlaHis: 1.446 ± 0.523
3.182AlaIle: 3.182 ± 0.8
2.314AlaLys: 2.314 ± 1.041
6.942AlaLeu: 6.942 ± 1.419
0.579AlaMet: 0.579 ± 0.329
3.471AlaAsn: 3.471 ± 0.92
0.868AlaPro: 0.868 ± 0.978
2.025AlaGln: 2.025 ± 1.04
1.736AlaArg: 1.736 ± 0.471
4.05AlaSer: 4.05 ± 0.721
2.314AlaThr: 2.314 ± 0.462
2.603AlaVal: 2.603 ± 0.539
1.157AlaTrp: 1.157 ± 1.06
1.446AlaTyr: 1.446 ± 0.898
0.0AlaXaa: 0.0 ± 0.0
Cys
1.446CysAla: 1.446 ± 0.308
0.289CysCys: 0.289 ± 0.164
0.868CysAsp: 0.868 ± 0.256
0.579CysGlu: 0.579 ± 0.26
0.289CysPhe: 0.289 ± 0.368
1.157CysGly: 1.157 ± 0.343
0.579CysHis: 0.579 ± 0.704
0.579CysIle: 0.579 ± 0.26
2.314CysLys: 2.314 ± 1.04
1.446CysLeu: 1.446 ± 0.439
0.289CysMet: 0.289 ± 0.164
0.579CysAsn: 0.579 ± 0.392
0.579CysPro: 0.579 ± 0.704
0.868CysGln: 0.868 ± 0.256
1.736CysArg: 1.736 ± 0.471
1.736CysSer: 1.736 ± 0.781
0.868CysThr: 0.868 ± 0.589
0.579CysVal: 0.579 ± 0.329
0.579CysTrp: 0.579 ± 0.329
0.289CysTyr: 0.289 ± 0.164
0.0CysXaa: 0.0 ± 0.0
Asp
3.182AspAla: 3.182 ± 1.181
0.868AspCys: 0.868 ± 0.392
3.76AspAsp: 3.76 ± 2.268
3.471AspGlu: 3.471 ± 1.09
2.314AspPhe: 2.314 ± 0.462
3.182AspGly: 3.182 ± 1.253
1.736AspHis: 1.736 ± 0.444
3.182AspIle: 3.182 ± 0.659
4.05AspLys: 4.05 ± 1.139
8.389AspLeu: 8.389 ± 0.81
2.893AspMet: 2.893 ± 0.668
2.893AspAsn: 2.893 ± 1.331
4.339AspPro: 4.339 ± 0.371
1.446AspGln: 1.446 ± 0.473
1.446AspArg: 1.446 ± 0.592
2.603AspSer: 2.603 ± 0.676
3.471AspThr: 3.471 ± 0.67
3.471AspVal: 3.471 ± 0.863
0.868AspTrp: 0.868 ± 0.367
2.603AspTyr: 2.603 ± 0.811
0.0AspXaa: 0.0 ± 0.0
Glu
2.025GluAla: 2.025 ± 0.962
1.157GluCys: 1.157 ± 0.343
4.918GluAsp: 4.918 ± 1.386
3.76GluGlu: 3.76 ± 2.607
4.918GluPhe: 4.918 ± 0.782
3.471GluGly: 3.471 ± 1.392
1.157GluHis: 1.157 ± 0.465
2.893GluIle: 2.893 ± 0.747
4.05GluLys: 4.05 ± 2.041
5.496GluLeu: 5.496 ± 0.363
0.579GluMet: 0.579 ± 0.26
2.314GluAsn: 2.314 ± 1.304
2.603GluPro: 2.603 ± 0.688
1.446GluGln: 1.446 ± 0.738
0.868GluArg: 0.868 ± 0.256
4.918GluSer: 4.918 ± 1.266
3.471GluThr: 3.471 ± 1.106
2.603GluVal: 2.603 ± 0.676
1.446GluTrp: 1.446 ± 0.993
3.182GluTyr: 3.182 ± 0.94
0.0GluXaa: 0.0 ± 0.0
Phe
2.314PheAla: 2.314 ± 0.623
0.289PheCys: 0.289 ± 0.164
2.025PheAsp: 2.025 ± 0.587
2.025PheGlu: 2.025 ± 0.423
1.157PhePhe: 1.157 ± 0.47
3.182PheGly: 3.182 ± 0.908
1.157PheHis: 1.157 ± 0.657
1.446PheIle: 1.446 ± 0.489
2.893PheLys: 2.893 ± 0.593
5.207PheLeu: 5.207 ± 1.634
1.157PheMet: 1.157 ± 0.658
3.182PheAsn: 3.182 ± 0.849
3.76PhePro: 3.76 ± 1.513
1.736PheGln: 1.736 ± 0.62
2.893PheArg: 2.893 ± 0.947
2.025PheSer: 2.025 ± 0.804
2.025PheThr: 2.025 ± 0.43
2.314PheVal: 2.314 ± 0.601
1.157PheTrp: 1.157 ± 0.47
1.157PheTyr: 1.157 ± 0.662
0.0PheXaa: 0.0 ± 0.0
Gly
0.868GlyAla: 0.868 ± 0.8
0.289GlyCys: 0.289 ± 0.164
2.603GlyAsp: 2.603 ± 0.792
1.446GlyGlu: 1.446 ± 0.738
2.025GlyPhe: 2.025 ± 0.506
2.603GlyGly: 2.603 ± 0.664
0.868GlyHis: 0.868 ± 0.493
4.628GlyIle: 4.628 ± 2.18
3.76GlyLys: 3.76 ± 0.643
11.571GlyLeu: 11.571 ± 0.177
1.157GlyMet: 1.157 ± 0.465
2.603GlyAsn: 2.603 ± 0.661
2.603GlyPro: 2.603 ± 1.365
2.893GlyGln: 2.893 ± 1.209
4.628GlyArg: 4.628 ± 1.357
4.628GlySer: 4.628 ± 0.755
4.339GlyThr: 4.339 ± 1.039
2.603GlyVal: 2.603 ± 0.539
1.157GlyTrp: 1.157 ± 0.493
1.446GlyTyr: 1.446 ± 0.49
0.0GlyXaa: 0.0 ± 0.0
His
0.579HisAla: 0.579 ± 0.26
0.289HisCys: 0.289 ± 0.164
0.868HisAsp: 0.868 ± 0.256
1.446HisGlu: 1.446 ± 0.308
2.314HisPhe: 2.314 ± 0.897
1.446HisGly: 1.446 ± 0.447
0.868HisHis: 0.868 ± 0.801
1.736HisIle: 1.736 ± 0.987
1.157HisLys: 1.157 ± 0.47
1.446HisLeu: 1.446 ± 0.592
0.289HisMet: 0.289 ± 0.477
0.579HisAsn: 0.579 ± 0.26
2.025HisPro: 2.025 ± 0.478
0.868HisGln: 0.868 ± 0.493
1.157HisArg: 1.157 ± 0.658
2.025HisSer: 2.025 ± 1.073
0.868HisThr: 0.868 ± 0.493
2.025HisVal: 2.025 ± 0.864
1.157HisTrp: 1.157 ± 0.658
0.868HisTyr: 0.868 ± 0.493
0.0HisXaa: 0.0 ± 0.0
Ile
2.603IleAla: 2.603 ± 0.433
2.603IleCys: 2.603 ± 0.769
5.496IleAsp: 5.496 ± 0.849
4.339IleGlu: 4.339 ± 0.917
2.314IlePhe: 2.314 ± 0.486
3.471IleGly: 3.471 ± 1.318
1.157IleHis: 1.157 ± 0.343
2.893IleIle: 2.893 ± 1.331
4.05IleLys: 4.05 ± 1.548
4.628IleLeu: 4.628 ± 0.682
0.579IleMet: 0.579 ± 0.329
2.893IleAsn: 2.893 ± 0.593
3.471IlePro: 3.471 ± 0.67
2.314IleGln: 2.314 ± 0.466
6.653IleArg: 6.653 ± 2.877
6.075IleSer: 6.075 ± 1.623
2.314IleThr: 2.314 ± 1.309
3.76IleVal: 3.76 ± 1.21
0.289IleTrp: 0.289 ± 0.164
2.603IleTyr: 2.603 ± 1.48
0.0IleXaa: 0.0 ± 0.0
Lys
2.025LysAla: 2.025 ± 1.597
1.446LysCys: 1.446 ± 0.439
4.339LysAsp: 4.339 ± 0.588
4.05LysGlu: 4.05 ± 0.84
3.182LysPhe: 3.182 ± 0.942
4.05LysGly: 4.05 ± 0.561
0.289LysHis: 0.289 ± 0.368
4.628LysIle: 4.628 ± 1.373
5.785LysLys: 5.785 ± 3.771
5.207LysLeu: 5.207 ± 1.138
1.157LysMet: 1.157 ± 0.658
3.76LysAsn: 3.76 ± 1.055
2.025LysPro: 2.025 ± 1.2
0.868LysGln: 0.868 ± 0.366
3.76LysArg: 3.76 ± 1.017
5.785LysSer: 5.785 ± 1.097
4.918LysThr: 4.918 ± 0.77
5.496LysVal: 5.496 ± 1.038
1.446LysTrp: 1.446 ± 0.49
1.157LysTyr: 1.157 ± 0.656
0.0LysXaa: 0.0 ± 0.0
Leu
4.628LeuAla: 4.628 ± 0.329
1.736LeuCys: 1.736 ± 0.781
5.207LeuAsp: 5.207 ± 1.307
6.364LeuGlu: 6.364 ± 1.213
4.339LeuPhe: 4.339 ± 0.809
6.075LeuGly: 6.075 ± 0.845
2.603LeuHis: 2.603 ± 0.78
8.678LeuIle: 8.678 ± 1.621
7.81LeuLys: 7.81 ± 2.111
7.81LeuLeu: 7.81 ± 2.02
3.471LeuMet: 3.471 ± 1.252
4.05LeuAsn: 4.05 ± 0.89
4.339LeuPro: 4.339 ± 1.859
3.182LeuGln: 3.182 ± 0.422
5.785LeuArg: 5.785 ± 1.622
9.257LeuSer: 9.257 ± 1.214
5.207LeuThr: 5.207 ± 1.122
4.339LeuVal: 4.339 ± 0.731
1.446LeuTrp: 1.446 ± 0.308
2.893LeuTyr: 2.893 ± 0.946
0.0LeuXaa: 0.0 ± 0.0
Met
0.868MetAla: 0.868 ± 0.367
0.0MetCys: 0.0 ± 0.0
0.868MetAsp: 0.868 ± 0.433
2.893MetGlu: 2.893 ± 0.796
1.157MetPhe: 1.157 ± 0.47
1.157MetGly: 1.157 ± 0.658
1.157MetHis: 1.157 ± 0.411
2.025MetIle: 2.025 ± 0.774
1.736MetLys: 1.736 ± 0.735
2.603MetLeu: 2.603 ± 0.676
2.025MetMet: 2.025 ± 0.883
1.736MetAsn: 1.736 ± 0.483
0.579MetPro: 0.579 ± 0.329
1.157MetGln: 1.157 ± 0.465
0.868MetArg: 0.868 ± 0.561
2.893MetSer: 2.893 ± 0.367
2.025MetThr: 2.025 ± 0.583
0.868MetVal: 0.868 ± 0.493
0.289MetTrp: 0.289 ± 0.164
0.289MetTyr: 0.289 ± 0.532
0.0MetXaa: 0.0 ± 0.0
Asn
2.893AsnAla: 2.893 ± 0.528
0.0AsnCys: 0.0 ± 0.0
2.603AsnAsp: 2.603 ± 1.022
1.157AsnGlu: 1.157 ± 0.403
1.446AsnPhe: 1.446 ± 0.507
3.182AsnGly: 3.182 ± 1.064
1.446AsnHis: 1.446 ± 0.556
2.025AsnIle: 2.025 ± 0.587
2.025AsnLys: 2.025 ± 0.355
5.496AsnLeu: 5.496 ± 1.293
0.868AsnMet: 0.868 ± 0.364
2.893AsnAsn: 2.893 ± 1.862
3.182AsnPro: 3.182 ± 0.316
2.025AsnGln: 2.025 ± 0.566
2.025AsnArg: 2.025 ± 0.908
5.207AsnSer: 5.207 ± 0.686
2.314AsnThr: 2.314 ± 0.931
0.868AsnVal: 0.868 ± 0.367
2.314AsnTrp: 2.314 ± 0.541
2.893AsnTyr: 2.893 ± 0.796
0.0AsnXaa: 0.0 ± 0.0
Pro
3.76ProAla: 3.76 ± 0.878
0.289ProCys: 0.289 ± 0.368
4.05ProAsp: 4.05 ± 0.699
3.182ProGlu: 3.182 ± 2.745
1.736ProPhe: 1.736 ± 1.516
2.603ProGly: 2.603 ± 1.01
1.736ProHis: 1.736 ± 0.642
2.893ProIle: 2.893 ± 0.979
1.446ProLys: 1.446 ± 0.662
3.182ProLeu: 3.182 ± 1.091
1.736ProMet: 1.736 ± 1.035
2.314ProAsn: 2.314 ± 0.901
2.603ProPro: 2.603 ± 0.912
0.868ProGln: 0.868 ± 0.597
1.446ProArg: 1.446 ± 0.473
4.918ProSer: 4.918 ± 1.68
5.207ProThr: 5.207 ± 1.021
1.736ProVal: 1.736 ± 0.419
0.579ProTrp: 0.579 ± 0.392
2.025ProTyr: 2.025 ± 1.395
0.0ProXaa: 0.0 ± 0.0
Gln
1.736GlnAla: 1.736 ± 0.633
0.868GlnCys: 0.868 ± 0.367
1.157GlnAsp: 1.157 ± 0.658
3.471GlnGlu: 3.471 ± 2.06
1.446GlnPhe: 1.446 ± 0.489
2.025GlnGly: 2.025 ± 0.517
0.579GlnHis: 0.579 ± 0.329
1.736GlnIle: 1.736 ± 0.513
1.736GlnLys: 1.736 ± 0.513
2.603GlnLeu: 2.603 ± 1.052
0.868GlnMet: 0.868 ± 0.433
1.157GlnAsn: 1.157 ± 0.343
0.868GlnPro: 0.868 ± 0.392
0.289GlnGln: 0.289 ± 0.164
1.736GlnArg: 1.736 ± 0.383
2.893GlnSer: 2.893 ± 0.404
2.603GlnThr: 2.603 ± 0.664
2.314GlnVal: 2.314 ± 0.799
0.579GlnTrp: 0.579 ± 0.649
1.446GlnTyr: 1.446 ± 0.657
0.0GlnXaa: 0.0 ± 0.0
Arg
2.893ArgAla: 2.893 ± 0.894
1.157ArgCys: 1.157 ± 0.47
2.314ArgAsp: 2.314 ± 0.946
4.05ArgGlu: 4.05 ± 1.564
3.182ArgPhe: 3.182 ± 0.641
2.314ArgGly: 2.314 ± 0.931
0.579ArgHis: 0.579 ± 0.329
3.471ArgIle: 3.471 ± 0.632
3.471ArgLys: 3.471 ± 0.914
3.182ArgLeu: 3.182 ± 0.794
3.182ArgMet: 3.182 ± 1.411
2.893ArgAsn: 2.893 ± 1.185
1.157ArgPro: 1.157 ± 0.403
0.579ArgGln: 0.579 ± 0.329
2.603ArgArg: 2.603 ± 0.811
4.918ArgSer: 4.918 ± 1.69
3.182ArgThr: 3.182 ± 0.407
4.918ArgVal: 4.918 ± 1.815
0.868ArgTrp: 0.868 ± 0.256
2.603ArgTyr: 2.603 ± 0.649
0.0ArgXaa: 0.0 ± 0.0
Ser
4.918SerAla: 4.918 ± 1.224
0.579SerCys: 0.579 ± 0.26
5.785SerAsp: 5.785 ± 1.323
5.785SerGlu: 5.785 ± 1.234
2.893SerPhe: 2.893 ± 0.869
4.339SerGly: 4.339 ± 0.809
3.182SerHis: 3.182 ± 1.388
5.785SerIle: 5.785 ± 2.503
4.918SerLys: 4.918 ± 0.798
6.653SerLeu: 6.653 ± 0.901
0.868SerMet: 0.868 ± 0.493
3.182SerAsn: 3.182 ± 0.684
4.918SerPro: 4.918 ± 1.943
3.76SerGln: 3.76 ± 1.014
4.628SerArg: 4.628 ± 0.79
8.1SerSer: 8.1 ± 1.528
4.918SerThr: 4.918 ± 0.249
6.075SerVal: 6.075 ± 1.183
1.446SerTrp: 1.446 ± 0.822
3.471SerTyr: 3.471 ± 0.792
0.0SerXaa: 0.0 ± 0.0
Thr
3.182ThrAla: 3.182 ± 0.675
2.025ThrCys: 2.025 ± 0.583
2.893ThrAsp: 2.893 ± 1.575
2.893ThrGlu: 2.893 ± 1.23
1.736ThrPhe: 1.736 ± 0.345
3.76ThrGly: 3.76 ± 0.742
0.868ThrHis: 0.868 ± 0.256
4.918ThrIle: 4.918 ± 0.72
4.339ThrLys: 4.339 ± 0.82
7.232ThrLeu: 7.232 ± 1.196
3.182ThrMet: 3.182 ± 1.388
2.025ThrAsn: 2.025 ± 0.774
2.603ThrPro: 2.603 ± 1.643
1.736ThrGln: 1.736 ± 0.413
2.314ThrArg: 2.314 ± 0.939
3.471ThrSer: 3.471 ± 0.897
3.471ThrThr: 3.471 ± 1.588
4.05ThrVal: 4.05 ± 0.84
1.736ThrTrp: 1.736 ± 0.783
1.446ThrTyr: 1.446 ± 1.168
0.0ThrXaa: 0.0 ± 0.0
Val
1.736ValAla: 1.736 ± 0.62
1.446ValCys: 1.446 ± 0.592
4.918ValAsp: 4.918 ± 0.782
1.736ValGlu: 1.736 ± 0.506
2.025ValPhe: 2.025 ± 0.937
3.76ValGly: 3.76 ± 0.738
1.446ValHis: 1.446 ± 0.592
4.339ValIle: 4.339 ± 1.746
3.182ValLys: 3.182 ± 0.908
4.628ValLeu: 4.628 ± 0.816
1.157ValMet: 1.157 ± 0.668
1.736ValAsn: 1.736 ± 1.176
4.339ValPro: 4.339 ± 0.695
2.314ValGln: 2.314 ± 0.486
4.05ValArg: 4.05 ± 0.971
5.207ValSer: 5.207 ± 1.178
4.05ValThr: 4.05 ± 1.129
2.025ValVal: 2.025 ± 0.35
0.289ValTrp: 0.289 ± 0.352
1.157ValTyr: 1.157 ± 0.343
0.0ValXaa: 0.0 ± 0.0
Trp
0.289TrpAla: 0.289 ± 0.164
0.0TrpCys: 0.0 ± 0.0
2.025TrpAsp: 2.025 ± 0.632
1.446TrpGlu: 1.446 ± 0.439
1.446TrpPhe: 1.446 ± 0.738
1.736TrpGly: 1.736 ± 0.67
0.289TrpHis: 0.289 ± 0.164
1.157TrpIle: 1.157 ± 0.52
1.446TrpLys: 1.446 ± 0.337
1.736TrpLeu: 1.736 ± 0.874
0.289TrpMet: 0.289 ± 0.368
0.868TrpAsn: 0.868 ± 0.493
0.289TrpPro: 0.289 ± 0.164
0.0TrpGln: 0.0 ± 0.0
0.868TrpArg: 0.868 ± 0.493
2.025TrpSer: 2.025 ± 0.804
0.868TrpThr: 0.868 ± 0.366
1.736TrpVal: 1.736 ± 0.413
0.289TrpTrp: 0.289 ± 0.368
0.289TrpTyr: 0.289 ± 0.352
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.736TyrAla: 1.736 ± 0.783
1.157TyrCys: 1.157 ± 1.246
1.157TyrAsp: 1.157 ± 0.343
1.446TyrGlu: 1.446 ± 0.507
2.025TyrPhe: 2.025 ± 0.587
2.314TyrGly: 2.314 ± 0.462
0.868TyrHis: 0.868 ± 0.256
2.314TyrIle: 2.314 ± 0.541
2.893TyrLys: 2.893 ± 0.764
3.76TyrLeu: 3.76 ± 0.742
0.579TyrMet: 0.579 ± 0.58
1.736TyrAsn: 1.736 ± 0.483
1.157TyrPro: 1.157 ± 0.403
1.736TyrGln: 1.736 ± 0.383
2.314TyrArg: 2.314 ± 0.811
3.471TyrSer: 3.471 ± 1.072
1.446TyrThr: 1.446 ± 0.632
1.157TyrVal: 1.157 ± 1.299
0.0TyrTrp: 0.0 ± 0.0
0.868TyrTyr: 0.868 ± 0.589
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3458 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski