Amino acid dipepetide frequency for Australian bat lyssavirus (isolate Bat/AUS/1996) (ABLV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.706AlaAla: 1.706 ± 1.156
0.487AlaCys: 0.487 ± 0.542
1.706AlaAsp: 1.706 ± 0.998
4.386AlaGlu: 4.386 ± 1.543
0.731AlaPhe: 0.731 ± 0.339
1.462AlaGly: 1.462 ± 0.568
1.949AlaHis: 1.949 ± 1.039
3.655AlaIle: 3.655 ± 0.955
2.437AlaLys: 2.437 ± 0.958
4.142AlaLeu: 4.142 ± 0.826
0.975AlaMet: 0.975 ± 0.581
1.706AlaAsn: 1.706 ± 0.432
3.411AlaPro: 3.411 ± 1.241
1.706AlaGln: 1.706 ± 1.025
2.924AlaArg: 2.924 ± 0.547
3.168AlaSer: 3.168 ± 0.354
1.706AlaThr: 1.706 ± 0.669
1.706AlaVal: 1.706 ± 0.899
0.244AlaTrp: 0.244 ± 0.146
1.462AlaTyr: 1.462 ± 0.675
0.0AlaXaa: 0.0 ± 0.0
Cys
0.731CysAla: 0.731 ± 0.584
0.487CysCys: 0.487 ± 0.247
0.487CysAsp: 0.487 ± 0.247
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.218CysGly: 1.218 ± 0.496
0.487CysHis: 0.487 ± 0.542
1.949CysIle: 1.949 ± 0.983
0.731CysLys: 0.731 ± 0.758
2.437CysLeu: 2.437 ± 0.949
0.487CysMet: 0.487 ± 0.568
0.244CysAsn: 0.244 ± 0.146
1.706CysPro: 1.706 ± 0.373
0.731CysGln: 0.731 ± 0.41
0.487CysArg: 0.487 ± 0.606
2.924CysSer: 2.924 ± 0.651
0.731CysThr: 0.731 ± 0.909
0.0CysVal: 0.0 ± 0.0
0.244CysTrp: 0.244 ± 0.146
0.975CysTyr: 0.975 ± 0.434
0.0CysXaa: 0.0 ± 0.0
Asp
1.949AspAla: 1.949 ± 0.715
0.244AspCys: 0.244 ± 0.311
7.797AspAsp: 7.797 ± 3.468
2.68AspGlu: 2.68 ± 1.605
3.168AspPhe: 3.168 ± 0.749
3.411AspGly: 3.411 ± 0.354
1.218AspHis: 1.218 ± 0.397
3.899AspIle: 3.899 ± 0.869
2.193AspLys: 2.193 ± 0.57
7.554AspLeu: 7.554 ± 1.485
0.975AspMet: 0.975 ± 0.498
2.924AspAsn: 2.924 ± 0.875
4.873AspPro: 4.873 ± 0.711
2.437AspGln: 2.437 ± 0.859
1.949AspArg: 1.949 ± 0.658
2.193AspSer: 2.193 ± 0.415
0.731AspThr: 0.731 ± 0.439
2.68AspVal: 2.68 ± 0.6
1.462AspTrp: 1.462 ± 0.5
2.924AspTyr: 2.924 ± 0.841
0.0AspXaa: 0.0 ± 0.0
Glu
4.142GluAla: 4.142 ± 1.879
0.487GluCys: 0.487 ± 0.247
4.873GluAsp: 4.873 ± 1.574
5.117GluGlu: 5.117 ± 1.61
1.949GluPhe: 1.949 ± 0.688
3.899GluGly: 3.899 ± 0.302
1.462GluHis: 1.462 ± 0.843
5.361GluIle: 5.361 ± 1.729
4.386GluLys: 4.386 ± 0.746
4.142GluLeu: 4.142 ± 0.878
1.949GluMet: 1.949 ± 0.309
1.462GluAsn: 1.462 ± 0.499
2.924GluPro: 2.924 ± 0.747
1.706GluGln: 1.706 ± 1.337
2.68GluArg: 2.68 ± 0.747
7.066GluSer: 7.066 ± 1.438
2.437GluThr: 2.437 ± 1.367
4.142GluVal: 4.142 ± 0.607
0.731GluTrp: 0.731 ± 0.27
0.975GluTyr: 0.975 ± 0.859
0.0GluXaa: 0.0 ± 0.0
Phe
1.218PheAla: 1.218 ± 0.614
0.487PheCys: 0.487 ± 0.247
1.218PheAsp: 1.218 ± 0.732
3.168PheGlu: 3.168 ± 1.572
3.655PhePhe: 3.655 ± 0.91
1.462PheGly: 1.462 ± 0.47
1.706PheHis: 1.706 ± 0.815
1.949PheIle: 1.949 ± 0.82
2.68PheLys: 2.68 ± 0.308
3.899PheLeu: 3.899 ± 0.788
0.244PheMet: 0.244 ± 0.146
1.949PheAsn: 1.949 ± 0.536
4.142PhePro: 4.142 ± 0.856
3.168PheGln: 3.168 ± 1.594
3.655PheArg: 3.655 ± 0.795
3.411PheSer: 3.411 ± 0.624
1.706PheThr: 1.706 ± 0.889
1.949PheVal: 1.949 ± 0.321
0.244PheTrp: 0.244 ± 0.146
0.731PheTyr: 0.731 ± 0.439
0.0PheXaa: 0.0 ± 0.0
Gly
2.68GlyAla: 2.68 ± 0.592
1.218GlyCys: 1.218 ± 0.503
2.437GlyAsp: 2.437 ± 0.724
3.411GlyGlu: 3.411 ± 1.171
2.437GlyPhe: 2.437 ± 1.578
3.411GlyGly: 3.411 ± 1.24
1.218GlyHis: 1.218 ± 0.336
4.63GlyIle: 4.63 ± 2.006
3.655GlyLys: 3.655 ± 1.79
7.066GlyLeu: 7.066 ± 1.003
1.218GlyMet: 1.218 ± 0.568
2.193GlyAsn: 2.193 ± 0.834
3.411GlyPro: 3.411 ± 0.538
2.193GlyGln: 2.193 ± 1.144
2.924GlyArg: 2.924 ± 0.725
4.142GlySer: 4.142 ± 0.751
2.193GlyThr: 2.193 ± 1.186
1.949GlyVal: 1.949 ± 0.647
0.975GlyTrp: 0.975 ± 0.565
2.193GlyTyr: 2.193 ± 0.53
0.0GlyXaa: 0.0 ± 0.0
His
0.731HisAla: 0.731 ± 0.339
0.0HisCys: 0.0 ± 0.0
1.218HisAsp: 1.218 ± 0.496
1.218HisGlu: 1.218 ± 0.336
1.218HisPhe: 1.218 ± 0.552
0.731HisGly: 0.731 ± 0.439
0.731HisHis: 0.731 ± 0.337
2.193HisIle: 2.193 ± 0.698
1.218HisLys: 1.218 ± 0.632
2.924HisLeu: 2.924 ± 0.883
0.0HisMet: 0.0 ± 0.0
0.244HisAsn: 0.244 ± 0.303
2.193HisPro: 2.193 ± 0.827
1.462HisGln: 1.462 ± 0.679
0.731HisArg: 0.731 ± 0.41
2.924HisSer: 2.924 ± 0.347
0.487HisThr: 0.487 ± 0.423
1.462HisVal: 1.462 ± 0.681
0.975HisTrp: 0.975 ± 0.357
0.731HisTyr: 0.731 ± 0.27
0.0HisXaa: 0.0 ± 0.0
Ile
3.411IleAla: 3.411 ± 1.447
1.706IleCys: 1.706 ± 0.532
4.386IleAsp: 4.386 ± 0.802
3.411IleGlu: 3.411 ± 0.844
3.655IlePhe: 3.655 ± 0.702
3.411IleGly: 3.411 ± 0.824
2.193IleHis: 2.193 ± 0.773
6.335IleIle: 6.335 ± 1.892
2.193IleLys: 2.193 ± 0.623
8.041IleLeu: 8.041 ± 1.048
1.462IleMet: 1.462 ± 0.5
3.168IleAsn: 3.168 ± 1.208
3.168IlePro: 3.168 ± 1.423
3.411IleGln: 3.411 ± 1.181
5.361IleArg: 5.361 ± 1.062
6.092IleSer: 6.092 ± 1.16
4.386IleThr: 4.386 ± 1.059
4.63IleVal: 4.63 ± 0.695
1.706IleTrp: 1.706 ± 0.628
2.437IleTyr: 2.437 ± 1.19
0.0IleXaa: 0.0 ± 0.0
Lys
1.706LysAla: 1.706 ± 0.889
0.975LysCys: 0.975 ± 0.357
2.437LysAsp: 2.437 ± 0.962
4.63LysGlu: 4.63 ± 1.39
2.193LysPhe: 2.193 ± 1.054
2.68LysGly: 2.68 ± 1.268
0.731LysHis: 0.731 ± 0.27
5.604LysIle: 5.604 ± 1.693
5.117LysLys: 5.117 ± 1.378
5.361LysLeu: 5.361 ± 0.682
3.168LysMet: 3.168 ± 0.993
0.731LysAsn: 0.731 ± 0.93
4.386LysPro: 4.386 ± 1.298
0.487LysGln: 0.487 ± 0.423
2.68LysArg: 2.68 ± 1.078
5.604LysSer: 5.604 ± 1.147
4.63LysThr: 4.63 ± 0.956
4.873LysVal: 4.873 ± 1.119
0.975LysTrp: 0.975 ± 0.357
1.218LysTyr: 1.218 ± 0.597
0.0LysXaa: 0.0 ± 0.0
Leu
6.823LeuAla: 6.823 ± 0.809
1.949LeuCys: 1.949 ± 0.638
6.579LeuAsp: 6.579 ± 1.265
4.63LeuGlu: 4.63 ± 1.365
3.411LeuPhe: 3.411 ± 1.493
5.117LeuGly: 5.117 ± 0.825
1.218LeuHis: 1.218 ± 0.773
8.041LeuIle: 8.041 ± 1.63
6.092LeuLys: 6.092 ± 1.037
9.747LeuLeu: 9.747 ± 2.008
2.924LeuMet: 2.924 ± 1.185
4.63LeuAsn: 4.63 ± 0.972
4.142LeuPro: 4.142 ± 1.023
2.68LeuGln: 2.68 ± 0.342
7.797LeuArg: 7.797 ± 1.502
10.478LeuSer: 10.478 ± 1.64
4.386LeuThr: 4.386 ± 1.531
6.092LeuVal: 6.092 ± 0.72
1.706LeuTrp: 1.706 ± 0.752
4.63LeuTyr: 4.63 ± 0.709
0.0LeuXaa: 0.0 ± 0.0
Met
1.949MetAla: 1.949 ± 1.238
0.731MetCys: 0.731 ± 0.339
0.975MetAsp: 0.975 ± 0.4
0.975MetGlu: 0.975 ± 0.845
1.218MetPhe: 1.218 ± 0.552
0.244MetGly: 0.244 ± 0.311
0.0MetHis: 0.0 ± 0.0
1.218MetIle: 1.218 ± 0.557
2.193MetLys: 2.193 ± 1.121
2.193MetLeu: 2.193 ± 1.022
0.487MetMet: 0.487 ± 0.291
2.68MetAsn: 2.68 ± 1.926
0.244MetPro: 0.244 ± 0.303
0.975MetGln: 0.975 ± 0.776
1.462MetArg: 1.462 ± 0.821
4.873MetSer: 4.873 ± 0.615
1.706MetThr: 1.706 ± 0.815
0.487MetVal: 0.487 ± 0.293
0.0MetTrp: 0.0 ± 0.0
0.244MetTyr: 0.244 ± 0.146
0.0MetXaa: 0.0 ± 0.0
Asn
1.949AsnAla: 1.949 ± 0.813
0.975AsnCys: 0.975 ± 0.467
2.437AsnAsp: 2.437 ± 0.624
1.218AsnGlu: 1.218 ± 1.012
3.168AsnPhe: 3.168 ± 1.253
1.706AsnGly: 1.706 ± 0.9
1.462AsnHis: 1.462 ± 0.578
4.873AsnIle: 4.873 ± 1.052
1.462AsnLys: 1.462 ± 0.54
4.142AsnLeu: 4.142 ± 0.499
0.975AsnMet: 0.975 ± 0.803
1.462AsnAsn: 1.462 ± 0.45
3.168AsnPro: 3.168 ± 1.008
1.218AsnGln: 1.218 ± 0.568
2.924AsnArg: 2.924 ± 0.875
4.386AsnSer: 4.386 ± 1.336
1.218AsnThr: 1.218 ± 0.45
0.487AsnVal: 0.487 ± 0.293
0.975AsnTrp: 0.975 ± 0.498
1.218AsnTyr: 1.218 ± 0.552
0.0AsnXaa: 0.0 ± 0.0
Pro
1.218ProAla: 1.218 ± 0.614
0.731ProCys: 0.731 ± 0.453
2.924ProAsp: 2.924 ± 0.941
7.066ProGlu: 7.066 ± 1.843
0.244ProPhe: 0.244 ± 0.311
1.949ProGly: 1.949 ± 0.431
0.731ProHis: 0.731 ± 0.584
2.68ProIle: 2.68 ± 0.805
2.924ProLys: 2.924 ± 0.677
6.579ProLeu: 6.579 ± 1.234
0.487ProMet: 0.487 ± 0.293
2.924ProAsn: 2.924 ± 1.113
3.168ProPro: 3.168 ± 1.205
1.706ProGln: 1.706 ± 0.586
1.218ProArg: 1.218 ± 0.475
8.285ProSer: 8.285 ± 1.862
2.437ProThr: 2.437 ± 0.287
2.437ProVal: 2.437 ± 1.464
0.244ProTrp: 0.244 ± 0.303
1.949ProTyr: 1.949 ± 0.687
0.0ProXaa: 0.0 ± 0.0
Gln
0.975GlnAla: 0.975 ± 0.434
0.487GlnCys: 0.487 ± 0.402
1.949GlnAsp: 1.949 ± 0.592
1.218GlnGlu: 1.218 ± 0.45
1.949GlnPhe: 1.949 ± 0.432
1.949GlnGly: 1.949 ± 0.514
1.218GlnHis: 1.218 ± 0.475
3.655GlnIle: 3.655 ± 1.14
1.706GlnLys: 1.706 ± 0.53
4.142GlnLeu: 4.142 ± 1.629
0.487GlnMet: 0.487 ± 0.547
1.218GlnAsn: 1.218 ± 0.336
0.244GlnPro: 0.244 ± 0.146
0.487GlnGln: 0.487 ± 0.402
2.193GlnArg: 2.193 ± 1.094
3.411GlnSer: 3.411 ± 0.909
2.68GlnThr: 2.68 ± 0.952
3.411GlnVal: 3.411 ± 1.301
0.244GlnTrp: 0.244 ± 0.444
0.244GlnTyr: 0.244 ± 0.311
0.0GlnXaa: 0.0 ± 0.0
Arg
2.193ArgAla: 2.193 ± 0.84
1.218ArgCys: 1.218 ± 0.658
1.949ArgAsp: 1.949 ± 0.867
2.924ArgGlu: 2.924 ± 0.966
2.924ArgPhe: 2.924 ± 0.623
4.63ArgGly: 4.63 ± 1.022
1.462ArgHis: 1.462 ± 0.668
2.437ArgIle: 2.437 ± 0.901
4.63ArgLys: 4.63 ± 1.192
4.873ArgLeu: 4.873 ± 0.573
2.193ArgMet: 2.193 ± 0.415
2.924ArgAsn: 2.924 ± 0.998
0.975ArgPro: 0.975 ± 0.303
2.193ArgGln: 2.193 ± 0.759
2.68ArgArg: 2.68 ± 1.253
5.117ArgSer: 5.117 ± 1.222
3.899ArgThr: 3.899 ± 0.411
3.411ArgVal: 3.411 ± 1.334
0.731ArgTrp: 0.731 ± 0.439
2.68ArgTyr: 2.68 ± 0.401
0.0ArgXaa: 0.0 ± 0.0
Ser
4.142SerAla: 4.142 ± 1.018
2.924SerCys: 2.924 ± 1.003
4.386SerAsp: 4.386 ± 1.873
6.335SerGlu: 6.335 ± 1.74
4.386SerPhe: 4.386 ± 0.542
8.041SerGly: 8.041 ± 2.205
2.437SerHis: 2.437 ± 0.287
7.066SerIle: 7.066 ± 1.658
6.823SerLys: 6.823 ± 1.969
10.478SerLeu: 10.478 ± 1.761
2.437SerMet: 2.437 ± 0.355
2.68SerAsn: 2.68 ± 0.933
3.899SerPro: 3.899 ± 0.816
2.437SerGln: 2.437 ± 0.508
8.528SerArg: 8.528 ± 1.224
8.285SerSer: 8.285 ± 1.444
4.873SerThr: 4.873 ± 0.52
4.873SerVal: 4.873 ± 1.324
2.437SerTrp: 2.437 ± 0.724
4.142SerTyr: 4.142 ± 1.279
0.0SerXaa: 0.0 ± 0.0
Thr
1.462ThrAla: 1.462 ± 1.198
0.731ThrCys: 0.731 ± 0.337
1.706ThrAsp: 1.706 ± 0.373
2.437ThrGlu: 2.437 ± 1.582
1.218ThrPhe: 1.218 ± 0.336
3.899ThrGly: 3.899 ± 0.152
1.218ThrHis: 1.218 ± 0.552
2.437ThrIle: 2.437 ± 1.005
1.706ThrLys: 1.706 ± 0.616
5.361ThrLeu: 5.361 ± 1.339
1.706ThrMet: 1.706 ± 0.741
2.68ThrAsn: 2.68 ± 0.401
1.462ThrPro: 1.462 ± 0.679
2.437ThrGln: 2.437 ± 0.624
2.193ThrArg: 2.193 ± 1.016
4.873ThrSer: 4.873 ± 0.697
5.361ThrThr: 5.361 ± 2.377
3.655ThrVal: 3.655 ± 0.991
1.462ThrTrp: 1.462 ± 0.445
2.437ThrTyr: 2.437 ± 0.818
0.0ThrXaa: 0.0 ± 0.0
Val
1.218ValAla: 1.218 ± 0.497
0.731ValCys: 0.731 ± 0.609
4.63ValAsp: 4.63 ± 0.69
4.142ValGlu: 4.142 ± 1.713
3.411ValPhe: 3.411 ± 1.23
4.386ValGly: 4.386 ± 1.186
1.218ValHis: 1.218 ± 0.475
3.411ValIle: 3.411 ± 1.065
3.899ValLys: 3.899 ± 1.152
3.655ValLeu: 3.655 ± 1.637
0.487ValMet: 0.487 ± 0.402
2.193ValAsn: 2.193 ± 0.34
2.924ValPro: 2.924 ± 0.308
1.218ValGln: 1.218 ± 0.732
2.193ValArg: 2.193 ± 0.779
6.823ValSer: 6.823 ± 0.906
2.68ValThr: 2.68 ± 0.915
1.706ValVal: 1.706 ± 0.496
0.244ValTrp: 0.244 ± 0.146
1.949ValTyr: 1.949 ± 0.713
0.0ValXaa: 0.0 ± 0.0
Trp
0.731TrpAla: 0.731 ± 0.462
0.487TrpCys: 0.487 ± 0.542
0.731TrpAsp: 0.731 ± 0.453
1.218TrpGlu: 1.218 ± 0.496
0.244TrpPhe: 0.244 ± 0.146
0.975TrpGly: 0.975 ± 0.586
0.487TrpHis: 0.487 ± 0.293
1.462TrpIle: 1.462 ± 0.878
0.487TrpLys: 0.487 ± 0.247
1.462TrpLeu: 1.462 ± 0.445
0.244TrpMet: 0.244 ± 0.303
0.731TrpAsn: 0.731 ± 0.439
0.244TrpPro: 0.244 ± 0.146
0.0TrpGln: 0.0 ± 0.0
0.487TrpArg: 0.487 ± 0.293
3.168TrpSer: 3.168 ± 1.089
0.731TrpThr: 0.731 ± 0.339
1.462TrpVal: 1.462 ± 0.855
0.0TrpTrp: 0.0 ± 0.0
0.244TrpTyr: 0.244 ± 0.146
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.731TyrAla: 0.731 ± 0.27
0.244TyrCys: 0.244 ± 0.146
2.437TyrAsp: 2.437 ± 0.818
1.706TyrGlu: 1.706 ± 0.815
1.462TyrPhe: 1.462 ± 0.678
0.975TyrGly: 0.975 ± 0.434
0.244TyrHis: 0.244 ± 0.303
1.706TyrIle: 1.706 ± 0.815
3.655TyrLys: 3.655 ± 0.891
4.873TyrLeu: 4.873 ± 1.098
1.706TyrMet: 1.706 ± 0.55
2.68TyrAsn: 2.68 ± 0.522
1.218TyrPro: 1.218 ± 0.336
1.218TyrGln: 1.218 ± 0.732
0.975TyrArg: 0.975 ± 0.434
4.386TyrSer: 4.386 ± 1.534
1.462TyrThr: 1.462 ± 0.773
1.706TyrVal: 1.706 ± 0.74
0.0TyrTrp: 0.0 ± 0.0
0.487TyrTyr: 0.487 ± 0.293
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (4105 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski