Amino acid dipeptide frequency for European bat lyssavirus 2 (strain Human/Scotland/RV1333/2002) (EBLV2)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
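The counting described above can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: the function name `dipeptide_permille`, the pooling of counts over all proteins, and the normalization by the total number of dipeptides are assumptions, and the input sequences are toy strings rather than EBLV2 proteins.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption: counts are pooled and divided by the total number of
    dipeptides; the database may normalize differently.
    """
    counts = Counter()
    for seq in proteins:
        # Map any non-standard or unknown residue to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences (hypothetical input).
freqs = dipeptide_permille(["MKKLA", "AKK"])

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
uniform = 1000.0 / 400
overrepresented = sorted(p for p, f in freqs.items() if f > uniform)
underrepresented = sorted(p for p, f in freqs.items() if f < uniform)
```

With such short toy input every observed pair exceeds the uniform expectation; on a real proteome the two lists would correspond to the red and blue cells of the table.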

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
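As a small worked example of the unit conversion, the snippet below turns one table value (AlaLeu, 7.377‰) into a fraction and compares it with the uniform expectation of 2.5‰. The helper name `permille_to_fraction` is hypothetical and only used for illustration.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_leu = permille_to_fraction(7.377)   # AlaLeu value taken from the table
uniform = permille_to_fraction(2.5)     # 1/400, the uniform expectation

# Ratio above 1 means overrepresented, below 1 means underrepresented.
ratio = ala_leu / uniform
print(f"AlaLeu: fraction {ala_leu:.6f}, {ratio:.2f} times the uniform expectation")
```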

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.383 ± 0.738
AlaCys: 1.153 ± 0.78
AlaAsp: 1.844 ± 1.155
AlaGlu: 5.994 ± 2.519
AlaPhe: 1.153 ± 0.517
AlaGly: 2.075 ± 0.564
AlaHis: 2.075 ± 1.039
AlaIle: 3.227 ± 1.065
AlaLys: 2.766 ± 0.379
AlaLeu: 7.377 ± 0.632
AlaMet: 0.692 ± 0.335
AlaAsn: 1.153 ± 0.481
AlaPro: 1.614 ± 0.716
AlaGln: 3.227 ± 0.961
AlaArg: 2.766 ± 0.934
AlaSer: 2.536 ± 0.477
AlaThr: 1.153 ± 0.499
AlaVal: 3.458 ± 1.467
AlaTrp: 0.231 ± 0.144
AlaTyr: 2.536 ± 1.046
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.461 ± 0.537
CysCys: 0.461 ± 0.253
CysAsp: 0.692 ± 0.3
CysGlu: 0.0 ± 0.0
CysPhe: 0.231 ± 0.144
CysGly: 0.692 ± 0.517
CysHis: 0.231 ± 0.341
CysIle: 1.614 ± 0.758
CysLys: 0.461 ± 0.253
CysLeu: 2.305 ± 0.806
CysMet: 0.0 ± 0.0
CysAsn: 0.231 ± 0.144
CysPro: 0.922 ± 0.312
CysGln: 0.692 ± 0.36
CysArg: 0.692 ± 0.606
CysSer: 2.766 ± 0.731
CysThr: 0.461 ± 0.565
CysVal: 0.692 ± 0.334
CysTrp: 0.231 ± 0.144
CysTyr: 0.692 ± 0.3
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.458 ± 0.874
AspCys: 0.231 ± 0.268
AspAsp: 5.994 ± 2.062
AspGlu: 3.919 ± 1.802
AspPhe: 3.227 ± 0.775
AspGly: 3.919 ± 0.84
AspHis: 0.692 ± 0.433
AspIle: 3.227 ± 0.538
AspLys: 3.688 ± 1.073
AspLeu: 6.224 ± 1.31
AspMet: 1.383 ± 0.683
AspAsn: 3.227 ± 0.964
AspPro: 4.61 ± 0.528
AspGln: 2.536 ± 0.834
AspArg: 2.536 ± 0.847
AspSer: 2.305 ± 0.96
AspThr: 0.692 ± 0.433
AspVal: 2.997 ± 0.988
AspTrp: 0.922 ± 0.371
AspTyr: 2.305 ± 1.2
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.38 ± 1.328
GluCys: 0.461 ± 0.253
GluAsp: 6.224 ± 2.581
GluGlu: 7.607 ± 3.042
GluPhe: 1.383 ± 0.514
GluGly: 4.841 ± 0.648
GluHis: 1.153 ± 1.14
GluIle: 4.61 ± 1.192
GluLys: 3.227 ± 1.01
GluLeu: 3.458 ± 0.965
GluMet: 2.075 ± 0.466
GluAsn: 1.844 ± 0.497
GluPro: 1.844 ± 0.454
GluGln: 1.153 ± 0.856
GluArg: 2.997 ± 1.189
GluSer: 8.068 ± 2.148
GluThr: 1.844 ± 0.773
GluVal: 2.997 ± 0.394
GluTrp: 1.614 ± 0.817
GluTyr: 0.922 ± 0.685
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.692 ± 0.272
PheCys: 0.461 ± 0.403
PheAsp: 1.614 ± 0.81
PheGlu: 2.536 ± 1.401
PhePhe: 3.919 ± 0.668
PheGly: 1.383 ± 0.44
PheHis: 1.844 ± 0.675
PheIle: 1.383 ± 0.485
PheLys: 4.38 ± 0.94
PheLeu: 4.61 ± 0.252
PheMet: 0.692 ± 0.433
PheAsn: 2.766 ± 0.281
PhePro: 3.919 ± 0.925
PheGln: 2.536 ± 0.487
PheArg: 3.688 ± 0.482
PheSer: 4.149 ± 0.926
PheThr: 1.844 ± 0.667
PheVal: 2.536 ± 0.738
PheTrp: 0.231 ± 0.144
PheTyr: 1.153 ± 0.504
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.844 ± 0.795
GlyCys: 1.153 ± 0.484
GlyAsp: 2.766 ± 1.217
GlyGlu: 2.766 ± 1.313
GlyPhe: 2.536 ± 0.81
GlyGly: 5.533 ± 0.728
GlyHis: 0.922 ± 0.312
GlyIle: 4.149 ± 1.464
GlyLys: 5.533 ± 2.329
GlyLeu: 6.455 ± 1.78
GlyMet: 0.692 ± 0.334
GlyAsn: 2.305 ± 0.795
GlyPro: 2.997 ± 0.701
GlyGln: 1.383 ± 0.738
GlyArg: 2.075 ± 0.672
GlySer: 4.38 ± 0.722
GlyThr: 2.536 ± 1.154
GlyVal: 4.841 ± 2.376
GlyTrp: 0.231 ± 0.144
GlyTyr: 2.305 ± 0.583
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.692 ± 0.272
HisCys: 0.0 ± 0.0
HisAsp: 1.153 ± 0.763
HisGlu: 0.922 ± 0.457
HisPhe: 1.844 ± 0.901
HisGly: 0.692 ± 0.433
HisHis: 0.692 ± 0.652
HisIle: 2.075 ± 0.608
HisLys: 0.922 ± 0.457
HisLeu: 2.536 ± 0.955
HisMet: 0.0 ± 0.0
HisAsn: 0.692 ± 0.652
HisPro: 1.383 ± 0.647
HisGln: 1.383 ± 0.738
HisArg: 0.692 ± 0.433
HisSer: 1.844 ± 0.675
HisThr: 0.231 ± 0.268
HisVal: 1.383 ± 0.44
HisTrp: 0.922 ± 0.397
HisTyr: 0.692 ± 0.3
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.458 ± 1.388
IleCys: 1.153 ± 0.493
IleAsp: 2.766 ± 1.294
IleGlu: 2.997 ± 0.712
IlePhe: 3.458 ± 0.927
IleGly: 1.614 ± 0.57
IleHis: 2.075 ± 0.65
IleIle: 4.149 ± 0.852
IleLys: 3.227 ± 0.858
IleLeu: 7.377 ± 1.903
IleMet: 1.614 ± 0.761
IleAsn: 2.305 ± 0.58
IlePro: 5.071 ± 1.26
IleGln: 2.766 ± 0.417
IleArg: 3.919 ± 0.942
IleSer: 6.916 ± 1.136
IleThr: 3.458 ± 1.021
IleVal: 3.919 ± 0.451
IleTrp: 2.075 ± 0.62
IleTyr: 2.075 ± 1.06
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.383 ± 0.521
LysCys: 0.692 ± 0.3
LysAsp: 3.227 ± 1.102
LysGlu: 4.61 ± 1.481
LysPhe: 3.919 ± 1.937
LysGly: 2.075 ± 0.822
LysHis: 0.692 ± 0.3
LysIle: 7.146 ± 2.692
LysLys: 7.377 ± 2.146
LysLeu: 5.763 ± 0.865
LysMet: 2.075 ± 0.479
LysAsn: 1.383 ± 0.573
LysPro: 3.458 ± 0.333
LysGln: 0.922 ± 0.397
LysArg: 2.997 ± 1.271
LysSer: 5.302 ± 0.809
LysThr: 4.61 ± 1.398
LysVal: 4.841 ± 1.185
LysTrp: 0.692 ± 0.3
LysTyr: 1.383 ± 0.738
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.377 ± 0.72
LeuCys: 2.075 ± 0.781
LeuAsp: 5.994 ± 1.519
LeuGlu: 6.455 ± 1.162
LeuPhe: 3.919 ± 1.107
LeuGly: 6.916 ± 1.273
LeuHis: 0.922 ± 0.397
LeuIle: 6.455 ± 2.47
LeuLys: 5.994 ± 1.271
LeuLeu: 7.607 ± 1.999
LeuMet: 3.688 ± 1.406
LeuAsn: 3.919 ± 0.487
LeuPro: 3.919 ± 0.653
LeuGln: 3.458 ± 1.366
LeuArg: 6.685 ± 1.899
LeuSer: 8.529 ± 3.115
LeuThr: 3.227 ± 1.039
LeuVal: 8.068 ± 0.992
LeuTrp: 1.383 ± 0.719
LeuTyr: 3.919 ± 1.238
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.536 ± 1.195
MetCys: 0.461 ± 0.229
MetAsp: 1.153 ± 0.53
MetGlu: 0.692 ± 0.652
MetPhe: 1.383 ± 0.544
MetGly: 0.231 ± 0.268
MetHis: 0.231 ± 0.283
MetIle: 1.153 ± 0.517
MetLys: 1.153 ± 0.688
MetLeu: 2.075 ± 1.06
MetMet: 0.231 ± 0.144
MetAsn: 2.075 ± 1.299
MetPro: 1.383 ± 0.943
MetGln: 1.383 ± 0.7
MetArg: 1.383 ± 0.586
MetSer: 3.919 ± 1.198
MetThr: 1.844 ± 0.675
MetVal: 0.922 ± 0.371
MetTrp: 0.0 ± 0.0
MetTyr: 0.231 ± 0.144
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.305 ± 0.84
AsnCys: 0.692 ± 0.433
AsnAsp: 2.075 ± 0.547
AsnGlu: 0.231 ± 0.144
AsnPhe: 3.227 ± 0.758
AsnGly: 2.075 ± 1.095
AsnHis: 1.153 ± 0.53
AsnIle: 3.458 ± 0.827
AsnLys: 1.614 ± 0.373
AsnLeu: 4.841 ± 1.186
AsnMet: 0.692 ± 0.645
AsnAsn: 0.922 ± 0.397
AsnPro: 2.766 ± 1.024
AsnGln: 1.153 ± 0.665
AsnArg: 2.536 ± 0.477
AsnSer: 5.533 ± 1.008
AsnThr: 1.614 ± 0.645
AsnVal: 0.922 ± 0.371
AsnTrp: 0.922 ± 0.575
AsnTyr: 1.383 ± 0.625
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.997 ± 1.316
ProCys: 0.0 ± 0.0
ProAsp: 3.227 ± 1.301
ProGlu: 4.841 ± 0.782
ProPhe: 0.692 ± 0.652
ProGly: 2.075 ± 0.744
ProHis: 0.922 ± 0.552
ProIle: 2.766 ± 0.608
ProLys: 1.614 ± 1.011
ProLeu: 6.916 ± 1.429
ProMet: 1.844 ± 0.529
ProAsn: 1.844 ± 0.879
ProPro: 2.766 ± 1.141
ProGln: 2.075 ± 0.547
ProArg: 1.614 ± 0.551
ProSer: 7.146 ± 1.01
ProThr: 2.075 ± 1.04
ProVal: 1.614 ± 0.782
ProTrp: 0.231 ± 0.283
ProTyr: 2.305 ± 0.392
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.997 ± 1.951
GlnCys: 0.231 ± 0.341
GlnAsp: 2.766 ± 1.211
GlnGlu: 2.305 ± 0.614
GlnPhe: 1.614 ± 0.502
GlnGly: 1.614 ± 0.52
GlnHis: 0.922 ± 0.397
GlnIle: 3.688 ± 1.046
GlnLys: 1.614 ± 0.57
GlnLeu: 4.841 ± 1.848
GlnMet: 1.383 ± 0.606
GlnAsn: 0.922 ± 0.312
GlnPro: 0.231 ± 0.144
GlnGln: 1.614 ± 0.836
GlnArg: 2.305 ± 0.962
GlnSer: 4.149 ± 1.114
GlnThr: 2.536 ± 0.68
GlnVal: 2.075 ± 0.806
GlnTrp: 0.461 ± 0.319
GlnTyr: 0.231 ± 0.268
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.766 ± 0.741
ArgCys: 1.383 ± 0.7
ArgAsp: 2.766 ± 0.969
ArgGlu: 2.305 ± 0.806
ArgPhe: 2.766 ± 0.49
ArgGly: 3.458 ± 1.408
ArgHis: 1.153 ± 0.556
ArgIle: 2.075 ± 0.729
ArgLys: 2.997 ± 0.982
ArgLeu: 5.302 ± 0.574
ArgMet: 1.614 ± 0.677
ArgAsn: 2.075 ± 0.997
ArgPro: 2.305 ± 0.464
ArgGln: 2.536 ± 0.565
ArgArg: 2.305 ± 0.587
ArgSer: 4.61 ± 1.051
ArgThr: 2.997 ± 0.903
ArgVal: 2.997 ± 1.105
ArgTrp: 0.922 ± 0.578
ArgTyr: 2.766 ± 0.379
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.38 ± 1.096
SerCys: 1.153 ± 0.537
SerAsp: 5.994 ± 0.847
SerGlu: 5.763 ± 1.657
SerPhe: 4.149 ± 0.561
SerGly: 6.685 ± 1.243
SerHis: 1.614 ± 0.551
SerIle: 4.841 ± 1.726
SerLys: 7.607 ± 1.872
SerLeu: 7.838 ± 3.324
SerMet: 1.153 ± 0.531
SerAsn: 2.305 ± 0.986
SerPro: 4.841 ± 1.073
SerGln: 4.841 ± 2.079
SerArg: 5.994 ± 1.938
SerSer: 8.529 ± 1.375
SerThr: 6.916 ± 1.091
SerVal: 5.994 ± 0.712
SerTrp: 2.305 ± 0.987
SerTyr: 4.61 ± 1.272
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.305 ± 1.157
ThrCys: 0.922 ± 0.312
ThrAsp: 2.766 ± 0.395
ThrGlu: 1.614 ± 0.836
ThrPhe: 1.153 ± 0.353
ThrGly: 4.61 ± 0.682
ThrHis: 0.922 ± 0.371
ThrIle: 3.227 ± 1.204
ThrLys: 2.766 ± 0.694
ThrLeu: 4.38 ± 1.244
ThrMet: 1.383 ± 0.867
ThrAsn: 2.305 ± 0.612
ThrPro: 1.844 ± 0.757
ThrGln: 2.766 ± 0.702
ThrArg: 2.305 ± 0.986
ThrSer: 2.536 ± 0.914
ThrThr: 4.38 ± 1.992
ThrVal: 3.688 ± 0.999
ThrTrp: 1.153 ± 0.556
ThrTyr: 2.075 ± 0.667
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.614 ± 0.513
ValCys: 0.922 ± 0.361
ValAsp: 2.766 ± 0.668
ValGlu: 4.38 ± 1.663
ValPhe: 3.919 ± 1.212
ValGly: 5.071 ± 1.408
ValHis: 2.075 ± 0.504
ValIle: 3.458 ± 0.873
ValLys: 2.997 ± 1.013
ValLeu: 3.919 ± 1.553
ValMet: 0.461 ± 0.289
ValAsn: 5.763 ± 1.598
ValPro: 2.075 ± 0.581
ValGln: 1.153 ± 0.517
ValArg: 3.458 ± 1.519
ValSer: 6.455 ± 1.621
ValThr: 4.61 ± 1.062
ValVal: 3.227 ± 0.315
ValTrp: 0.231 ± 0.144
ValTyr: 1.844 ± 0.724
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.922 ± 0.361
TrpCys: 0.692 ± 0.334
TrpAsp: 0.231 ± 0.341
TrpGlu: 1.383 ± 0.44
TrpPhe: 0.231 ± 0.144
TrpGly: 0.922 ± 0.578
TrpHis: 0.461 ± 0.289
TrpIle: 1.153 ± 0.722
TrpLys: 0.922 ± 0.507
TrpLeu: 1.383 ± 0.457
TrpMet: 0.231 ± 0.283
TrpAsn: 0.922 ± 0.578
TrpPro: 0.231 ± 0.144
TrpGln: 0.0 ± 0.0
TrpArg: 0.231 ± 0.144
TrpSer: 3.227 ± 1.391
TrpThr: 0.461 ± 0.253
TrpVal: 0.922 ± 0.319
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.231 ± 0.144
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.461 ± 0.289
TyrCys: 0.231 ± 0.144
TyrAsp: 2.075 ± 0.806
TyrGlu: 1.383 ± 0.485
TyrPhe: 1.614 ± 0.513
TyrGly: 0.922 ± 0.371
TyrHis: 0.0 ± 0.0
TyrIle: 2.536 ± 0.441
TyrLys: 3.688 ± 0.855
TyrLeu: 5.302 ± 1.007
TyrMet: 2.305 ± 1.15
TyrAsn: 1.153 ± 0.722
TyrPro: 1.383 ± 0.521
TyrGln: 0.922 ± 0.578
TyrArg: 0.922 ± 0.371
TyrSer: 4.61 ± 1.158
TyrThr: 1.614 ± 1.061
TyrVal: 2.305 ± 0.634
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.461 ± 0.289
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (4339 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
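A protein-level bootstrap of this kind can be sketched as below. This is an illustration under stated assumptions, not the database's code: whole proteins are resampled with replacement, the dipeptide frequency is recomputed on each resampled set, and the standard deviation of the 100 estimates is reported as the error. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille, pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return stdev(estimates)


# Toy sequences (hypothetical input, not EBLV2 proteins).
toy = ["MKKLA", "AKKA", "LLKK"]
print(pair_permille(toy, "KK"), "±", bootstrap_error(toy, "KK"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so a dipeptide that never occurs in any protein gets a frequency and an error of exactly zero in every replicate, as in the 0.0 ± 0.0 entries of the table.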

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski