Amino acid dipepetide frequency for Wuhan Louse Fly Virus 5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.39AlaAla: 1.39 ± 1.109
2.502AlaCys: 2.502 ± 0.587
1.668AlaAsp: 1.668 ± 0.607
2.502AlaGlu: 2.502 ± 1.376
1.946AlaPhe: 1.946 ± 0.567
1.668AlaGly: 1.668 ± 1.093
0.834AlaHis: 0.834 ± 0.348
2.224AlaIle: 2.224 ± 0.568
3.614AlaLys: 3.614 ± 0.809
2.502AlaLeu: 2.502 ± 0.611
0.278AlaMet: 0.278 ± 0.421
1.668AlaAsn: 1.668 ± 0.367
1.39AlaPro: 1.39 ± 0.822
1.946AlaGln: 1.946 ± 0.837
1.946AlaArg: 1.946 ± 0.546
2.224AlaSer: 2.224 ± 0.537
2.502AlaThr: 2.502 ± 1.043
1.668AlaVal: 1.668 ± 0.607
0.556AlaTrp: 0.556 ± 0.55
2.78AlaTyr: 2.78 ± 0.801
0.0AlaXaa: 0.0 ± 0.0
Cys
0.556CysAla: 0.556 ± 0.537
0.834CysCys: 0.834 ± 0.325
0.556CysAsp: 0.556 ± 0.29
1.39CysGlu: 1.39 ± 0.508
0.834CysPhe: 0.834 ± 0.587
1.112CysGly: 1.112 ± 0.58
1.39CysHis: 1.39 ± 0.576
1.946CysIle: 1.946 ± 0.721
2.224CysLys: 2.224 ± 0.651
2.224CysLeu: 2.224 ± 1.062
0.278CysMet: 0.278 ± 0.157
1.112CysAsn: 1.112 ± 0.402
1.112CysPro: 1.112 ± 0.575
0.834CysGln: 0.834 ± 0.397
0.278CysArg: 0.278 ± 0.157
0.834CysSer: 0.834 ± 0.325
0.834CysThr: 0.834 ± 0.47
0.834CysVal: 0.834 ± 0.627
0.278CysTrp: 0.278 ± 0.157
0.556CysTyr: 0.556 ± 0.515
0.0CysXaa: 0.0 ± 0.0
Asp
2.224AspAla: 2.224 ± 1.581
1.39AspCys: 1.39 ± 0.873
3.614AspAsp: 3.614 ± 1.942
6.672AspGlu: 6.672 ± 1.126
2.502AspPhe: 2.502 ± 1.411
2.502AspGly: 2.502 ± 0.894
2.224AspHis: 2.224 ± 1.536
2.502AspIle: 2.502 ± 0.538
5.838AspLys: 5.838 ± 2.145
7.228AspLeu: 7.228 ± 0.553
1.39AspMet: 1.39 ± 0.287
2.78AspAsn: 2.78 ± 0.721
2.502AspPro: 2.502 ± 0.739
2.78AspGln: 2.78 ± 0.435
1.112AspArg: 1.112 ± 0.627
3.058AspSer: 3.058 ± 0.862
2.502AspThr: 2.502 ± 0.715
3.614AspVal: 3.614 ± 2.209
1.668AspTrp: 1.668 ± 0.671
3.614AspTyr: 3.614 ± 1.04
0.0AspXaa: 0.0 ± 0.0
Glu
3.336GluAla: 3.336 ± 0.651
1.668GluCys: 1.668 ± 1.209
3.058GluAsp: 3.058 ± 1.329
5.838GluGlu: 5.838 ± 3.328
3.058GluPhe: 3.058 ± 1.39
2.502GluGly: 2.502 ± 0.791
1.39GluHis: 1.39 ± 0.531
9.174GluIle: 9.174 ± 1.973
6.116GluLys: 6.116 ± 1.294
6.672GluLeu: 6.672 ± 1.101
2.502GluMet: 2.502 ± 0.609
3.892GluAsn: 3.892 ± 0.883
1.39GluPro: 1.39 ± 0.587
2.224GluGln: 2.224 ± 0.796
1.946GluArg: 1.946 ± 0.774
5.004GluSer: 5.004 ± 1.41
5.004GluThr: 5.004 ± 0.832
3.614GluVal: 3.614 ± 0.763
1.668GluTrp: 1.668 ± 1.45
2.502GluTyr: 2.502 ± 0.715
0.0GluXaa: 0.0 ± 0.0
Phe
0.278PheAla: 0.278 ± 0.157
1.39PheCys: 1.39 ± 1.005
2.78PheAsp: 2.78 ± 0.885
1.668PheGlu: 1.668 ± 0.644
2.224PhePhe: 2.224 ± 0.935
2.224PheGly: 2.224 ± 1.071
0.278PheHis: 0.278 ± 0.157
1.946PheIle: 1.946 ± 0.701
5.838PheLys: 5.838 ± 1.381
5.004PheLeu: 5.004 ± 0.796
0.834PheMet: 0.834 ± 0.348
2.78PheAsn: 2.78 ± 0.431
2.502PhePro: 2.502 ± 0.917
1.668PheGln: 1.668 ± 0.335
3.058PheArg: 3.058 ± 1.243
3.892PheSer: 3.892 ± 1.17
1.112PheThr: 1.112 ± 0.627
1.112PheVal: 1.112 ± 0.586
0.278PheTrp: 0.278 ± 0.157
0.556PheTyr: 0.556 ± 0.69
0.0PheXaa: 0.0 ± 0.0
Gly
1.668GlyAla: 1.668 ± 0.941
0.0GlyCys: 0.0 ± 0.0
4.17GlyAsp: 4.17 ± 1.588
3.336GlyGlu: 3.336 ± 1.168
2.78GlyPhe: 2.78 ± 0.654
3.614GlyGly: 3.614 ± 1.076
0.278GlyHis: 0.278 ± 0.157
5.282GlyIle: 5.282 ± 1.72
5.282GlyLys: 5.282 ± 0.518
6.95GlyLeu: 6.95 ± 2.014
1.39GlyMet: 1.39 ± 0.608
3.892GlyAsn: 3.892 ± 1.11
0.834GlyPro: 0.834 ± 0.47
0.834GlyGln: 0.834 ± 0.47
1.668GlyArg: 1.668 ± 0.497
4.448GlySer: 4.448 ± 1.237
3.614GlyThr: 3.614 ± 2.24
2.502GlyVal: 2.502 ± 1.518
0.556GlyTrp: 0.556 ± 0.314
2.224GlyTyr: 2.224 ± 0.476
0.0GlyXaa: 0.0 ± 0.0
His
1.946HisAla: 1.946 ± 1.118
0.0HisCys: 0.0 ± 0.0
1.112HisAsp: 1.112 ± 0.58
1.39HisGlu: 1.39 ± 0.624
0.834HisPhe: 0.834 ± 0.47
0.834HisGly: 0.834 ± 0.47
0.278HisHis: 0.278 ± 0.353
2.224HisIle: 2.224 ± 0.428
1.946HisLys: 1.946 ± 0.567
1.39HisLeu: 1.39 ± 0.511
0.0HisMet: 0.0 ± 0.0
1.946HisAsn: 1.946 ± 0.647
1.668HisPro: 1.668 ± 0.335
1.112HisGln: 1.112 ± 0.402
0.556HisArg: 0.556 ± 0.314
1.668HisSer: 1.668 ± 1.253
1.39HisThr: 1.39 ± 0.508
0.834HisVal: 0.834 ± 0.587
0.834HisTrp: 0.834 ± 0.546
1.112HisTyr: 1.112 ± 0.388
0.0HisXaa: 0.0 ± 0.0
Ile
4.448IleAla: 4.448 ± 0.952
1.668IleCys: 1.668 ± 0.588
4.726IleAsp: 4.726 ± 2.196
5.838IleGlu: 5.838 ± 1.001
2.78IlePhe: 2.78 ± 1.153
6.672IleGly: 6.672 ± 1.642
2.502IleHis: 2.502 ± 1.103
7.784IleIle: 7.784 ± 2.257
8.062IleLys: 8.062 ± 2.322
7.506IleLeu: 7.506 ± 0.507
2.224IleMet: 2.224 ± 0.937
5.838IleAsn: 5.838 ± 1.178
4.17IlePro: 4.17 ± 0.527
3.892IleGln: 3.892 ± 0.887
3.336IleArg: 3.336 ± 0.711
5.282IleSer: 5.282 ± 1.215
5.282IleThr: 5.282 ± 1.398
2.78IleVal: 2.78 ± 0.634
0.556IleTrp: 0.556 ± 0.29
2.78IleTyr: 2.78 ± 1.033
0.0IleXaa: 0.0 ± 0.0
Lys
2.224LysAla: 2.224 ± 0.534
1.112LysCys: 1.112 ± 0.388
6.116LysAsp: 6.116 ± 0.714
6.394LysGlu: 6.394 ± 1.647
3.892LysPhe: 3.892 ± 1.469
3.892LysGly: 3.892 ± 1.086
1.39LysHis: 1.39 ± 0.587
9.73LysIle: 9.73 ± 2.181
7.784LysLys: 7.784 ± 1.421
8.34LysLeu: 8.34 ± 1.645
1.668LysMet: 1.668 ± 0.868
5.282LysAsn: 5.282 ± 1.648
2.502LysPro: 2.502 ± 0.899
3.336LysGln: 3.336 ± 0.902
4.17LysArg: 4.17 ± 0.786
3.614LysSer: 3.614 ± 1.061
4.17LysThr: 4.17 ± 0.924
4.448LysVal: 4.448 ± 1.273
2.224LysTrp: 2.224 ± 0.565
3.336LysTyr: 3.336 ± 0.785
0.0LysXaa: 0.0 ± 0.0
Leu
2.78LeuAla: 2.78 ± 1.084
2.502LeuCys: 2.502 ± 0.584
5.56LeuAsp: 5.56 ± 0.857
7.506LeuGlu: 7.506 ± 1.576
4.726LeuPhe: 4.726 ± 0.758
5.838LeuGly: 5.838 ± 0.73
2.502LeuHis: 2.502 ± 0.703
9.174LeuIle: 9.174 ± 2.532
6.672LeuLys: 6.672 ± 1.509
7.228LeuLeu: 7.228 ± 2.935
2.224LeuMet: 2.224 ± 1.254
7.784LeuAsn: 7.784 ± 1.728
2.78LeuPro: 2.78 ± 1.286
2.78LeuGln: 2.78 ± 0.784
6.95LeuArg: 6.95 ± 1.173
8.34LeuSer: 8.34 ± 1.456
6.116LeuThr: 6.116 ± 2.141
2.224LeuVal: 2.224 ± 0.605
0.556LeuTrp: 0.556 ± 0.353
4.448LeuTyr: 4.448 ± 1.362
0.0LeuXaa: 0.0 ± 0.0
Met
1.39MetAla: 1.39 ± 1.109
0.834MetCys: 0.834 ± 0.47
1.39MetAsp: 1.39 ± 0.976
0.556MetGlu: 0.556 ± 0.314
0.834MetPhe: 0.834 ± 0.47
1.668MetGly: 1.668 ± 0.835
0.278MetHis: 0.278 ± 0.157
2.78MetIle: 2.78 ± 0.885
0.278MetLys: 0.278 ± 0.157
2.224MetLeu: 2.224 ± 0.543
0.556MetMet: 0.556 ± 0.314
1.668MetAsn: 1.668 ± 0.763
0.278MetPro: 0.278 ± 0.596
0.556MetGln: 0.556 ± 0.353
1.112MetArg: 1.112 ± 0.597
2.224MetSer: 2.224 ± 0.331
0.834MetThr: 0.834 ± 0.761
0.556MetVal: 0.556 ± 0.315
0.278MetTrp: 0.278 ± 0.157
1.112MetTyr: 1.112 ± 0.513
0.0MetXaa: 0.0 ± 0.0
Asn
2.224AsnAla: 2.224 ± 1.411
0.278AsnCys: 0.278 ± 0.596
3.336AsnAsp: 3.336 ± 1.429
4.17AsnGlu: 4.17 ± 1.312
2.78AsnPhe: 2.78 ± 1.133
2.78AsnGly: 2.78 ± 1.069
2.502AsnHis: 2.502 ± 0.771
5.282AsnIle: 5.282 ± 0.518
3.892AsnLys: 3.892 ± 0.867
8.34AsnLeu: 8.34 ± 1.534
1.112AsnMet: 1.112 ± 1.112
2.502AsnAsn: 2.502 ± 0.715
2.224AsnPro: 2.224 ± 1.134
2.502AsnGln: 2.502 ± 0.588
1.668AsnArg: 1.668 ± 0.944
5.56AsnSer: 5.56 ± 1.451
2.224AsnThr: 2.224 ± 0.565
3.614AsnVal: 3.614 ± 0.528
1.668AsnTrp: 1.668 ± 0.609
2.502AsnTyr: 2.502 ± 1.085
0.0AsnXaa: 0.0 ± 0.0
Pro
1.39ProAla: 1.39 ± 0.901
0.0ProCys: 0.0 ± 0.0
4.17ProAsp: 4.17 ± 1.096
1.39ProGlu: 1.39 ± 0.901
1.112ProPhe: 1.112 ± 0.319
1.668ProGly: 1.668 ± 1.069
1.112ProHis: 1.112 ± 0.586
3.892ProIle: 3.892 ± 1.052
2.78ProLys: 2.78 ± 0.675
3.336ProLeu: 3.336 ± 0.925
0.556ProMet: 0.556 ± 0.353
1.112ProAsn: 1.112 ± 0.513
1.668ProPro: 1.668 ± 0.68
1.39ProGln: 1.39 ± 1.232
0.834ProArg: 0.834 ± 0.325
3.058ProSer: 3.058 ± 0.733
2.78ProThr: 2.78 ± 1.022
1.946ProVal: 1.946 ± 0.569
0.834ProTrp: 0.834 ± 0.558
1.39ProTyr: 1.39 ± 0.287
0.0ProXaa: 0.0 ± 0.0
Gln
1.39GlnAla: 1.39 ± 1.546
1.112GlnCys: 1.112 ± 0.731
2.224GlnAsp: 2.224 ± 1.536
3.614GlnGlu: 3.614 ± 0.306
0.834GlnPhe: 0.834 ± 0.348
2.224GlnGly: 2.224 ± 0.444
0.556GlnHis: 0.556 ± 0.842
3.892GlnIle: 3.892 ± 1.286
2.502GlnLys: 2.502 ± 0.557
3.614GlnLeu: 3.614 ± 1.111
0.834GlnMet: 0.834 ± 0.348
3.336GlnAsn: 3.336 ± 1.028
1.112GlnPro: 1.112 ± 0.408
0.556GlnGln: 0.556 ± 0.353
1.112GlnArg: 1.112 ± 0.58
1.39GlnSer: 1.39 ± 0.667
0.834GlnThr: 0.834 ± 0.348
1.946GlnVal: 1.946 ± 0.647
0.0GlnTrp: 0.0 ± 0.0
1.112GlnTyr: 1.112 ± 0.319
0.0GlnXaa: 0.0 ± 0.0
Arg
3.058ArgAla: 3.058 ± 1.006
0.834ArgCys: 0.834 ± 0.627
1.946ArgAsp: 1.946 ± 0.561
3.336ArgGlu: 3.336 ± 0.67
2.224ArgPhe: 2.224 ± 0.565
1.668ArgGly: 1.668 ± 0.651
1.39ArgHis: 1.39 ± 0.508
3.336ArgIle: 3.336 ± 0.805
3.058ArgLys: 3.058 ± 1.034
2.78ArgLeu: 2.78 ± 0.952
1.39ArgMet: 1.39 ± 1.341
2.502ArgAsn: 2.502 ± 0.705
2.224ArgPro: 2.224 ± 0.75
1.946ArgGln: 1.946 ± 0.312
1.668ArgArg: 1.668 ± 0.793
4.448ArgSer: 4.448 ± 1.253
2.502ArgThr: 2.502 ± 0.851
1.946ArgVal: 1.946 ± 0.939
0.278ArgTrp: 0.278 ± 0.353
1.39ArgTyr: 1.39 ± 0.682
0.0ArgXaa: 0.0 ± 0.0
Ser
2.502SerAla: 2.502 ± 0.919
1.112SerCys: 1.112 ± 0.388
5.56SerAsp: 5.56 ± 1.446
5.004SerGlu: 5.004 ± 1.155
3.336SerPhe: 3.336 ± 0.711
3.614SerGly: 3.614 ± 0.805
1.112SerHis: 1.112 ± 0.402
6.394SerIle: 6.394 ± 1.937
6.95SerLys: 6.95 ± 0.403
6.394SerLeu: 6.394 ± 1.722
1.39SerMet: 1.39 ± 0.484
2.78SerAsn: 2.78 ± 0.962
2.224SerPro: 2.224 ± 0.707
0.834SerGln: 0.834 ± 0.305
4.17SerArg: 4.17 ± 0.787
2.502SerSer: 2.502 ± 0.825
3.336SerThr: 3.336 ± 1.224
1.946SerVal: 1.946 ± 1.475
2.78SerTrp: 2.78 ± 0.974
1.668SerTyr: 1.668 ± 0.499
0.0SerXaa: 0.0 ± 0.0
Thr
1.39ThrAla: 1.39 ± 0.683
0.556ThrCys: 0.556 ± 0.55
2.78ThrAsp: 2.78 ± 0.654
4.726ThrGlu: 4.726 ± 0.417
1.112ThrPhe: 1.112 ± 0.627
4.726ThrGly: 4.726 ± 1.821
0.834ThrHis: 0.834 ± 0.47
2.78ThrIle: 2.78 ± 0.931
3.058ThrLys: 3.058 ± 1.107
6.394ThrLeu: 6.394 ± 2.908
0.834ThrMet: 0.834 ± 0.325
5.004ThrAsn: 5.004 ± 1.216
1.39ThrPro: 1.39 ± 0.682
1.39ThrGln: 1.39 ± 0.481
3.336ThrArg: 3.336 ± 1.681
2.224ThrSer: 2.224 ± 0.917
2.502ThrThr: 2.502 ± 0.594
3.336ThrVal: 3.336 ± 0.67
2.224ThrTrp: 2.224 ± 0.476
1.112ThrTyr: 1.112 ± 0.388
0.0ThrXaa: 0.0 ± 0.0
Val
1.946ValAla: 1.946 ± 1.331
1.112ValCys: 1.112 ± 0.388
3.336ValAsp: 3.336 ± 1.251
2.502ValGlu: 2.502 ± 1.342
1.668ValPhe: 1.668 ± 0.637
2.224ValGly: 2.224 ± 0.568
0.556ValHis: 0.556 ± 0.314
3.892ValIle: 3.892 ± 0.397
3.892ValLys: 3.892 ± 1.494
3.892ValLeu: 3.892 ± 1.049
0.834ValMet: 0.834 ± 0.546
3.058ValAsn: 3.058 ± 1.618
2.502ValPro: 2.502 ± 0.461
1.39ValGln: 1.39 ± 1.182
2.224ValArg: 2.224 ± 0.777
3.058ValSer: 3.058 ± 0.75
1.39ValThr: 1.39 ± 1.158
1.946ValVal: 1.946 ± 1.033
0.834ValTrp: 0.834 ± 0.964
1.946ValTyr: 1.946 ± 0.469
0.0ValXaa: 0.0 ± 0.0
Trp
0.278TrpAla: 0.278 ± 0.157
0.556TrpCys: 0.556 ± 0.314
1.112TrpAsp: 1.112 ± 0.388
2.224TrpGlu: 2.224 ± 0.758
1.112TrpPhe: 1.112 ± 0.782
1.112TrpGly: 1.112 ± 0.627
0.0TrpHis: 0.0 ± 0.0
2.224TrpIle: 2.224 ± 1.113
0.834TrpLys: 0.834 ± 0.47
1.39TrpLeu: 1.39 ± 0.531
0.556TrpMet: 0.556 ± 0.314
0.834TrpAsn: 0.834 ± 0.305
0.556TrpPro: 0.556 ± 0.314
0.556TrpGln: 0.556 ± 0.72
1.39TrpArg: 1.39 ± 1.326
0.278TrpSer: 0.278 ± 0.157
0.834TrpThr: 0.834 ± 0.558
1.668TrpVal: 1.668 ± 1.611
0.278TrpTrp: 0.278 ± 0.157
1.112TrpTyr: 1.112 ± 0.388
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.39TyrAla: 1.39 ± 0.511
0.556TyrCys: 0.556 ± 0.314
2.502TyrAsp: 2.502 ± 0.917
2.502TyrGlu: 2.502 ± 1.693
0.834TyrPhe: 0.834 ± 0.305
2.78TyrGly: 2.78 ± 0.318
1.668TyrHis: 1.668 ± 0.335
1.668TyrIle: 1.668 ± 0.826
5.282TyrLys: 5.282 ± 0.913
5.282TyrLeu: 5.282 ± 1.904
0.278TyrMet: 0.278 ± 0.421
1.39TyrAsn: 1.39 ± 0.662
1.39TyrPro: 1.39 ± 0.784
1.668TyrGln: 1.668 ± 0.361
1.39TyrArg: 1.39 ± 0.764
2.502TyrSer: 2.502 ± 0.461
1.946TyrThr: 1.946 ± 0.523
1.668TyrVal: 1.668 ± 0.498
0.556TyrTrp: 0.556 ± 0.29
0.834TyrTyr: 0.834 ± 0.558
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3598 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski