Amino acid dipepetide frequency for Helminthosporium victoriae 145S virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.398AlaAla: 3.398 ± 0.875
1.133AlaCys: 1.133 ± 0.344
5.096AlaAsp: 5.096 ± 1.096
5.379AlaGlu: 5.379 ± 0.952
3.398AlaPhe: 3.398 ± 0.517
5.946AlaGly: 5.946 ± 1.467
1.416AlaHis: 1.416 ± 0.676
4.53AlaIle: 4.53 ± 1.023
3.964AlaLys: 3.964 ± 0.608
3.681AlaLeu: 3.681 ± 0.795
3.398AlaMet: 3.398 ± 0.79
4.247AlaAsn: 4.247 ± 1.184
0.849AlaPro: 0.849 ± 0.214
0.566AlaGln: 0.566 ± 0.432
5.379AlaArg: 5.379 ± 1.444
5.096AlaSer: 5.096 ± 1.129
3.398AlaThr: 3.398 ± 1.205
4.53AlaVal: 4.53 ± 0.691
1.416AlaTrp: 1.416 ± 0.413
2.548AlaTyr: 2.548 ± 0.471
0.0AlaXaa: 0.0 ± 0.0
Cys
2.548CysAla: 2.548 ± 0.116
0.283CysCys: 0.283 ± 0.28
1.416CysAsp: 1.416 ± 0.621
0.849CysGlu: 0.849 ± 0.569
0.566CysPhe: 0.566 ± 0.275
1.982CysGly: 1.982 ± 1.029
0.283CysHis: 0.283 ± 0.216
1.133CysIle: 1.133 ± 1.121
0.566CysLys: 0.566 ± 0.263
0.566CysLeu: 0.566 ± 0.263
0.566CysMet: 0.566 ± 0.275
0.283CysAsn: 0.283 ± 0.216
1.133CysPro: 1.133 ± 0.344
0.566CysGln: 0.566 ± 0.511
0.566CysArg: 0.566 ± 0.275
0.849CysSer: 0.849 ± 0.295
0.566CysThr: 0.566 ± 0.275
0.566CysVal: 0.566 ± 0.275
0.283CysTrp: 0.283 ± 0.255
1.982CysTyr: 1.982 ± 0.716
0.0CysXaa: 0.0 ± 0.0
Asp
5.096AspAla: 5.096 ± 0.469
1.133AspCys: 1.133 ± 0.344
4.53AspAsp: 4.53 ± 0.472
5.379AspGlu: 5.379 ± 0.671
2.265AspPhe: 2.265 ± 0.692
5.096AspGly: 5.096 ± 0.926
1.699AspHis: 1.699 ± 0.359
3.398AspIle: 3.398 ± 0.762
4.53AspLys: 4.53 ± 0.972
4.813AspLeu: 4.813 ± 0.542
2.831AspMet: 2.831 ± 0.682
1.699AspAsn: 1.699 ± 0.237
1.133AspPro: 1.133 ± 0.614
1.416AspGln: 1.416 ± 0.772
5.096AspArg: 5.096 ± 1.178
4.813AspSer: 4.813 ± 2.32
1.416AspThr: 1.416 ± 0.953
6.229AspVal: 6.229 ± 1.273
1.416AspTrp: 1.416 ± 0.8
1.982AspTyr: 1.982 ± 1.239
0.0AspXaa: 0.0 ± 0.0
Glu
5.946GluAla: 5.946 ± 1.72
1.699GluCys: 1.699 ± 0.551
3.398GluAsp: 3.398 ± 1.105
5.096GluGlu: 5.096 ± 0.614
2.548GluPhe: 2.548 ± 0.737
3.681GluGly: 3.681 ± 1.311
1.699GluHis: 1.699 ± 0.612
2.831GluIle: 2.831 ± 0.551
5.946GluLys: 5.946 ± 0.852
5.379GluLeu: 5.379 ± 0.282
4.247GluMet: 4.247 ± 0.406
3.114GluAsn: 3.114 ± 1.159
1.982GluPro: 1.982 ± 0.679
0.849GluGln: 0.849 ± 0.648
5.096GluArg: 5.096 ± 0.974
6.512GluSer: 6.512 ± 1.899
2.265GluThr: 2.265 ± 0.77
6.795GluVal: 6.795 ± 0.666
3.114GluTrp: 3.114 ± 1.017
3.964GluTyr: 3.964 ± 0.868
0.0GluXaa: 0.0 ± 0.0
Phe
1.699PheAla: 1.699 ± 1.009
0.849PheCys: 0.849 ± 0.408
3.114PheAsp: 3.114 ± 0.542
3.964PheGlu: 3.964 ± 1.093
1.133PhePhe: 1.133 ± 0.344
2.265PheGly: 2.265 ± 0.748
0.849PheHis: 0.849 ± 0.214
0.566PheIle: 0.566 ± 0.432
1.982PheLys: 1.982 ± 0.464
1.699PheLeu: 1.699 ± 0.534
1.133PheMet: 1.133 ± 0.344
2.548PheAsn: 2.548 ± 0.405
0.566PhePro: 0.566 ± 0.263
0.0PheGln: 0.0 ± 0.0
3.114PheArg: 3.114 ± 0.63
2.265PheSer: 2.265 ± 0.688
1.416PheThr: 1.416 ± 0.291
1.133PheVal: 1.133 ± 0.597
0.283PheTrp: 0.283 ± 0.242
0.283PheTyr: 0.283 ± 0.216
0.0PheXaa: 0.0 ± 0.0
Gly
3.681GlyAla: 3.681 ± 0.554
0.849GlyCys: 0.849 ± 0.266
3.964GlyAsp: 3.964 ± 0.555
3.398GlyGlu: 3.398 ± 0.474
1.133GlyPhe: 1.133 ± 0.535
3.681GlyGly: 3.681 ± 0.699
1.699GlyHis: 1.699 ± 0.42
4.53GlyIle: 4.53 ± 0.94
3.964GlyLys: 3.964 ± 1.352
5.946GlyLeu: 5.946 ± 0.73
4.53GlyMet: 4.53 ± 1.249
3.398GlyAsn: 3.398 ± 0.675
0.283GlyPro: 0.283 ± 0.255
0.849GlyGln: 0.849 ± 0.266
4.247GlyArg: 4.247 ± 0.733
5.946GlySer: 5.946 ± 1.546
3.114GlyThr: 3.114 ± 0.955
5.946GlyVal: 5.946 ± 1.41
1.699GlyTrp: 1.699 ± 0.54
1.699GlyTyr: 1.699 ± 0.619
0.0GlyXaa: 0.0 ± 0.0
His
1.416HisAla: 1.416 ± 0.959
0.283HisCys: 0.283 ± 0.28
1.416HisAsp: 1.416 ± 0.522
1.699HisGlu: 1.699 ± 0.54
0.283HisPhe: 0.283 ± 0.255
0.849HisGly: 0.849 ± 0.463
0.849HisHis: 0.849 ± 0.421
1.982HisIle: 1.982 ± 0.387
1.699HisLys: 1.699 ± 0.54
1.133HisLeu: 1.133 ± 0.681
1.416HisMet: 1.416 ± 0.291
1.699HisAsn: 1.699 ± 0.59
0.0HisPro: 0.0 ± 0.0
0.849HisGln: 0.849 ± 0.214
1.133HisArg: 1.133 ± 0.089
1.699HisSer: 1.699 ± 0.534
0.283HisThr: 0.283 ± 0.216
0.849HisVal: 0.849 ± 0.421
0.849HisTrp: 0.849 ± 0.214
0.849HisTyr: 0.849 ± 0.408
0.0HisXaa: 0.0 ± 0.0
Ile
3.964IleAla: 3.964 ± 0.636
0.566IleCys: 0.566 ± 0.56
3.681IleAsp: 3.681 ± 0.999
3.681IleGlu: 3.681 ± 0.448
1.699IlePhe: 1.699 ± 0.302
3.398IleGly: 3.398 ± 0.814
1.133IleHis: 1.133 ± 0.479
0.849IleIle: 0.849 ± 0.463
2.831IleLys: 2.831 ± 0.713
3.964IleLeu: 3.964 ± 0.55
1.416IleMet: 1.416 ± 0.253
2.548IleAsn: 2.548 ± 0.757
3.114IlePro: 3.114 ± 0.589
0.283IleGln: 0.283 ± 0.28
3.114IleArg: 3.114 ± 0.302
3.114IleSer: 3.114 ± 0.734
4.53IleThr: 4.53 ± 1.556
2.831IleVal: 2.831 ± 0.297
0.0IleTrp: 0.0 ± 0.0
1.133IleTyr: 1.133 ± 0.7
0.0IleXaa: 0.0 ± 0.0
Lys
3.964LysAla: 3.964 ± 0.817
0.566LysCys: 0.566 ± 0.275
5.379LysAsp: 5.379 ± 1.342
4.247LysGlu: 4.247 ± 0.406
1.699LysPhe: 1.699 ± 0.915
4.53LysGly: 4.53 ± 0.488
1.133LysHis: 1.133 ± 0.711
2.265LysIle: 2.265 ± 0.661
5.096LysLys: 5.096 ± 0.79
7.928LysLeu: 7.928 ± 0.605
3.114LysMet: 3.114 ± 0.407
2.548LysAsn: 2.548 ± 0.732
1.982LysPro: 1.982 ± 0.572
1.699LysGln: 1.699 ± 0.541
3.114LysArg: 3.114 ± 1.197
4.247LysSer: 4.247 ± 1.493
3.114LysThr: 3.114 ± 0.506
5.096LysVal: 5.096 ± 1.318
1.699LysTrp: 1.699 ± 0.686
1.416LysTyr: 1.416 ± 0.672
0.0LysXaa: 0.0 ± 0.0
Leu
7.644LeuAla: 7.644 ± 1.565
1.982LeuCys: 1.982 ± 0.559
5.379LeuAsp: 5.379 ± 1.293
5.379LeuGlu: 5.379 ± 1.239
3.398LeuPhe: 3.398 ± 1.372
3.964LeuGly: 3.964 ± 0.4
1.416LeuHis: 1.416 ± 0.148
2.548LeuIle: 2.548 ± 0.667
3.114LeuLys: 3.114 ± 0.542
5.663LeuLeu: 5.663 ± 1.113
3.114LeuMet: 3.114 ± 0.845
1.699LeuAsn: 1.699 ± 0.715
3.398LeuPro: 3.398 ± 0.735
0.566LeuGln: 0.566 ± 0.485
5.096LeuArg: 5.096 ± 0.941
5.946LeuSer: 5.946 ± 1.825
4.53LeuThr: 4.53 ± 1.042
6.795LeuVal: 6.795 ± 0.542
0.849LeuTrp: 0.849 ± 0.569
1.982LeuTyr: 1.982 ± 0.714
0.0LeuXaa: 0.0 ± 0.0
Met
5.096MetAla: 5.096 ± 1.541
1.416MetCys: 1.416 ± 0.59
1.699MetAsp: 1.699 ± 0.789
4.247MetGlu: 4.247 ± 0.406
2.265MetPhe: 2.265 ± 0.812
3.681MetGly: 3.681 ± 0.798
1.416MetHis: 1.416 ± 0.621
3.398MetIle: 3.398 ± 0.306
3.114MetLys: 3.114 ± 1.204
2.831MetLeu: 2.831 ± 0.175
2.265MetMet: 2.265 ± 0.718
1.699MetAsn: 1.699 ± 0.686
2.265MetPro: 2.265 ± 0.524
1.133MetGln: 1.133 ± 0.089
3.398MetArg: 3.398 ± 0.994
2.831MetSer: 2.831 ± 0.243
3.681MetThr: 3.681 ± 0.623
2.548MetVal: 2.548 ± 0.513
0.0MetTrp: 0.0 ± 0.0
1.982MetTyr: 1.982 ± 0.597
0.0MetXaa: 0.0 ± 0.0
Asn
2.265AsnAla: 2.265 ± 0.416
0.849AsnCys: 0.849 ± 0.569
3.114AsnAsp: 3.114 ± 0.843
2.831AsnGlu: 2.831 ± 0.513
2.265AsnPhe: 2.265 ± 0.244
2.831AsnGly: 2.831 ± 0.852
0.566AsnHis: 0.566 ± 0.275
2.265AsnIle: 2.265 ± 0.179
1.699AsnLys: 1.699 ± 0.496
3.964AsnLeu: 3.964 ± 0.627
1.699AsnMet: 1.699 ± 0.359
1.133AsnAsn: 1.133 ± 0.535
1.416AsnPro: 1.416 ± 0.593
0.0AsnGln: 0.0 ± 0.0
3.681AsnArg: 3.681 ± 0.713
2.548AsnSer: 2.548 ± 0.456
2.831AsnThr: 2.831 ± 1.025
4.813AsnVal: 4.813 ± 0.655
0.566AsnTrp: 0.566 ± 0.275
0.849AsnTyr: 0.849 ± 0.485
0.0AsnXaa: 0.0 ± 0.0
Pro
2.548ProAla: 2.548 ± 0.625
0.0ProCys: 0.0 ± 0.0
2.548ProAsp: 2.548 ± 0.642
2.548ProGlu: 2.548 ± 0.429
0.566ProPhe: 0.566 ± 0.268
2.831ProGly: 2.831 ± 0.771
0.849ProHis: 0.849 ± 0.214
0.566ProIle: 0.566 ± 0.263
2.831ProLys: 2.831 ± 0.581
0.566ProLeu: 0.566 ± 0.485
1.416ProMet: 1.416 ± 0.412
1.699ProAsn: 1.699 ± 0.674
1.982ProPro: 1.982 ± 0.272
1.133ProGln: 1.133 ± 0.374
0.849ProArg: 0.849 ± 0.248
1.982ProSer: 1.982 ± 0.684
1.982ProThr: 1.982 ± 1.085
2.265ProVal: 2.265 ± 0.884
0.283ProTrp: 0.283 ± 0.28
0.849ProTyr: 0.849 ± 0.421
0.0ProXaa: 0.0 ± 0.0
Gln
1.133GlnAla: 1.133 ± 0.689
0.0GlnCys: 0.0 ± 0.0
0.566GlnAsp: 0.566 ± 0.313
2.265GlnGlu: 2.265 ± 0.893
0.849GlnPhe: 0.849 ± 0.648
0.849GlnGly: 0.849 ± 0.408
0.566GlnHis: 0.566 ± 0.268
0.0GlnIle: 0.0 ± 0.0
1.133GlnLys: 1.133 ± 0.374
0.849GlnLeu: 0.849 ± 0.248
2.265GlnMet: 2.265 ± 0.639
0.566GlnAsn: 0.566 ± 0.268
0.283GlnPro: 0.283 ± 0.216
0.566GlnGln: 0.566 ± 0.432
0.283GlnArg: 0.283 ± 0.255
1.699GlnSer: 1.699 ± 0.541
0.849GlnThr: 0.849 ± 0.472
1.982GlnVal: 1.982 ± 0.597
0.0GlnTrp: 0.0 ± 0.0
1.416GlnTyr: 1.416 ± 0.357
0.0GlnXaa: 0.0 ± 0.0
Arg
3.398ArgAla: 3.398 ± 1.011
1.699ArgCys: 1.699 ± 0.378
3.964ArgAsp: 3.964 ± 0.736
6.512ArgGlu: 6.512 ± 0.848
1.416ArgPhe: 1.416 ± 0.716
3.964ArgGly: 3.964 ± 1.114
0.849ArgHis: 0.849 ± 0.648
5.096ArgIle: 5.096 ± 1.275
4.813ArgLys: 4.813 ± 0.414
4.813ArgLeu: 4.813 ± 0.456
1.982ArgMet: 1.982 ± 0.574
1.982ArgAsn: 1.982 ± 0.616
2.831ArgPro: 2.831 ± 0.502
1.416ArgGln: 1.416 ± 0.716
6.229ArgArg: 6.229 ± 0.535
3.398ArgSer: 3.398 ± 0.996
3.681ArgThr: 3.681 ± 0.644
6.512ArgVal: 6.512 ± 1.187
0.849ArgTrp: 0.849 ± 0.408
1.982ArgTyr: 1.982 ± 0.661
0.0ArgXaa: 0.0 ± 0.0
Ser
4.813SerAla: 4.813 ± 1.636
1.699SerCys: 1.699 ± 0.915
4.813SerAsp: 4.813 ± 1.727
5.946SerGlu: 5.946 ± 0.736
1.699SerPhe: 1.699 ± 0.359
3.964SerGly: 3.964 ± 0.587
1.133SerHis: 1.133 ± 0.55
3.681SerIle: 3.681 ± 0.745
5.096SerLys: 5.096 ± 1.175
5.379SerLeu: 5.379 ± 0.298
3.398SerMet: 3.398 ± 0.539
4.247SerAsn: 4.247 ± 0.938
2.265SerPro: 2.265 ± 0.966
1.416SerGln: 1.416 ± 0.515
3.398SerArg: 3.398 ± 0.84
7.361SerSer: 7.361 ± 1.166
2.831SerThr: 2.831 ± 1.095
7.644SerVal: 7.644 ± 2.391
0.283SerTrp: 0.283 ± 0.216
3.114SerTyr: 3.114 ± 1.374
0.0SerXaa: 0.0 ± 0.0
Thr
2.548ThrAla: 2.548 ± 1.273
0.283ThrCys: 0.283 ± 0.242
1.982ThrAsp: 1.982 ± 0.314
3.964ThrGlu: 3.964 ± 0.736
0.849ThrPhe: 0.849 ± 0.471
4.247ThrGly: 4.247 ± 1.11
1.133ThrHis: 1.133 ± 0.526
3.681ThrIle: 3.681 ± 1.103
4.247ThrLys: 4.247 ± 0.792
2.831ThrLeu: 2.831 ± 0.713
2.831ThrMet: 2.831 ± 0.755
1.982ThrAsn: 1.982 ± 0.47
2.548ThrPro: 2.548 ± 1.021
1.416ThrGln: 1.416 ± 0.618
2.831ThrArg: 2.831 ± 0.351
2.548ThrSer: 2.548 ± 2.298
3.681ThrThr: 3.681 ± 0.763
4.53ThrVal: 4.53 ± 1.023
1.416ThrTrp: 1.416 ± 0.477
1.133ThrTyr: 1.133 ± 0.526
0.0ThrXaa: 0.0 ± 0.0
Val
4.53ValAla: 4.53 ± 1.046
1.416ValCys: 1.416 ± 0.148
5.379ValAsp: 5.379 ± 0.553
7.361ValGlu: 7.361 ± 0.809
1.982ValPhe: 1.982 ± 0.5
3.398ValGly: 3.398 ± 0.549
1.133ValHis: 1.133 ± 0.763
3.114ValIle: 3.114 ± 0.658
3.681ValLys: 3.681 ± 0.48
8.211ValLeu: 8.211 ± 1.63
5.663ValMet: 5.663 ± 1.296
3.681ValAsn: 3.681 ± 0.999
1.699ValPro: 1.699 ± 0.643
1.982ValGln: 1.982 ± 0.716
7.078ValArg: 7.078 ± 0.781
7.644ValSer: 7.644 ± 1.039
3.964ValThr: 3.964 ± 0.988
10.193ValVal: 10.193 ± 1.017
0.566ValTrp: 0.566 ± 0.275
2.265ValTyr: 2.265 ± 0.718
0.0ValXaa: 0.0 ± 0.0
Trp
1.982TrpAla: 1.982 ± 0.906
0.566TrpCys: 0.566 ± 0.341
1.416TrpAsp: 1.416 ± 0.433
0.566TrpGlu: 0.566 ± 0.432
0.283TrpPhe: 0.283 ± 0.255
0.849TrpGly: 0.849 ± 0.451
0.0TrpHis: 0.0 ± 0.0
0.283TrpIle: 0.283 ± 0.216
1.699TrpLys: 1.699 ± 0.703
1.133TrpLeu: 1.133 ± 0.506
1.416TrpMet: 1.416 ± 0.818
0.566TrpAsn: 0.566 ± 0.275
0.0TrpPro: 0.0 ± 0.0
0.283TrpGln: 0.283 ± 0.255
1.133TrpArg: 1.133 ± 0.344
0.849TrpSer: 0.849 ± 0.266
0.566TrpThr: 0.566 ± 0.56
1.982TrpVal: 1.982 ± 0.903
0.566TrpTrp: 0.566 ± 0.275
0.283TrpTyr: 0.283 ± 0.28
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.699TyrAla: 1.699 ± 0.718
0.283TyrCys: 0.283 ± 0.242
3.114TyrAsp: 3.114 ± 0.545
0.849TyrGlu: 0.849 ± 0.248
0.566TyrPhe: 0.566 ± 0.263
2.548TyrGly: 2.548 ± 0.704
1.416TyrHis: 1.416 ± 0.621
1.416TyrIle: 1.416 ± 0.412
2.831TyrLys: 2.831 ± 0.297
3.114TyrLeu: 3.114 ± 0.268
1.982TyrMet: 1.982 ± 0.175
0.849TyrAsn: 0.849 ± 0.214
0.566TyrPro: 0.566 ± 0.275
1.133TyrGln: 1.133 ± 0.707
2.548TyrArg: 2.548 ± 0.486
2.831TyrSer: 2.831 ± 0.351
1.982TyrThr: 1.982 ± 0.387
1.699TyrVal: 1.699 ± 0.359
0.283TyrTrp: 0.283 ± 0.216
0.283TyrTyr: 0.283 ± 0.216
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3533 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski