Amino acid dipepetide frequency for Anthurium mosaic-associated virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.822AlaAla: 2.822 ± 0.533
0.847AlaCys: 0.847 ± 0.504
3.105AlaAsp: 3.105 ± 0.927
2.822AlaGlu: 2.822 ± 0.725
2.258AlaPhe: 2.258 ± 0.822
4.798AlaGly: 4.798 ± 0.807
0.564AlaHis: 0.564 ± 0.246
4.234AlaIle: 4.234 ± 1.291
3.387AlaLys: 3.387 ± 0.709
5.08AlaLeu: 5.08 ± 0.985
1.129AlaMet: 1.129 ± 0.527
1.976AlaAsn: 1.976 ± 0.247
1.129AlaPro: 1.129 ± 0.894
1.976AlaGln: 1.976 ± 0.629
2.822AlaArg: 2.822 ± 0.748
3.105AlaSer: 3.105 ± 1.243
2.822AlaThr: 2.822 ± 1.048
1.411AlaVal: 1.411 ± 0.388
0.564AlaTrp: 0.564 ± 0.447
2.258AlaTyr: 2.258 ± 0.846
0.0AlaXaa: 0.0 ± 0.0
Cys
1.129CysAla: 1.129 ± 0.324
0.282CysCys: 0.282 ± 0.223
1.693CysAsp: 1.693 ± 0.613
1.129CysGlu: 1.129 ± 0.695
1.129CysPhe: 1.129 ± 0.599
0.847CysGly: 0.847 ± 0.389
0.282CysHis: 0.282 ± 0.45
1.693CysIle: 1.693 ± 0.431
1.976CysLys: 1.976 ± 0.456
0.847CysLeu: 0.847 ± 0.587
0.0CysMet: 0.0 ± 0.0
0.564CysAsn: 0.564 ± 0.448
1.129CysPro: 1.129 ± 0.587
0.282CysGln: 0.282 ± 0.45
1.129CysArg: 1.129 ± 0.464
3.105CysSer: 3.105 ± 1.213
1.411CysThr: 1.411 ± 0.359
1.693CysVal: 1.693 ± 0.325
0.564CysTrp: 0.564 ± 0.448
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.105AspAla: 3.105 ± 1.053
0.564AspCys: 0.564 ± 0.296
1.693AspAsp: 1.693 ± 0.481
5.927AspGlu: 5.927 ± 1.925
3.669AspPhe: 3.669 ± 0.969
3.669AspGly: 3.669 ± 0.912
0.282AspHis: 0.282 ± 0.244
2.54AspIle: 2.54 ± 0.402
3.669AspLys: 3.669 ± 0.824
1.976AspLeu: 1.976 ± 0.247
2.258AspMet: 2.258 ± 0.56
2.258AspAsn: 2.258 ± 0.553
1.693AspPro: 1.693 ± 0.639
1.129AspGln: 1.129 ± 0.486
2.258AspArg: 2.258 ± 1.247
3.105AspSer: 3.105 ± 1.143
1.693AspThr: 1.693 ± 0.481
4.516AspVal: 4.516 ± 1.304
1.411AspTrp: 1.411 ± 0.373
1.129AspTyr: 1.129 ± 0.383
0.0AspXaa: 0.0 ± 0.0
Glu
3.669GluAla: 3.669 ± 1.354
0.564GluCys: 0.564 ± 0.246
3.387GluAsp: 3.387 ± 1.017
10.161GluGlu: 10.161 ± 3.843
3.105GluPhe: 3.105 ± 1.255
5.645GluGly: 5.645 ± 1.374
1.411GluHis: 1.411 ± 0.451
7.621GluIle: 7.621 ± 2.123
5.645GluLys: 5.645 ± 1.672
6.774GluLeu: 6.774 ± 1.781
3.387GluMet: 3.387 ± 0.791
2.54GluAsn: 2.54 ± 1.372
1.129GluPro: 1.129 ± 0.383
1.976GluGln: 1.976 ± 1.073
7.903GluArg: 7.903 ± 3.013
5.08GluSer: 5.08 ± 1.181
5.08GluThr: 5.08 ± 1.201
3.951GluVal: 3.951 ± 0.923
3.387GluTrp: 3.387 ± 0.807
2.258GluTyr: 2.258 ± 1.056
0.0GluXaa: 0.0 ± 0.0
Phe
1.129PheAla: 1.129 ± 0.993
2.54PheCys: 2.54 ± 1.292
1.129PheAsp: 1.129 ± 0.623
1.976PheGlu: 1.976 ± 0.384
2.54PhePhe: 2.54 ± 3.168
3.387PheGly: 3.387 ± 1.168
0.847PheHis: 0.847 ± 0.44
2.822PheIle: 2.822 ± 0.599
0.847PheLys: 0.847 ± 0.639
5.08PheLeu: 5.08 ± 4.457
0.564PheMet: 0.564 ± 0.486
1.976PheAsn: 1.976 ± 0.953
1.976PhePro: 1.976 ± 1.774
1.129PheGln: 1.129 ± 0.587
1.411PheArg: 1.411 ± 0.356
4.798PheSer: 4.798 ± 3.123
1.976PheThr: 1.976 ± 0.707
2.822PheVal: 2.822 ± 1.027
0.847PheTrp: 0.847 ± 0.457
0.282PheTyr: 0.282 ± 0.263
0.0PheXaa: 0.0 ± 0.0
Gly
3.105GlyAla: 3.105 ± 1.144
1.411GlyCys: 1.411 ± 0.96
3.105GlyAsp: 3.105 ± 1.079
6.492GlyGlu: 6.492 ± 2.077
3.105GlyPhe: 3.105 ± 0.696
5.927GlyGly: 5.927 ± 1.144
0.282GlyHis: 0.282 ± 0.263
4.516GlyIle: 4.516 ± 1.015
6.209GlyLys: 6.209 ± 1.5
6.209GlyLeu: 6.209 ± 1.5
1.976GlyMet: 1.976 ± 0.807
4.516GlyAsn: 4.516 ± 0.659
2.258GlyPro: 2.258 ± 0.913
1.129GlyGln: 1.129 ± 0.695
5.363GlyArg: 5.363 ± 0.863
6.774GlySer: 6.774 ± 1.152
3.105GlyThr: 3.105 ± 0.57
5.08GlyVal: 5.08 ± 1.602
2.258GlyTrp: 2.258 ± 0.544
2.822GlyTyr: 2.822 ± 0.936
0.0GlyXaa: 0.0 ± 0.0
His
0.564HisAla: 0.564 ± 0.246
1.129HisCys: 1.129 ± 0.519
0.847HisAsp: 0.847 ± 0.198
0.564HisGlu: 0.564 ± 0.296
0.564HisPhe: 0.564 ± 0.489
0.847HisGly: 0.847 ± 0.546
0.282HisHis: 0.282 ± 0.45
0.847HisIle: 0.847 ± 0.733
0.564HisLys: 0.564 ± 0.296
1.693HisLeu: 1.693 ± 1.254
0.564HisMet: 0.564 ± 0.246
1.693HisAsn: 1.693 ± 0.723
0.564HisPro: 0.564 ± 0.448
0.0HisGln: 0.0 ± 0.0
0.847HisArg: 0.847 ± 0.389
1.411HisSer: 1.411 ± 0.811
0.282HisThr: 0.282 ± 0.223
1.129HisVal: 1.129 ± 0.537
0.0HisTrp: 0.0 ± 0.0
0.282HisTyr: 0.282 ± 0.223
0.0HisXaa: 0.0 ± 0.0
Ile
2.822IleAla: 2.822 ± 0.409
2.258IleCys: 2.258 ± 1.142
3.105IleAsp: 3.105 ± 0.831
5.08IleGlu: 5.08 ± 1.529
2.822IlePhe: 2.822 ± 1.262
5.645IleGly: 5.645 ± 1.115
0.282IleHis: 0.282 ± 0.263
4.234IleIle: 4.234 ± 0.472
5.363IleLys: 5.363 ± 1.717
5.645IleLeu: 5.645 ± 1.263
1.693IleMet: 1.693 ± 0.671
2.822IleAsn: 2.822 ± 0.65
2.258IlePro: 2.258 ± 0.669
0.564IleGln: 0.564 ± 0.246
3.951IleArg: 3.951 ± 0.743
5.645IleSer: 5.645 ± 1.457
2.822IleThr: 2.822 ± 0.732
4.516IleVal: 4.516 ± 1.172
1.129IleTrp: 1.129 ± 0.3
1.411IleTyr: 1.411 ± 0.385
0.0IleXaa: 0.0 ± 0.0
Lys
3.105LysAla: 3.105 ± 1.413
1.411LysCys: 1.411 ± 0.303
3.387LysAsp: 3.387 ± 1.11
9.314LysGlu: 9.314 ± 3.963
1.693LysPhe: 1.693 ± 0.4
3.951LysGly: 3.951 ± 1.242
0.282LysHis: 0.282 ± 0.263
5.08LysIle: 5.08 ± 1.255
7.056LysLys: 7.056 ± 2.013
7.903LysLeu: 7.903 ± 0.959
3.387LysMet: 3.387 ± 0.962
2.54LysAsn: 2.54 ± 1.083
1.976LysPro: 1.976 ± 0.247
1.411LysGln: 1.411 ± 0.669
4.516LysArg: 4.516 ± 1.616
4.516LysSer: 4.516 ± 1.188
2.822LysThr: 2.822 ± 0.327
4.798LysVal: 4.798 ± 1.18
0.564LysTrp: 0.564 ± 0.264
3.669LysTyr: 3.669 ± 0.164
0.0LysXaa: 0.0 ± 0.0
Leu
4.234LeuAla: 4.234 ± 0.715
2.54LeuCys: 2.54 ± 0.867
4.234LeuAsp: 4.234 ± 0.577
6.209LeuGlu: 6.209 ± 2.657
3.951LeuPhe: 3.951 ± 1.723
8.185LeuGly: 8.185 ± 1.137
2.258LeuHis: 2.258 ± 1.197
4.516LeuIle: 4.516 ± 0.824
5.645LeuLys: 5.645 ± 1.309
7.903LeuLeu: 7.903 ± 2.588
3.105LeuMet: 3.105 ± 0.855
4.516LeuAsn: 4.516 ± 1.29
4.234LeuPro: 4.234 ± 1.996
1.411LeuGln: 1.411 ± 1.245
5.363LeuArg: 5.363 ± 1.193
11.572LeuSer: 11.572 ± 6.34
3.951LeuThr: 3.951 ± 1.414
4.516LeuVal: 4.516 ± 0.494
0.847LeuTrp: 0.847 ± 0.45
3.387LeuTyr: 3.387 ± 1.807
0.0LeuXaa: 0.0 ± 0.0
Met
2.54MetAla: 2.54 ± 1.091
0.282MetCys: 0.282 ± 0.45
1.411MetAsp: 1.411 ± 0.612
4.234MetGlu: 4.234 ± 0.983
0.0MetPhe: 0.0 ± 0.0
3.669MetGly: 3.669 ± 1.359
0.282MetHis: 0.282 ± 0.223
0.847MetIle: 0.847 ± 0.639
2.258MetLys: 2.258 ± 0.586
4.516MetLeu: 4.516 ± 1.405
0.847MetMet: 0.847 ± 0.47
1.129MetAsn: 1.129 ± 0.522
0.564MetPro: 0.564 ± 0.448
0.564MetGln: 0.564 ± 0.264
2.822MetArg: 2.822 ± 0.936
3.669MetSer: 3.669 ± 0.728
0.847MetThr: 0.847 ± 0.491
2.822MetVal: 2.822 ± 0.982
0.0MetTrp: 0.0 ± 0.0
0.564MetTyr: 0.564 ± 0.447
0.0MetXaa: 0.0 ± 0.0
Asn
2.258AsnAla: 2.258 ± 1.263
1.129AsnCys: 1.129 ± 0.492
2.822AsnAsp: 2.822 ± 1.631
2.258AsnGlu: 2.258 ± 0.766
1.411AsnPhe: 1.411 ± 1.245
2.258AsnGly: 2.258 ± 0.92
1.129AsnHis: 1.129 ± 0.304
3.105AsnIle: 3.105 ± 0.934
4.516AsnLys: 4.516 ± 1.341
5.645AsnLeu: 5.645 ± 1.335
1.976AsnMet: 1.976 ± 0.456
0.847AsnAsn: 0.847 ± 0.398
1.411AsnPro: 1.411 ± 0.923
1.411AsnGln: 1.411 ± 0.688
1.976AsnArg: 1.976 ± 0.528
4.234AsnSer: 4.234 ± 3.812
1.411AsnThr: 1.411 ± 0.595
2.54AsnVal: 2.54 ± 0.557
0.847AsnTrp: 0.847 ± 0.198
0.564AsnTyr: 0.564 ± 0.296
0.0AsnXaa: 0.0 ± 0.0
Pro
2.822ProAla: 2.822 ± 0.661
0.847ProCys: 0.847 ± 0.625
2.54ProAsp: 2.54 ± 0.938
3.105ProGlu: 3.105 ± 1.323
1.693ProPhe: 1.693 ± 0.898
1.693ProGly: 1.693 ± 0.325
0.282ProHis: 0.282 ± 0.244
1.693ProIle: 1.693 ± 0.864
3.387ProLys: 3.387 ± 0.746
3.387ProLeu: 3.387 ± 2.888
1.411ProMet: 1.411 ± 1.449
0.564ProAsn: 0.564 ± 0.246
1.976ProPro: 1.976 ± 1.568
0.847ProGln: 0.847 ± 0.625
1.411ProArg: 1.411 ± 0.694
3.387ProSer: 3.387 ± 2.237
1.976ProThr: 1.976 ± 0.821
1.129ProVal: 1.129 ± 0.587
0.564ProTrp: 0.564 ± 0.489
1.129ProTyr: 1.129 ± 0.894
0.0ProXaa: 0.0 ± 0.0
Gln
0.847GlnAla: 0.847 ± 0.457
0.564GlnCys: 0.564 ± 0.296
0.564GlnAsp: 0.564 ± 0.264
1.976GlnGlu: 1.976 ± 0.822
0.564GlnPhe: 0.564 ± 0.447
0.847GlnGly: 0.847 ± 0.198
0.564GlnHis: 0.564 ± 0.448
0.847GlnIle: 0.847 ± 0.457
2.54GlnLys: 2.54 ± 0.801
1.129GlnLeu: 1.129 ± 0.894
0.564GlnMet: 0.564 ± 0.296
1.693GlnAsn: 1.693 ± 0.898
0.282GlnPro: 0.282 ± 0.223
0.282GlnGln: 0.282 ± 0.244
1.693GlnArg: 1.693 ± 0.481
1.129GlnSer: 1.129 ± 0.497
0.564GlnThr: 0.564 ± 0.489
1.411GlnVal: 1.411 ± 0.504
0.282GlnTrp: 0.282 ± 0.45
0.282GlnTyr: 0.282 ± 0.45
0.0GlnXaa: 0.0 ± 0.0
Arg
3.387ArgAla: 3.387 ± 0.744
0.282ArgCys: 0.282 ± 0.223
3.387ArgAsp: 3.387 ± 1.162
7.621ArgGlu: 7.621 ± 2.471
1.411ArgPhe: 1.411 ± 0.965
4.516ArgGly: 4.516 ± 1.312
1.411ArgHis: 1.411 ± 0.385
2.822ArgIle: 2.822 ± 0.793
4.798ArgLys: 4.798 ± 1.995
5.363ArgLeu: 5.363 ± 1.064
1.411ArgMet: 1.411 ± 0.491
1.976ArgAsn: 1.976 ± 0.382
1.693ArgPro: 1.693 ± 0.894
0.847ArgGln: 0.847 ± 0.198
4.798ArgArg: 4.798 ± 1.993
2.822ArgSer: 2.822 ± 0.342
3.387ArgThr: 3.387 ± 1.143
4.234ArgVal: 4.234 ± 1.097
2.54ArgTrp: 2.54 ± 0.645
1.411ArgTyr: 1.411 ± 0.853
0.0ArgXaa: 0.0 ± 0.0
Ser
4.798SerAla: 4.798 ± 0.964
1.411SerCys: 1.411 ± 0.553
3.105SerAsp: 3.105 ± 1.273
4.516SerGlu: 4.516 ± 1.015
6.774SerPhe: 6.774 ± 5.773
7.338SerGly: 7.338 ± 0.533
1.693SerHis: 1.693 ± 0.652
6.492SerIle: 6.492 ± 2.674
5.645SerLys: 5.645 ± 1.512
9.032SerLeu: 9.032 ± 5.05
4.234SerMet: 4.234 ± 1.078
4.234SerAsn: 4.234 ± 2.015
3.669SerPro: 3.669 ± 2.761
0.282SerGln: 0.282 ± 0.244
2.822SerArg: 2.822 ± 0.784
10.725SerSer: 10.725 ± 7.99
2.258SerThr: 2.258 ± 0.407
5.645SerVal: 5.645 ± 1.492
1.129SerTrp: 1.129 ± 0.492
1.976SerTyr: 1.976 ± 1.059
0.0SerXaa: 0.0 ± 0.0
Thr
1.411ThrAla: 1.411 ± 0.385
0.847ThrCys: 0.847 ± 0.399
1.976ThrAsp: 1.976 ± 0.55
3.105ThrGlu: 3.105 ± 0.942
1.411ThrPhe: 1.411 ± 0.958
1.693ThrGly: 1.693 ± 0.57
0.847ThrHis: 0.847 ± 0.652
3.669ThrIle: 3.669 ± 0.459
2.54ThrLys: 2.54 ± 1.164
2.822ThrLeu: 2.822 ± 0.869
2.258ThrMet: 2.258 ± 0.798
2.822ThrAsn: 2.822 ± 1.225
2.822ThrPro: 2.822 ± 1.373
0.847ThrGln: 0.847 ± 0.423
2.54ThrArg: 2.54 ± 1.03
4.234ThrSer: 4.234 ± 1.248
2.822ThrThr: 2.822 ± 0.647
2.258ThrVal: 2.258 ± 0.846
1.129ThrTrp: 1.129 ± 0.304
1.976ThrTyr: 1.976 ± 0.541
0.0ThrXaa: 0.0 ± 0.0
Val
3.669ValAla: 3.669 ± 0.355
0.847ValCys: 0.847 ± 0.457
4.798ValAsp: 4.798 ± 1.648
4.516ValGlu: 4.516 ± 1.196
1.129ValPhe: 1.129 ± 0.993
4.798ValGly: 4.798 ± 1.767
0.847ValHis: 0.847 ± 0.67
4.234ValIle: 4.234 ± 1.133
3.669ValLys: 3.669 ± 1.354
6.492ValLeu: 6.492 ± 1.759
1.411ValMet: 1.411 ± 0.491
1.976ValAsn: 1.976 ± 1.073
2.54ValPro: 2.54 ± 0.269
1.129ValGln: 1.129 ± 0.623
3.669ValArg: 3.669 ± 0.476
4.516ValSer: 4.516 ± 1.826
3.105ValThr: 3.105 ± 0.914
3.669ValVal: 3.669 ± 1.118
1.129ValTrp: 1.129 ± 0.465
3.105ValTyr: 3.105 ± 0.929
0.0ValXaa: 0.0 ± 0.0
Trp
0.847TrpAla: 0.847 ± 0.45
0.847TrpCys: 0.847 ± 0.457
0.0TrpAsp: 0.0 ± 0.0
1.411TrpGlu: 1.411 ± 0.835
0.564TrpPhe: 0.564 ± 0.526
1.129TrpGly: 1.129 ± 0.383
0.282TrpHis: 0.282 ± 0.244
1.411TrpIle: 1.411 ± 1.117
2.54TrpLys: 2.54 ± 0.624
2.822TrpLeu: 2.822 ± 0.767
0.847TrpMet: 0.847 ± 0.457
1.129TrpAsn: 1.129 ± 0.747
0.847TrpPro: 0.847 ± 0.398
0.564TrpGln: 0.564 ± 0.489
0.847TrpArg: 0.847 ± 0.423
1.976TrpSer: 1.976 ± 1.052
0.564TrpThr: 0.564 ± 0.246
0.847TrpVal: 0.847 ± 0.423
0.847TrpTrp: 0.847 ± 0.504
0.564TrpTyr: 0.564 ± 0.447
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.847TyrAla: 0.847 ± 0.423
0.282TyrCys: 0.282 ± 0.244
2.258TyrAsp: 2.258 ± 0.983
2.258TyrGlu: 2.258 ± 1.171
0.282TyrPhe: 0.282 ± 0.244
4.798TyrGly: 4.798 ± 1.34
0.564TyrHis: 0.564 ± 0.264
0.847TyrIle: 0.847 ± 0.389
1.129TyrLys: 1.129 ± 0.623
2.258TyrLeu: 2.258 ± 0.489
0.564TyrMet: 0.564 ± 0.447
1.976TyrAsn: 1.976 ± 0.541
1.976TyrPro: 1.976 ± 1.16
0.847TyrGln: 0.847 ± 0.423
1.976TyrArg: 1.976 ± 0.651
1.976TyrSer: 1.976 ± 1.012
1.129TyrThr: 1.129 ± 0.736
2.258TyrVal: 2.258 ± 0.483
0.847TyrTrp: 0.847 ± 0.457
1.129TyrTyr: 1.129 ± 0.383
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3544 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski