Amino acid dipeptide frequency for Tomato leaf curl Madagascar virus-Menabe [Madagascar:Morondova:2001]

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and underrepresented ones are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
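The calculation described above can be sketched roughly as follows. This is a minimal illustration, not the code used to build this table: the function name `dipeptide_permille`, the one-letter input alphabet, and the choice to count overlapping pairs within each protein separately are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is
# mapped to X (Xaa), giving 21 * 21 = 441 possible dipeptides.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Expected value if all 400 standard dipeptides were equally likely:
# 1/400 = 0.25% = 2.5 per mille.
EXPECTED_UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein, so no
    dipeptide spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Normalise case and replace non-standard residues with X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_permille(["MKKLA", "AKKB"])
over = sorted(dp for dp, f in freqs.items() if f > EXPECTED_UNIFORM_PERMILLE)
```

With so few residues every observed dipeptide exceeds 2.5‰, so `over` simply lists all seven pairs; in a real proteome the same comparison would separate the over- and underrepresented dipeptides.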

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.45AlaAla: 5.45 ± 2.935
0.908AlaCys: 0.908 ± 0.817
0.908AlaAsp: 0.908 ± 1.061
0.908AlaGlu: 0.908 ± 0.83
0.0AlaPhe: 0.0 ± 0.0
0.908AlaGly: 0.908 ± 0.675
1.817AlaHis: 1.817 ± 0.911
1.817AlaIle: 1.817 ± 1.001
3.633AlaLys: 3.633 ± 1.121
4.541AlaLeu: 4.541 ± 1.623
0.908AlaMet: 0.908 ± 0.652
1.817AlaAsn: 1.817 ± 1.35
6.358AlaPro: 6.358 ± 1.191
2.725AlaGln: 2.725 ± 1.054
6.358AlaArg: 6.358 ± 1.885
1.817AlaSer: 1.817 ± 1.123
5.45AlaThr: 5.45 ± 2.831
5.45AlaVal: 5.45 ± 2.512
0.908AlaTrp: 0.908 ± 0.675
0.908AlaTyr: 0.908 ± 0.675
0.0AlaXaa: 0.0 ± 0.0
Cys
0.908CysAla: 0.908 ± 0.675
1.817CysCys: 1.817 ± 1.66
0.0CysAsp: 0.0 ± 0.0
0.908CysGlu: 0.908 ± 0.817
0.908CysPhe: 0.908 ± 0.863
1.817CysGly: 1.817 ± 1.001
0.0CysHis: 0.0 ± 0.0
0.908CysIle: 0.908 ± 0.817
0.908CysLys: 0.908 ± 0.817
0.908CysLeu: 0.908 ± 1.061
0.908CysMet: 0.908 ± 0.83
0.908CysAsn: 0.908 ± 0.675
1.817CysPro: 1.817 ± 1.66
0.0CysGln: 0.0 ± 0.0
0.908CysArg: 0.908 ± 0.675
4.541CysSer: 4.541 ± 3.339
1.817CysThr: 1.817 ± 0.658
0.908CysVal: 0.908 ± 0.817
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.633AspAla: 3.633 ± 1.644
1.817AspCys: 1.817 ± 1.684
2.725AspAsp: 2.725 ± 0.861
1.817AspGlu: 1.817 ± 0.658
1.817AspPhe: 1.817 ± 0.658
1.817AspGly: 1.817 ± 1.35
2.725AspHis: 2.725 ± 1.915
1.817AspIle: 1.817 ± 1.086
2.725AspLys: 2.725 ± 1.722
8.174AspLeu: 8.174 ± 2.413
0.0AspMet: 0.0 ± 0.0
0.908AspAsn: 0.908 ± 0.817
1.817AspPro: 1.817 ± 0.911
0.0AspGln: 0.0 ± 0.0
3.633AspArg: 3.633 ± 1.294
5.45AspSer: 5.45 ± 1.093
0.908AspThr: 0.908 ± 0.675
6.358AspVal: 6.358 ± 1.561
1.817AspTrp: 1.817 ± 1.001
0.908AspTyr: 0.908 ± 0.83
0.0AspXaa: 0.0 ± 0.0
Glu
2.725GluAla: 2.725 ± 0.791
0.0GluCys: 0.0 ± 0.0
0.908GluAsp: 0.908 ± 0.863
6.358GluGlu: 6.358 ± 3.9
3.633GluPhe: 3.633 ± 1.962
5.45GluGly: 5.45 ± 0.953
0.0GluHis: 0.0 ± 0.0
2.725GluIle: 2.725 ± 2.59
0.0GluLys: 0.0 ± 0.0
6.358GluLeu: 6.358 ± 2.156
0.0GluMet: 0.0 ± 0.0
6.358GluAsn: 6.358 ± 1.823
5.45GluPro: 5.45 ± 1.454
1.817GluGln: 1.817 ± 1.633
0.908GluArg: 0.908 ± 0.675
0.908GluSer: 0.908 ± 1.061
2.725GluThr: 2.725 ± 1.36
0.0GluVal: 0.0 ± 0.0
2.725GluTrp: 2.725 ± 1.486
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.908PheCys: 0.908 ± 0.817
2.725PheAsp: 2.725 ± 1.321
1.817PheGlu: 1.817 ± 0.658
1.817PhePhe: 1.817 ± 0.658
0.908PheGly: 0.908 ± 0.817
4.541PheHis: 4.541 ± 1.0
1.817PheIle: 1.817 ± 0.924
3.633PheLys: 3.633 ± 1.453
7.266PheLeu: 7.266 ± 2.176
0.908PheMet: 0.908 ± 0.675
4.541PheAsn: 4.541 ± 2.324
0.908PhePro: 0.908 ± 0.83
1.817PheGln: 1.817 ± 1.35
2.725PheArg: 2.725 ± 1.67
0.908PheSer: 0.908 ± 0.842
2.725PheThr: 2.725 ± 1.178
1.817PheVal: 1.817 ± 1.35
0.0PheTrp: 0.0 ± 0.0
1.817PheTyr: 1.817 ± 1.633
0.0PheXaa: 0.0 ± 0.0
Gly
1.817GlyAla: 1.817 ± 1.35
1.817GlyCys: 1.817 ± 1.067
4.541GlyAsp: 4.541 ± 1.68
4.541GlyGlu: 4.541 ± 2.144
1.817GlyPhe: 1.817 ± 1.123
2.725GlyGly: 2.725 ± 1.054
1.817GlyHis: 1.817 ± 0.911
4.541GlyIle: 4.541 ± 1.317
4.541GlyLys: 4.541 ± 1.623
0.908GlyLeu: 0.908 ± 0.842
0.908GlyMet: 0.908 ± 0.817
1.817GlyAsn: 1.817 ± 2.121
4.541GlyPro: 4.541 ± 1.623
2.725GlyGln: 2.725 ± 0.995
1.817GlyArg: 1.817 ± 0.924
3.633GlySer: 3.633 ± 1.325
2.725GlyThr: 2.725 ± 2.526
1.817GlyVal: 1.817 ± 1.727
0.0GlyTrp: 0.0 ± 0.0
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.817HisAla: 1.817 ± 1.083
2.725HisCys: 2.725 ± 1.803
2.725HisAsp: 2.725 ± 1.371
4.541HisGlu: 4.541 ± 1.943
2.725HisPhe: 2.725 ± 1.372
1.817HisGly: 1.817 ± 1.123
1.817HisHis: 1.817 ± 1.684
1.817HisIle: 1.817 ± 1.136
1.817HisLys: 1.817 ± 1.148
1.817HisLeu: 1.817 ± 1.35
0.0HisMet: 0.0 ± 0.0
2.725HisAsn: 2.725 ± 1.369
1.817HisPro: 1.817 ± 1.001
2.725HisGln: 2.725 ± 1.74
2.725HisArg: 2.725 ± 1.74
0.0HisSer: 0.0 ± 0.0
3.633HisThr: 3.633 ± 2.095
3.633HisVal: 3.633 ± 0.821
0.0HisTrp: 0.0 ± 0.0
1.817HisTyr: 1.817 ± 0.658
0.0HisXaa: 0.0 ± 0.0
Ile
2.725IleAla: 2.725 ± 1.703
1.817IleCys: 1.817 ± 1.35
3.633IleAsp: 3.633 ± 2.079
1.817IleGlu: 1.817 ± 1.35
1.817IlePhe: 1.817 ± 1.35
0.0IleGly: 0.0 ± 0.0
0.908IleHis: 0.908 ± 0.863
2.725IleIle: 2.725 ± 1.716
7.266IleLys: 7.266 ± 1.705
0.908IleLeu: 0.908 ± 0.817
0.908IleMet: 0.908 ± 0.742
3.633IleAsn: 3.633 ± 2.584
0.908IlePro: 0.908 ± 0.675
7.266IleGln: 7.266 ± 2.997
8.174IleArg: 8.174 ± 3.619
4.541IleSer: 4.541 ± 2.553
2.725IleThr: 2.725 ± 1.784
0.908IleVal: 0.908 ± 0.675
0.908IleTrp: 0.908 ± 0.863
3.633IleTyr: 3.633 ± 1.776
0.0IleXaa: 0.0 ± 0.0
Lys
2.725LysAla: 2.725 ± 1.787
1.817LysCys: 1.817 ± 0.924
1.817LysAsp: 1.817 ± 1.35
5.45LysGlu: 5.45 ± 2.079
1.817LysPhe: 1.817 ± 1.086
2.725LysGly: 2.725 ± 0.861
1.817LysHis: 1.817 ± 0.658
3.633LysIle: 3.633 ± 1.121
1.817LysLys: 1.817 ± 1.067
1.817LysLeu: 1.817 ± 1.065
0.0LysMet: 0.0 ± 0.0
2.725LysAsn: 2.725 ± 1.054
2.725LysPro: 2.725 ± 0.791
3.633LysGln: 3.633 ± 0.894
4.541LysArg: 4.541 ± 2.722
3.633LysSer: 3.633 ± 1.644
0.908LysThr: 0.908 ± 0.675
6.358LysVal: 6.358 ± 1.919
0.0LysTrp: 0.0 ± 0.0
3.633LysTyr: 3.633 ± 0.995
0.0LysXaa: 0.0 ± 0.0
Leu
1.817LeuAla: 1.817 ± 0.911
1.817LeuCys: 1.817 ± 1.35
5.45LeuAsp: 5.45 ± 2.375
3.633LeuGlu: 3.633 ± 1.461
2.725LeuPhe: 2.725 ± 1.486
5.45LeuGly: 5.45 ± 1.537
3.633LeuHis: 3.633 ± 2.079
3.633LeuIle: 3.633 ± 1.657
3.633LeuLys: 3.633 ± 1.121
4.541LeuLeu: 4.541 ± 2.773
0.908LeuMet: 0.908 ± 1.061
5.45LeuAsn: 5.45 ± 1.071
0.908LeuPro: 0.908 ± 0.83
6.358LeuGln: 6.358 ± 0.776
9.083LeuArg: 9.083 ± 2.931
3.633LeuSer: 3.633 ± 1.972
5.45LeuThr: 5.45 ± 2.833
3.633LeuVal: 3.633 ± 1.294
0.0LeuTrp: 0.0 ± 0.0
2.725LeuTyr: 2.725 ± 1.784
0.0LeuXaa: 0.0 ± 0.0
Met
2.725MetAla: 2.725 ± 0.995
0.908MetCys: 0.908 ± 1.061
4.541MetAsp: 4.541 ± 1.253
0.0MetGlu: 0.0 ± 0.0
2.725MetPhe: 2.725 ± 1.703
1.817MetGly: 1.817 ± 1.065
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.908MetLys: 0.908 ± 0.817
0.908MetLeu: 0.908 ± 0.83
0.0MetMet: 0.0 ± 0.0
0.908MetAsn: 0.908 ± 0.863
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
0.908MetArg: 0.908 ± 0.842
0.908MetSer: 0.908 ± 0.817
0.908MetThr: 0.908 ± 1.061
0.0MetVal: 0.0 ± 0.0
1.817MetTrp: 1.817 ± 0.911
2.725MetTyr: 2.725 ± 2.45
0.0MetXaa: 0.0 ± 0.0
Asn
3.633AsnAla: 3.633 ± 1.644
0.0AsnCys: 0.0 ± 0.0
2.725AsnAsp: 2.725 ± 1.054
1.817AsnGlu: 1.817 ± 1.083
2.725AsnPhe: 2.725 ± 1.479
2.725AsnGly: 2.725 ± 1.24
8.174AsnHis: 8.174 ± 2.763
3.633AsnIle: 3.633 ± 1.501
1.817AsnLys: 1.817 ± 1.001
3.633AsnLeu: 3.633 ± 1.453
3.633AsnMet: 3.633 ± 1.764
3.633AsnAsn: 3.633 ± 1.958
3.633AsnPro: 3.633 ± 1.077
3.633AsnGln: 3.633 ± 1.972
1.817AsnArg: 1.817 ± 1.633
2.725AsnSer: 2.725 ± 1.434
3.633AsnThr: 3.633 ± 1.698
3.633AsnVal: 3.633 ± 0.995
0.0AsnTrp: 0.0 ± 0.0
1.817AsnTyr: 1.817 ± 1.35
0.0AsnXaa: 0.0 ± 0.0
Pro
3.633ProAla: 3.633 ± 1.235
1.817ProCys: 1.817 ± 1.083
2.725ProAsp: 2.725 ± 1.228
0.908ProGlu: 0.908 ± 0.675
1.817ProPhe: 1.817 ± 0.924
2.725ProGly: 2.725 ± 0.995
3.633ProHis: 3.633 ± 1.962
4.541ProIle: 4.541 ± 1.074
4.541ProLys: 4.541 ± 1.505
4.541ProLeu: 4.541 ± 1.433
2.725ProMet: 2.725 ± 2.04
2.725ProAsn: 2.725 ± 1.372
0.908ProPro: 0.908 ± 0.675
5.45ProGln: 5.45 ± 2.849
3.633ProArg: 3.633 ± 1.806
5.45ProSer: 5.45 ± 2.661
4.541ProThr: 4.541 ± 2.575
2.725ProVal: 2.725 ± 1.321
0.908ProTrp: 0.908 ± 0.675
2.725ProTyr: 2.725 ± 1.054
0.0ProXaa: 0.0 ± 0.0
Gln
4.541GlnAla: 4.541 ± 2.242
0.908GlnCys: 0.908 ± 0.675
0.908GlnAsp: 0.908 ± 0.83
2.725GlnGlu: 2.725 ± 0.861
1.817GlnPhe: 1.817 ± 1.35
3.633GlnGly: 3.633 ± 1.377
3.633GlnHis: 3.633 ± 2.423
3.633GlnIle: 3.633 ± 1.95
0.908GlnLys: 0.908 ± 0.83
1.817GlnLeu: 1.817 ± 1.136
0.908GlnMet: 0.908 ± 0.842
2.725GlnAsn: 2.725 ± 1.253
6.358GlnPro: 6.358 ± 3.305
0.908GlnGln: 0.908 ± 0.675
3.633GlnArg: 3.633 ± 1.106
4.541GlnSer: 4.541 ± 1.639
1.817GlnThr: 1.817 ± 0.924
5.45GlnVal: 5.45 ± 1.773
0.908GlnTrp: 0.908 ± 0.675
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
1.817ArgAla: 1.817 ± 1.148
1.817ArgCys: 1.817 ± 1.083
6.358ArgAsp: 6.358 ± 2.534
1.817ArgGlu: 1.817 ± 1.065
5.45ArgPhe: 5.45 ± 1.546
2.725ArgGly: 2.725 ± 0.861
1.817ArgHis: 1.817 ± 1.123
5.45ArgIle: 5.45 ± 1.277
3.633ArgLys: 3.633 ± 1.78
4.541ArgLeu: 4.541 ± 2.093
1.817ArgMet: 1.817 ± 1.633
2.725ArgAsn: 2.725 ± 1.066
7.266ArgPro: 7.266 ± 1.878
0.908ArgGln: 0.908 ± 0.83
7.266ArgArg: 7.266 ± 3.516
4.541ArgSer: 4.541 ± 1.623
4.541ArgThr: 4.541 ± 1.98
5.45ArgVal: 5.45 ± 1.792
0.0ArgTrp: 0.0 ± 0.0
1.817ArgTyr: 1.817 ± 1.148
0.0ArgXaa: 0.0 ± 0.0
Ser
5.45SerAla: 5.45 ± 2.103
0.0SerCys: 0.0 ± 0.0
2.725SerAsp: 2.725 ± 0.861
2.725SerGlu: 2.725 ± 2.025
2.725SerPhe: 2.725 ± 0.861
3.633SerGly: 3.633 ± 2.449
1.817SerHis: 1.817 ± 1.201
3.633SerIle: 3.633 ± 1.077
4.541SerLys: 4.541 ± 1.591
3.633SerLeu: 3.633 ± 1.972
0.908SerMet: 0.908 ± 1.061
6.358SerAsn: 6.358 ± 1.782
7.266SerPro: 7.266 ± 1.414
4.541SerGln: 4.541 ± 2.323
1.817SerArg: 1.817 ± 0.924
9.083SerSer: 9.083 ± 3.162
5.45SerThr: 5.45 ± 3.266
2.725SerVal: 2.725 ± 2.49
0.908SerTrp: 0.908 ± 0.817
2.725SerTyr: 2.725 ± 1.486
0.0SerXaa: 0.0 ± 0.0
Thr
3.633ThrAla: 3.633 ± 0.946
0.0ThrCys: 0.0 ± 0.0
0.0ThrAsp: 0.0 ± 0.0
2.725ThrGlu: 2.725 ± 1.264
1.817ThrPhe: 1.817 ± 1.065
4.541ThrGly: 4.541 ± 1.593
3.633ThrHis: 3.633 ± 2.133
3.633ThrIle: 3.633 ± 1.499
1.817ThrLys: 1.817 ± 1.35
6.358ThrLeu: 6.358 ± 1.53
0.908ThrMet: 0.908 ± 0.675
4.541ThrAsn: 4.541 ± 2.892
4.541ThrPro: 4.541 ± 1.893
0.908ThrGln: 0.908 ± 1.061
3.633ThrArg: 3.633 ± 1.235
6.358ThrSer: 6.358 ± 2.842
0.908ThrThr: 0.908 ± 0.863
2.725ThrVal: 2.725 ± 1.716
0.908ThrTrp: 0.908 ± 1.061
2.725ThrTyr: 2.725 ± 0.791
0.0ThrXaa: 0.0 ± 0.0
Val
0.908ValAla: 0.908 ± 0.817
0.0ValCys: 0.0 ± 0.0
2.725ValAsp: 2.725 ± 1.178
1.817ValGlu: 1.817 ± 1.66
2.725ValPhe: 2.725 ± 1.784
1.817ValGly: 1.817 ± 1.633
0.908ValHis: 0.908 ± 0.83
5.45ValIle: 5.45 ± 2.124
4.541ValLys: 4.541 ± 2.054
6.358ValLeu: 6.358 ± 3.053
2.725ValMet: 2.725 ± 0.791
1.817ValAsn: 1.817 ± 0.924
4.541ValPro: 4.541 ± 1.3
3.633ValGln: 3.633 ± 1.805
1.817ValArg: 1.817 ± 1.633
6.358ValSer: 6.358 ± 1.993
2.725ValThr: 2.725 ± 1.321
1.817ValVal: 1.817 ± 1.633
1.817ValTrp: 1.817 ± 1.086
3.633ValTyr: 3.633 ± 1.347
0.0ValXaa: 0.0 ± 0.0
Trp
1.817TrpAla: 1.817 ± 1.35
0.0TrpCys: 0.0 ± 0.0
0.908TrpAsp: 0.908 ± 0.83
0.908TrpGlu: 0.908 ± 0.863
0.0TrpPhe: 0.0 ± 0.0
0.908TrpGly: 0.908 ± 0.675
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.908TrpMet: 0.908 ± 0.817
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.908TrpGln: 0.908 ± 0.675
1.817TrpArg: 1.817 ± 1.001
0.908TrpSer: 0.908 ± 0.842
1.817TrpThr: 1.817 ± 1.086
0.908TrpVal: 0.908 ± 0.675
0.0TrpTrp: 0.0 ± 0.0
1.817TrpTyr: 1.817 ± 1.065
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.817TyrAla: 1.817 ± 0.658
0.0TyrCys: 0.0 ± 0.0
1.817TyrAsp: 1.817 ± 1.086
1.817TyrGlu: 1.817 ± 1.083
2.725TyrPhe: 2.725 ± 0.773
0.908TyrGly: 0.908 ± 0.675
0.0TyrHis: 0.0 ± 0.0
1.817TyrIle: 1.817 ± 0.658
0.908TyrLys: 0.908 ± 0.675
5.45TyrLeu: 5.45 ± 1.926
1.817TyrMet: 1.817 ± 1.038
2.725TyrAsn: 2.725 ± 0.791
1.817TyrPro: 1.817 ± 1.065
1.817TyrGln: 1.817 ± 0.911
4.541TyrArg: 4.541 ± 4.083
2.725TyrSer: 2.725 ± 1.054
0.908TyrThr: 0.908 ± 0.817
1.817TyrVal: 1.817 ± 0.911
0.0TyrTrp: 0.0 ± 0.0
0.908TyrTyr: 0.908 ± 0.842
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1102 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
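One plausible reading of "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the dipeptide frequency for each resampled set, and report the spread of those values. The sketch below follows that reading; the exact procedure used for this table is not described here, so the helper names, the fixed seed, and the use of the sample standard deviation as the error are assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Per-mille frequency of each overlapping dipeptide (within proteins)."""
    counts = Counter(
        seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1)
    )
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling proteins.

    Each replicate draws len(proteins) proteins with replacement, so whole
    proteins (not single residues) are the resampling unit.
    """
    rng = random.Random(seed)  # fixed seed only to keep the example repeatable
    values = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        values.append(dipeptide_permille(sample).get(dipeptide, 0.0))
    return statistics.stdev(values)


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKKLA", "AKKLV", "MSSPR"]
kk_error = bootstrap_error(toy, "KK")
```

A dipeptide that never occurs in any protein gets an error of exactly 0 under this scheme, which matches the `0.0 ± 0.0` entries in the table.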

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski