Amino acid dipeptide frequency for Sugarcane bacilliform Guadeloupe A virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
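The counting and the comparison against the uniform expectation can be sketched as follows. This is a minimal illustration, not the database's actual pipeline: the function names, the one-letter alphabet, the toy sequences, and the choice to count pairs within each protein separately are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (assumed alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences, alphabet=STANDARD):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: pairs are counted inside each protein; none spans two proteins.
        for first, second in zip(seq, seq[1:]):
            # Any residue outside the alphabet is folded into X (Xaa).
            a = first if first in alphabet else "X"
            b = second if second in alphabet else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (all sequences shorter than 2)")
    letters = alphabet + "X"
    # Report all 21 x 21 = 441 combinations, including those never observed.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(letters, repeat=2)}


def classify(freq, expected=1000.0 / 400):
    """Label a per mille frequency relative to the uniform expectation (2.5)."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


# Made-up toy sequences, not taken from the proteome.
toy = ["MKKLEE", "EEKLA"]
freqs = dipeptide_per_mille(toy)
labels = {pair: classify(value) for pair, value in freqs.items()}
```

With real proteome sequences, `labels` would correspond to the red ("over") and blue ("under") marking described above.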

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
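As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the GluGlu value is the one listed in the table.

```python
def per_mille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert per mille to percent (divide by 10)."""
    return value / 10.0


glu_glu = 15.737  # GluGlu entry from the table, in per mille
fraction = per_mille_to_fraction(glu_glu)  # roughly 0.0157
percent = per_mille_to_percent(glu_glu)    # roughly 1.57
```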

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.861 ± 1.381
AlaCys: 0.954 ± 0.46
AlaAsp: 3.338 ± 1.195
AlaGlu: 3.815 ± 1.841
AlaPhe: 1.431 ± 0.69
AlaGly: 2.861 ± 1.381
AlaHis: 1.907 ± 3.022
AlaIle: 6.676 ± 2.391
AlaLys: 2.384 ± 1.038
AlaLeu: 4.769 ± 2.513
AlaMet: 1.431 ± 0.69
AlaAsn: 0.954 ± 0.46
AlaPro: 2.384 ± 1.151
AlaGln: 3.338 ± 0.897
AlaArg: 3.815 ± 1.841
AlaSer: 3.815 ± 2.344
AlaThr: 2.384 ± 2.848
AlaVal: 2.384 ± 1.151
AlaTrp: 0.477 ± 0.23
AlaTyr: 2.384 ± 1.151
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.477 ± 0.23
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.907 ± 0.92
CysPhe: 0.954 ± 0.46
CysGly: 1.907 ± 0.92
CysHis: 0.0 ± 0.0
CysIle: 0.477 ± 1.644
CysLys: 1.907 ± 0.92
CysLeu: 0.954 ± 1.511
CysMet: 0.477 ± 0.23
CysAsn: 0.477 ± 0.23
CysPro: 0.0 ± 0.0
CysGln: 0.477 ± 0.23
CysArg: 2.861 ± 1.381
CysSer: 0.477 ± 0.23
CysThr: 0.477 ± 0.23
CysVal: 0.477 ± 0.23
CysTrp: 0.0 ± 0.0
CysTyr: 0.477 ± 0.23
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.907 ± 0.92
AspCys: 0.477 ± 0.23
AspAsp: 3.338 ± 1.611
AspGlu: 5.246 ± 2.531
AspPhe: 2.384 ± 1.151
AspGly: 3.338 ± 1.611
AspHis: 0.477 ± 0.23
AspIle: 3.815 ± 1.251
AspLys: 1.907 ± 1.261
AspLeu: 3.815 ± 6.044
AspMet: 0.954 ± 0.46
AspAsn: 2.384 ± 1.151
AspPro: 1.907 ± 1.261
AspGln: 2.384 ± 1.038
AspArg: 1.431 ± 0.69
AspSer: 4.769 ± 2.392
AspThr: 3.338 ± 1.195
AspVal: 1.907 ± 1.172
AspTrp: 2.384 ± 1.196
AspTyr: 1.907 ± 0.92
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.199 ± 1.826
GluCys: 1.431 ± 1.332
GluAsp: 5.246 ± 1.236
GluGlu: 15.737 ± 3.685
GluPhe: 3.815 ± 0.909
GluGly: 1.431 ± 1.362
GluHis: 1.907 ± 0.92
GluIle: 5.246 ± 1.642
GluLys: 7.153 ± 1.987
GluLeu: 10.491 ± 7.338
GluMet: 1.907 ± 0.92
GluAsn: 2.861 ± 0.943
GluPro: 2.861 ± 1.381
GluGln: 2.861 ± 4.533
GluArg: 5.722 ± 1.405
GluSer: 4.292 ± 1.028
GluThr: 3.338 ± 1.611
GluVal: 8.584 ± 1.117
GluTrp: 1.431 ± 0.69
GluTyr: 3.338 ± 0.897
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.954 ± 0.46
PheCys: 0.954 ± 0.46
PheAsp: 1.907 ± 0.92
PheGlu: 3.815 ± 2.344
PhePhe: 0.954 ± 0.46
PheGly: 0.954 ± 0.46
PheHis: 1.431 ± 0.69
PheIle: 2.861 ± 0.943
PheLys: 1.907 ± 0.92
PheLeu: 2.861 ± 1.702
PheMet: 0.477 ± 0.23
PheAsn: 0.954 ± 0.46
PhePro: 1.431 ± 0.69
PheGln: 2.384 ± 2.848
PheArg: 1.431 ± 0.69
PheSer: 0.477 ± 0.23
PheThr: 3.338 ± 1.611
PheVal: 0.954 ± 0.46
PheTrp: 0.477 ± 0.23
PheTyr: 2.384 ± 1.151
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.907 ± 0.92
GlyCys: 0.954 ± 0.46
GlyAsp: 3.338 ± 1.195
GlyGlu: 5.722 ± 2.761
GlyPhe: 2.861 ± 1.381
GlyGly: 0.954 ± 0.46
GlyHis: 0.954 ± 0.46
GlyIle: 2.384 ± 1.196
GlyLys: 6.676 ± 1.897
GlyLeu: 4.292 ± 2.071
GlyMet: 1.431 ± 0.811
GlyAsn: 1.907 ± 0.92
GlyPro: 0.477 ± 0.23
GlyGln: 0.477 ± 0.23
GlyArg: 1.431 ± 0.69
GlySer: 2.384 ± 4.622
GlyThr: 5.246 ± 1.642
GlyVal: 2.861 ± 1.381
GlyTrp: 0.954 ± 0.46
GlyTyr: 0.954 ± 0.46
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.907 ± 1.172
HisPhe: 0.954 ± 0.46
HisGly: 0.477 ± 0.23
HisHis: 0.477 ± 0.23
HisIle: 0.954 ± 0.46
HisLys: 1.431 ± 0.69
HisLeu: 2.384 ± 1.151
HisMet: 0.954 ± 0.46
HisAsn: 0.477 ± 1.702
HisPro: 0.477 ± 0.23
HisGln: 1.907 ± 0.92
HisArg: 1.907 ± 0.92
HisSer: 0.0 ± 0.0
HisThr: 0.954 ± 1.511
HisVal: 3.815 ± 1.841
HisTrp: 0.477 ± 0.23
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.815 ± 1.841
IleCys: 1.431 ± 0.69
IleAsp: 5.246 ± 1.642
IleGlu: 5.246 ± 0.598
IlePhe: 1.907 ± 1.261
IleGly: 3.338 ± 1.611
IleHis: 0.954 ± 1.511
IleIle: 7.153 ± 1.791
IleLys: 3.815 ± 1.26
IleLeu: 3.815 ± 1.26
IleMet: 3.338 ± 1.611
IleAsn: 3.338 ± 1.611
IlePro: 1.907 ± 0.92
IleGln: 4.292 ± 4.086
IleArg: 4.769 ± 1.09
IleSer: 4.292 ± 4.087
IleThr: 3.815 ± 1.251
IleVal: 4.769 ± 0.809
IleTrp: 0.0 ± 0.0
IleTyr: 3.815 ± 1.841
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.384 ± 1.151
LysCys: 1.431 ± 0.69
LysAsp: 4.769 ± 2.587
LysGlu: 10.491 ± 2.683
LysPhe: 4.292 ± 2.202
LysGly: 3.338 ± 1.195
LysHis: 4.769 ± 2.301
LysIle: 7.63 ± 0.682
LysLys: 6.676 ± 0.322
LysLeu: 4.292 ± 1.028
LysMet: 2.861 ± 0.898
LysAsn: 3.338 ± 0.897
LysPro: 1.907 ± 0.92
LysGln: 2.384 ± 1.038
LysArg: 4.292 ± 3.997
LysSer: 5.722 ± 2.347
LysThr: 4.292 ± 2.447
LysVal: 5.246 ± 3.632
LysTrp: 0.954 ± 0.46
LysTyr: 1.431 ± 0.69
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.246 ± 3.632
LeuCys: 0.954 ± 1.492
LeuAsp: 4.292 ± 2.202
LeuGlu: 7.153 ± 1.987
LeuPhe: 0.0 ± 0.0
LeuGly: 4.292 ± 2.071
LeuHis: 0.477 ± 0.23
LeuIle: 3.338 ± 1.611
LeuLys: 9.537 ± 4.654
LeuLeu: 2.861 ± 2.665
LeuMet: 1.431 ± 1.332
LeuAsn: 5.246 ± 2.291
LeuPro: 2.861 ± 1.381
LeuGln: 4.769 ± 7.555
LeuArg: 3.338 ± 1.476
LeuSer: 6.199 ± 1.826
LeuThr: 4.292 ± 7.507
LeuVal: 6.676 ± 1.784
LeuTrp: 0.954 ± 1.511
LeuTyr: 2.384 ± 1.151
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.384 ± 1.151
MetCys: 0.477 ± 0.23
MetAsp: 2.861 ± 1.381
MetGlu: 1.907 ± 0.92
MetPhe: 0.0 ± 0.0
MetGly: 0.954 ± 1.511
MetHis: 0.477 ± 0.23
MetIle: 2.384 ± 1.151
MetLys: 3.815 ± 1.841
MetLeu: 1.431 ± 0.69
MetMet: 0.954 ± 0.46
MetAsn: 2.861 ± 1.381
MetPro: 1.431 ± 0.69
MetGln: 0.477 ± 0.23
MetArg: 0.954 ± 0.46
MetSer: 0.477 ± 1.644
MetThr: 2.384 ± 1.151
MetVal: 1.431 ± 1.332
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.338 ± 1.611
AsnCys: 0.477 ± 0.23
AsnAsp: 2.861 ± 1.381
AsnGlu: 1.907 ± 0.92
AsnPhe: 0.477 ± 0.23
AsnGly: 2.861 ± 1.381
AsnHis: 0.954 ± 0.46
AsnIle: 3.338 ± 1.195
AsnLys: 2.384 ± 2.84
AsnLeu: 5.246 ± 4.168
AsnMet: 0.954 ± 0.46
AsnAsn: 0.954 ± 0.46
AsnPro: 1.907 ± 0.92
AsnGln: 0.954 ± 0.46
AsnArg: 3.338 ± 1.476
AsnSer: 3.815 ± 1.251
AsnThr: 5.246 ± 4.212
AsnVal: 2.384 ± 1.151
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.861 ± 1.381
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.861 ± 1.381
ProCys: 0.0 ± 0.0
ProAsp: 1.431 ± 0.69
ProGlu: 2.384 ± 1.151
ProPhe: 0.954 ± 0.46
ProGly: 1.431 ± 1.362
ProHis: 0.954 ± 0.46
ProIle: 1.431 ± 0.69
ProLys: 1.907 ± 2.158
ProLeu: 2.861 ± 0.943
ProMet: 0.954 ± 0.46
ProAsn: 2.861 ± 1.381
ProPro: 2.384 ± 1.151
ProGln: 0.954 ± 0.46
ProArg: 3.815 ± 1.841
ProSer: 2.861 ± 1.381
ProThr: 1.907 ± 0.92
ProVal: 1.907 ± 0.92
ProTrp: 0.954 ± 0.46
ProTyr: 0.477 ± 1.644
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.246 ± 2.291
GlnCys: 0.0 ± 0.0
GlnAsp: 1.907 ± 1.261
GlnGlu: 3.338 ± 5.037
GlnPhe: 0.954 ± 0.46
GlnGly: 2.861 ± 1.381
GlnHis: 1.907 ± 0.92
GlnIle: 3.338 ± 1.476
GlnLys: 5.722 ± 3.954
GlnLeu: 3.338 ± 4.965
GlnMet: 0.0 ± 0.0
GlnAsn: 2.384 ± 3.64
GlnPro: 3.338 ± 1.476
GlnGln: 3.338 ± 4.338
GlnArg: 3.338 ± 4.349
GlnSer: 1.907 ± 0.92
GlnThr: 1.431 ± 1.332
GlnVal: 2.861 ± 1.381
GlnTrp: 0.477 ± 0.23
GlnTyr: 1.431 ± 0.69
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.907 ± 1.172
ArgCys: 0.954 ± 0.46
ArgAsp: 1.907 ± 1.172
ArgGlu: 2.861 ± 0.943
ArgPhe: 0.477 ± 0.23
ArgGly: 1.907 ± 0.92
ArgHis: 0.0 ± 0.0
ArgIle: 5.722 ± 3.516
ArgLys: 4.292 ± 0.977
ArgLeu: 5.246 ± 1.97
ArgMet: 2.861 ± 1.381
ArgAsn: 3.815 ± 1.26
ArgPro: 2.861 ± 0.943
ArgGln: 4.292 ± 1.028
ArgArg: 2.861 ± 0.943
ArgSer: 5.246 ± 1.97
ArgThr: 3.815 ± 1.841
ArgVal: 5.246 ± 1.642
ArgTrp: 1.907 ± 0.92
ArgTyr: 2.384 ± 1.151
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.861 ± 1.173
SerCys: 1.907 ± 0.92
SerAsp: 2.384 ± 1.93
SerGlu: 7.153 ± 5.228
SerPhe: 3.815 ± 1.26
SerGly: 3.338 ± 4.338
SerHis: 0.477 ± 0.23
SerIle: 3.815 ± 1.26
SerLys: 5.246 ± 0.598
SerLeu: 3.338 ± 0.897
SerMet: 0.954 ± 0.46
SerAsn: 3.338 ± 3.182
SerPro: 2.384 ± 1.196
SerGln: 5.722 ± 3.516
SerArg: 4.769 ± 2.301
SerSer: 4.292 ± 2.781
SerThr: 3.815 ± 2.522
SerVal: 2.861 ± 2.724
SerTrp: 0.477 ± 0.23
SerTyr: 1.431 ± 0.69
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.861 ± 1.173
ThrCys: 0.477 ± 0.23
ThrAsp: 2.384 ± 1.151
ThrGlu: 5.722 ± 1.885
ThrPhe: 1.431 ± 1.332
ThrGly: 5.722 ± 2.221
ThrHis: 0.0 ± 0.0
ThrIle: 3.338 ± 1.195
ThrLys: 3.815 ± 1.251
ThrLeu: 4.292 ± 1.361
ThrMet: 1.907 ± 0.92
ThrAsn: 1.907 ± 1.261
ThrPro: 1.907 ± 0.92
ThrGln: 2.861 ± 3.388
ThrArg: 3.815 ± 1.26
ThrSer: 6.199 ± 3.705
ThrThr: 5.722 ± 0.409
ThrVal: 4.769 ± 1.491
ThrTrp: 0.954 ± 1.511
ThrTyr: 1.431 ± 1.332
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.861 ± 1.173
ValCys: 1.907 ± 0.92
ValAsp: 1.431 ± 0.69
ValGlu: 5.722 ± 1.885
ValPhe: 3.815 ± 1.26
ValGly: 4.769 ± 2.301
ValHis: 1.431 ± 0.69
ValIle: 2.861 ± 1.173
ValLys: 6.676 ± 2.182
ValLeu: 5.246 ± 0.598
ValMet: 0.954 ± 0.629
ValAsn: 3.815 ± 1.841
ValPro: 1.907 ± 0.92
ValGln: 3.338 ± 1.476
ValArg: 2.861 ± 2.665
ValSer: 3.815 ± 1.251
ValThr: 4.292 ± 2.071
ValVal: 3.338 ± 0.897
ValTrp: 0.954 ± 0.46
ValTyr: 1.431 ± 1.362
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.907 ± 1.172
TrpCys: 0.0 ± 0.0
TrpAsp: 0.954 ± 1.511
TrpGlu: 1.431 ± 1.362
TrpPhe: 0.477 ± 0.23
TrpGly: 0.477 ± 0.23
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.907 ± 0.92
TrpLeu: 1.431 ± 0.69
TrpMet: 0.477 ± 0.23
TrpAsn: 0.477 ± 0.23
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.954 ± 0.46
TrpSer: 0.954 ± 0.46
TrpThr: 0.954 ± 0.46
TrpVal: 0.954 ± 0.46
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.477 ± 0.23
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.384 ± 1.151
TyrCys: 0.0 ± 0.0
TyrAsp: 0.477 ± 0.23
TyrGlu: 2.384 ± 1.151
TyrPhe: 0.954 ± 0.46
TyrGly: 0.954 ± 0.46
TyrHis: 0.0 ± 0.0
TyrIle: 4.292 ± 2.071
TyrLys: 3.338 ± 1.611
TyrLeu: 3.338 ± 0.897
TyrMet: 1.907 ± 0.92
TyrAsn: 1.907 ± 1.261
TyrPro: 0.954 ± 0.46
TyrGln: 1.907 ± 1.261
TyrArg: 2.861 ± 0.943
TyrSer: 2.384 ± 1.151
TyrThr: 0.477 ± 0.23
TyrVal: 0.477 ± 0.23
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.431 ± 0.69
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2098 amino acids)
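Each entry in the table above contains a regular `FirstSecond: value ± error` pattern, so it can be read programmatically. The following is a minimal sketch using a regular expression; the function name and the sample lines are illustrative only, and the layout of the downloadable CSV file is not assumed here.

```python
import re

# Two three-letter residue codes, then "value ± error" (both in per mille).
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)"
)


def parse_entry(line):
    """Return (first, second, value, error), or None for non-entry lines."""
    match = ENTRY.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


# A few lines copied from the Glu row, including the bare row label.
lines = ["Glu", "GluGlu: 15.737 ± 3.685", "GluXaa: 0.0 ± 0.0"]
rows = [entry for entry in map(parse_entry, lines) if entry is not None]

# The most frequent dipeptide among the parsed rows.
top = max(rows, key=lambda row: row[2])
```

Row labels such as `Glu` do not match the pattern and are skipped, so the same function can be mapped over the whole listing.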

Note: The error has been estimated by bootstrapping (×100) at the protein level.
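Protein-level bootstrapping can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is taken as the error. This is an illustration under stated assumptions, not the database's code: the function names, the fixed seed, the use of the population standard deviation as the error measure, and the toy sequences are all choices made for the example.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the frequency over `rounds` protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the error is the standard deviation of the estimates.
    return statistics.pstdev(estimates)


# Made-up toy proteome, not the real sequences.
toy = ["MKKLEE", "EEKLA", "AAAA"]
error = bootstrap_error(toy, "EE")
```

With only 3 proteins, each resample can differ strongly from the original set, which would be consistent with some errors in the table exceeding the value they accompany.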

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski