Amino acid dipeptide frequency for Banana streak GF virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
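As a rough illustration of how such a frequency can be calculated, here is a minimal Python sketch. It assumes that overlapping pairs are counted within each protein and normalised by the total number of pairs; the source does not state the exact counting scheme, so the function name `dipeptide_permille` and the toy sequences are purely hypothetical.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption (not confirmed by the source): overlapping pairs are counted
    within each protein and divided by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Any letter outside the 20 standard amino acids is treated as Xaa.
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


# Toy input for illustration only, not the real proteome.
freqs = dipeptide_permille(["MKVLA", "AAX"])
print(len(freqs), round(freqs["AlaAla"], 3))
```

Returning all 441 keys, including those with zero counts, mirrors the layout of the table, where absent dipeptides are listed as 0.0.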

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
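The arithmetic behind the expected value, and the over/under labelling, can be sketched in Python as follows. This assumes that the colouring threshold is simply the uniform expectation of 1/400; the source does not spell out the exact rule, and the name `classify` is hypothetical.

```python
N_STANDARD = 20

# Uniform expectation over 20 x 20 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / N_STANDARD**2


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a per-mille frequency relative to the uniform expectation."""
    if value_permille > expected:
        return "over"    # assumed to correspond to red in the table
    if value_permille < expected:
        return "under"   # assumed to correspond to blue in the table
    return "expected"


# Two values copied from the table: GluGlu and AlaCys.
for name, value in [("GluGlu", 12.276), ("AlaCys", 0.944)]:
    print(name, value, classify(value))
```

If Xaa were included as a 21st residue, the same calculation would use 441 pairs, giving an expectation of roughly 2.27‰ instead of 2.5‰.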

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.249 ± 1.178
AlaCys: 0.944 ± 0.426
AlaAsp: 2.361 ± 0.819
AlaGlu: 3.777 ± 1.704
AlaPhe: 2.361 ± 1.065
AlaGly: 0.944 ± 0.426
AlaHis: 1.416 ± 1.061
AlaIle: 6.138 ± 2.77
AlaLys: 0.944 ± 1.221
AlaLeu: 7.082 ± 3.037
AlaMet: 2.833 ± 1.278
AlaAsn: 2.361 ± 1.065
AlaPro: 0.944 ± 0.426
AlaGln: 1.416 ± 0.639
AlaArg: 3.777 ± 1.704
AlaSer: 2.833 ± 1.317
AlaThr: 2.361 ± 1.426
AlaVal: 3.305 ± 1.491
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.305 ± 3.974
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.889 ± 0.852
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.472 ± 0.213
CysPhe: 1.416 ± 0.639
CysGly: 2.361 ± 1.065
CysHis: 0.472 ± 0.213
CysIle: 0.472 ± 0.213
CysLys: 1.889 ± 0.852
CysLeu: 0.944 ± 1.221
CysMet: 0.472 ± 0.213
CysAsn: 0.944 ± 0.426
CysPro: 0.0 ± 0.0
CysGln: 1.889 ± 0.852
CysArg: 0.472 ± 0.213
CysSer: 0.944 ± 0.426
CysThr: 0.944 ± 0.426
CysVal: 0.472 ± 0.213
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.889 ± 0.852
AspCys: 1.416 ± 0.639
AspAsp: 6.138 ± 2.77
AspGlu: 4.721 ± 2.131
AspPhe: 1.889 ± 0.852
AspGly: 2.833 ± 1.278
AspHis: 0.472 ± 1.395
AspIle: 3.777 ± 1.188
AspLys: 4.249 ± 1.406
AspLeu: 8.499 ± 7.185
AspMet: 1.889 ± 0.799
AspAsn: 3.305 ± 1.236
AspPro: 1.416 ± 0.639
AspGln: 2.361 ± 1.065
AspArg: 2.361 ± 1.426
AspSer: 3.777 ± 1.704
AspThr: 1.416 ± 0.639
AspVal: 1.889 ± 0.924
AspTrp: 0.472 ± 0.213
AspTyr: 4.249 ± 1.732
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.194 ± 3.212
GluCys: 0.472 ± 0.213
GluAsp: 7.082 ± 0.429
GluGlu: 12.276 ± 2.627
GluPhe: 2.833 ± 0.76
GluGly: 2.833 ± 1.278
GluHis: 1.889 ± 0.852
GluIle: 7.082 ± 1.487
GluLys: 7.082 ± 0.429
GluLeu: 9.443 ± 2.076
GluMet: 0.944 ± 0.426
GluAsn: 4.249 ± 0.92
GluPro: 2.833 ± 0.76
GluGln: 3.777 ± 4.886
GluArg: 2.833 ± 1.278
GluSer: 7.082 ± 0.429
GluThr: 2.833 ± 1.278
GluVal: 5.666 ± 1.362
GluTrp: 1.889 ± 0.852
GluTyr: 2.361 ± 1.065
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.889 ± 0.924
PheCys: 0.944 ± 0.426
PheAsp: 2.361 ± 1.426
PheGlu: 2.361 ± 0.819
PhePhe: 0.944 ± 0.426
PheGly: 0.944 ± 0.426
PheHis: 1.889 ± 0.852
PheIle: 3.777 ± 0.816
PheLys: 1.889 ± 1.556
PheLeu: 1.889 ± 0.924
PheMet: 0.944 ± 0.426
PheAsn: 0.944 ± 0.426
PhePro: 0.472 ± 0.213
PheGln: 2.833 ± 1.278
PheArg: 1.889 ± 0.852
PheSer: 0.472 ± 0.213
PheThr: 1.889 ± 1.556
PheVal: 0.472 ± 0.213
PheTrp: 0.472 ± 0.213
PheTyr: 1.889 ± 0.852
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.305 ± 1.236
GlyCys: 0.944 ± 0.426
GlyAsp: 2.361 ± 1.065
GlyGlu: 4.721 ± 2.131
GlyPhe: 2.361 ± 1.426
GlyGly: 1.416 ± 0.639
GlyHis: 0.472 ± 0.213
GlyIle: 3.305 ± 1.236
GlyLys: 5.194 ± 1.008
GlyLeu: 3.305 ± 1.491
GlyMet: 1.416 ± 0.676
GlyAsn: 2.361 ± 0.819
GlyPro: 0.472 ± 0.213
GlyGln: 0.944 ± 0.426
GlyArg: 2.833 ± 1.278
GlySer: 2.361 ± 1.065
GlyThr: 3.777 ± 1.704
GlyVal: 2.361 ± 1.065
GlyTrp: 2.361 ± 1.426
GlyTyr: 0.944 ± 0.426
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 1.416 ± 0.639
HisAsp: 0.944 ± 0.426
HisGlu: 0.944 ± 0.426
HisPhe: 0.0 ± 0.0
HisGly: 0.472 ± 0.213
HisHis: 0.944 ± 1.221
HisIle: 2.361 ± 1.065
HisLys: 1.416 ± 0.639
HisLeu: 6.138 ± 1.905
HisMet: 0.0 ± 0.0
HisAsn: 0.472 ± 1.395
HisPro: 0.944 ± 0.426
HisGln: 2.361 ± 1.065
HisArg: 2.833 ± 1.278
HisSer: 0.944 ± 0.426
HisThr: 1.416 ± 1.061
HisVal: 1.416 ± 0.639
HisTrp: 0.472 ± 0.213
HisTyr: 1.416 ± 0.639
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.361 ± 1.065
IleCys: 2.833 ± 1.278
IleAsp: 5.666 ± 2.557
IleGlu: 4.721 ± 1.205
IlePhe: 2.361 ± 1.065
IleGly: 5.194 ± 1.008
IleHis: 1.889 ± 0.924
IleIle: 4.721 ± 2.131
IleLys: 6.138 ± 0.647
IleLeu: 8.499 ± 0.875
IleMet: 1.416 ± 0.639
IleAsn: 3.305 ± 1.491
IlePro: 3.777 ± 1.704
IleGln: 2.833 ± 1.317
IleArg: 1.889 ± 0.852
IleSer: 5.194 ± 1.008
IleThr: 4.721 ± 1.205
IleVal: 2.361 ± 0.819
IleTrp: 0.0 ± 0.0
IleTyr: 3.777 ± 1.704
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.721 ± 1.057
LysCys: 2.361 ± 1.065
LysAsp: 3.777 ± 0.816
LysGlu: 8.499 ± 7.176
LysPhe: 3.777 ± 0.816
LysGly: 4.721 ± 2.851
LysHis: 2.833 ± 1.278
LysIle: 5.666 ± 2.115
LysLys: 3.777 ± 0.816
LysLeu: 5.194 ± 3.845
LysMet: 1.416 ± 0.639
LysAsn: 3.305 ± 1.236
LysPro: 1.889 ± 0.852
LysGln: 3.777 ± 3.338
LysArg: 2.833 ± 0.76
LysSer: 1.889 ± 0.852
LysThr: 2.833 ± 2.025
LysVal: 8.026 ± 5.177
LysTrp: 0.472 ± 0.213
LysTyr: 1.889 ± 0.852
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.138 ± 1.505
LeuCys: 0.944 ± 0.426
LeuAsp: 5.666 ± 2.115
LeuGlu: 6.61 ± 1.763
LeuPhe: 1.416 ± 1.061
LeuGly: 5.666 ± 1.521
LeuHis: 1.889 ± 0.924
LeuIle: 6.138 ± 2.77
LeuLys: 6.61 ± 3.237
LeuLeu: 8.026 ± 1.075
LeuMet: 0.944 ± 0.426
LeuAsn: 5.666 ± 0.82
LeuPro: 4.721 ± 2.131
LeuGln: 8.499 ± 6.988
LeuArg: 9.443 ± 6.921
LeuSer: 8.499 ± 6.988
LeuThr: 2.833 ± 2.025
LeuVal: 5.666 ± 1.39
LeuTrp: 0.472 ± 0.213
LeuTyr: 1.889 ± 1.556
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.472 ± 0.213
MetCys: 0.472 ± 0.213
MetAsp: 2.361 ± 1.065
MetGlu: 1.416 ± 0.639
MetPhe: 0.0 ± 0.0
MetGly: 0.472 ± 0.213
MetHis: 0.472 ± 0.213
MetIle: 0.472 ± 0.213
MetLys: 3.305 ± 1.491
MetLeu: 1.416 ± 0.639
MetMet: 0.944 ± 0.426
MetAsn: 2.361 ± 2.233
MetPro: 1.889 ± 0.852
MetGln: 0.472 ± 0.213
MetArg: 0.0 ± 0.0
MetSer: 0.472 ± 0.213
MetThr: 4.249 ± 1.917
MetVal: 0.944 ± 0.426
MetTrp: 0.472 ± 0.213
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.889 ± 0.852
AsnCys: 0.944 ± 1.221
AsnAsp: 1.416 ± 1.061
AsnGlu: 3.777 ± 1.704
AsnPhe: 1.889 ± 0.852
AsnGly: 3.305 ± 1.236
AsnHis: 1.416 ± 0.639
AsnIle: 1.889 ± 0.852
AsnLys: 2.833 ± 0.76
AsnLeu: 6.138 ± 1.905
AsnMet: 0.472 ± 0.213
AsnAsn: 1.416 ± 0.639
AsnPro: 3.777 ± 1.188
AsnGln: 2.361 ± 0.819
AsnArg: 0.472 ± 0.213
AsnSer: 3.777 ± 2.96
AsnThr: 4.721 ± 1.205
AsnVal: 0.944 ± 0.426
AsnTrp: 0.472 ± 1.395
AsnTyr: 2.361 ± 1.065
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.361 ± 1.065
ProCys: 0.0 ± 0.0
ProAsp: 2.361 ± 3.563
ProGlu: 5.666 ± 2.557
ProPhe: 0.944 ± 1.863
ProGly: 2.361 ± 1.065
ProHis: 1.416 ± 0.639
ProIle: 2.833 ± 0.76
ProLys: 2.361 ± 0.819
ProLeu: 3.305 ± 0.759
ProMet: 0.0 ± 0.0
ProAsn: 0.472 ± 0.213
ProPro: 1.889 ± 0.852
ProGln: 1.416 ± 0.639
ProArg: 3.305 ± 1.491
ProSer: 4.721 ± 2.131
ProThr: 1.416 ± 0.639
ProVal: 1.416 ± 0.639
ProTrp: 0.472 ± 0.213
ProTyr: 0.472 ± 0.213
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.361 ± 1.065
GlnCys: 0.0 ± 0.0
GlnAsp: 2.833 ± 2.123
GlnGlu: 7.082 ± 4.606
GlnPhe: 1.889 ± 0.924
GlnGly: 1.416 ± 0.639
GlnHis: 1.889 ± 0.852
GlnIle: 3.305 ± 1.491
GlnLys: 2.361 ± 1.065
GlnLeu: 6.138 ± 5.971
GlnMet: 1.416 ± 1.118
GlnAsn: 2.361 ± 2.278
GlnPro: 2.361 ± 0.819
GlnGln: 2.833 ± 0.76
GlnArg: 3.777 ± 0.816
GlnSer: 1.416 ± 1.703
GlnThr: 1.889 ± 0.924
GlnVal: 2.361 ± 1.065
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.944 ± 0.426
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.305 ± 1.817
ArgCys: 0.472 ± 0.213
ArgAsp: 2.361 ± 1.065
ArgGlu: 2.361 ± 1.065
ArgPhe: 1.889 ± 0.852
ArgGly: 1.889 ± 1.556
ArgHis: 2.361 ± 1.065
ArgIle: 3.305 ± 1.817
ArgLys: 4.721 ± 1.205
ArgLeu: 4.721 ± 2.131
ArgMet: 2.361 ± 1.065
ArgAsn: 2.833 ± 2.123
ArgPro: 2.833 ± 0.76
ArgGln: 4.249 ± 2.748
ArgArg: 5.194 ± 1.216
ArgSer: 5.666 ± 2.557
ArgThr: 3.777 ± 0.816
ArgVal: 2.833 ± 0.76
ArgTrp: 2.833 ± 1.278
ArgTyr: 1.416 ± 0.639
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.889 ± 0.852
SerCys: 0.0 ± 0.0
SerAsp: 3.305 ± 0.759
SerGlu: 7.082 ± 0.429
SerPhe: 0.472 ± 0.213
SerGly: 3.777 ± 1.704
SerHis: 1.889 ± 0.852
SerIle: 4.721 ± 2.131
SerLys: 8.499 ± 5.496
SerLeu: 4.721 ± 1.205
SerMet: 0.944 ± 0.426
SerAsn: 1.889 ± 0.852
SerPro: 2.361 ± 1.426
SerGln: 1.416 ± 0.639
SerArg: 6.138 ± 0.647
SerSer: 4.721 ± 2.537
SerThr: 5.666 ± 2.635
SerVal: 1.416 ± 0.639
SerTrp: 1.416 ± 2.613
SerTyr: 1.416 ± 0.639
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.889 ± 0.924
ThrCys: 0.472 ± 0.213
ThrAsp: 3.305 ± 1.491
ThrGlu: 6.61 ± 3.634
ThrPhe: 0.944 ± 0.426
ThrGly: 2.833 ± 1.278
ThrHis: 0.944 ± 0.426
ThrIle: 5.194 ± 3.212
ThrLys: 3.777 ± 3.78
ThrLeu: 3.777 ± 0.816
ThrMet: 1.416 ± 0.639
ThrAsn: 3.305 ± 1.236
ThrPro: 3.305 ± 1.491
ThrGln: 0.944 ± 0.426
ThrArg: 2.833 ± 1.278
ThrSer: 4.721 ± 1.205
ThrThr: 3.777 ± 1.188
ThrVal: 2.833 ± 1.317
ThrTrp: 0.944 ± 0.426
ThrTyr: 0.944 ± 0.426
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.833 ± 1.317
ValCys: 1.416 ± 0.639
ValAsp: 1.889 ± 2.442
ValGlu: 4.249 ± 2.748
ValPhe: 1.889 ± 1.556
ValGly: 2.361 ± 1.065
ValHis: 1.416 ± 1.703
ValIle: 4.249 ± 0.92
ValLys: 1.889 ± 0.852
ValLeu: 4.721 ± 1.205
ValMet: 0.944 ± 0.426
ValAsn: 3.305 ± 1.491
ValPro: 1.889 ± 0.852
ValGln: 2.361 ± 1.065
ValArg: 2.833 ± 0.76
ValSer: 1.416 ± 0.639
ValThr: 2.833 ± 1.278
ValVal: 1.416 ± 0.639
ValTrp: 0.472 ± 0.213
ValTyr: 1.889 ± 0.852
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.361 ± 1.065
TrpCys: 0.0 ± 0.0
TrpAsp: 0.944 ± 0.426
TrpGlu: 0.944 ± 0.426
TrpPhe: 0.472 ± 0.213
TrpGly: 0.472 ± 0.213
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.889 ± 0.924
TrpLeu: 0.944 ± 0.426
TrpMet: 0.472 ± 0.213
TrpAsn: 0.944 ± 0.426
TrpPro: 0.472 ± 2.032
TrpGln: 0.472 ± 1.395
TrpArg: 1.889 ± 0.924
TrpSer: 0.0 ± 0.0
TrpThr: 0.944 ± 0.426
TrpVal: 0.472 ± 0.213
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.472 ± 0.213
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.361 ± 1.065
TyrCys: 0.0 ± 0.0
TyrAsp: 1.889 ± 0.852
TyrGlu: 2.361 ± 1.065
TyrPhe: 1.416 ± 0.639
TyrGly: 1.416 ± 0.639
TyrHis: 0.944 ± 0.426
TyrIle: 4.249 ± 1.917
TyrLys: 2.833 ± 3.406
TyrLeu: 2.833 ± 2.123
TyrMet: 0.944 ± 0.426
TyrAsn: 0.472 ± 0.213
TyrPro: 0.944 ± 0.426
TyrGln: 1.416 ± 0.639
TyrArg: 3.777 ± 0.816
TyrSer: 2.833 ± 1.278
TyrThr: 0.472 ± 2.032
TyrVal: 0.0 ± 0.0
TyrTrp: 0.472 ± 0.213
TyrTyr: 1.889 ± 0.852
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2119 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
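A protein-level bootstrap of this kind might look like the following Python sketch. It assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the resampled frequencies; the source confirms neither detail, and the helper names `pair_permille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per-mille frequency of one two-letter pair (e.g. "AA") in the proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein (an assumption).
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = [
        # Draw as many proteins as in the original set, with replacement.
        pair_permille(rng.choices(proteins, k=len(proteins)), pair)
        for _ in range(n_resamples)
    ]
    return statistics.pstdev(estimates)


# Toy sequences for illustration only, not the real proteome.
toy = ["MAAKL", "AAAA", "MKLV"]
print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only three proteins, as here, each resample can differ strongly from the original set, which would be consistent with some errors in the table exceeding the values themselves.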

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski