Amino acid dipeptide frequency for Banana streak UA virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
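The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by Proteome-pI: the function names, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# One-letter codes of the 20 standard amino acids, in the table's column order.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Assumption: overlapping pairs are counted within one protein only.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_per_mille, expected=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input for illustration only, not the Banana streak UA virus proteome.
toy = ["MAAEEK", "MKAXE"]
freqs = dipeptide_per_mille(toy)
print(len(freqs), round(freqs["AA"], 1), classify(freqs["AA"]))
```

In this sketch "over" and "under" play the role of the red and blue markings in the table.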

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
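As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10**-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


ala_ala = 5.039  # AlaAla frequency from the table, in per mille
uniform = 1000.0 / 400  # uniform expectation for 400 dipeptides, in per mille

print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(uniform))
```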

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.039 ± 1.9
AlaCys: 0.916 ± 0.438
AlaAsp: 1.832 ± 0.875
AlaGlu: 3.665 ± 0.847
AlaPhe: 1.374 ± 0.657
AlaGly: 2.29 ± 1.208
AlaHis: 1.832 ± 2.944
AlaIle: 6.871 ± 3.625
AlaLys: 2.29 ± 1.208
AlaLeu: 8.704 ± 5.436
AlaMet: 3.665 ± 1.751
AlaAsn: 0.916 ± 0.438
AlaPro: 2.29 ± 1.094
AlaGln: 1.374 ± 0.657
AlaArg: 3.207 ± 1.532
AlaSer: 3.207 ± 0.848
AlaThr: 2.29 ± 1.208
AlaVal: 3.665 ± 1.751
AlaTrp: 0.916 ± 0.438
AlaTyr: 1.832 ± 0.875
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.458 ± 0.219
CysCys: 0.0 ± 0.0
CysAsp: 0.458 ± 0.219
CysGlu: 0.458 ± 0.219
CysPhe: 1.374 ± 1.298
CysGly: 1.374 ± 0.657
CysHis: 0.916 ± 0.438
CysIle: 1.374 ± 0.657
CysLys: 2.749 ± 0.904
CysLeu: 0.458 ± 0.219
CysMet: 0.458 ± 0.219
CysAsn: 0.458 ± 0.219
CysPro: 0.0 ± 0.0
CysGln: 0.458 ± 0.219
CysArg: 2.29 ± 1.094
CysSer: 0.916 ± 0.438
CysThr: 0.458 ± 0.219
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.832 ± 0.875
AspCys: 0.916 ± 0.438
AspAsp: 3.207 ± 1.532
AspGlu: 3.207 ± 1.532
AspPhe: 2.749 ± 0.904
AspGly: 1.832 ± 0.875
AspHis: 0.916 ± 0.438
AspIle: 4.581 ± 2.782
AspLys: 1.374 ± 0.657
AspLeu: 4.581 ± 7.361
AspMet: 0.458 ± 0.219
AspAsn: 2.29 ± 1.094
AspPro: 1.832 ± 1.29
AspGln: 2.29 ± 1.094
AspArg: 2.749 ± 1.162
AspSer: 5.497 ± 2.184
AspThr: 4.123 ± 1.97
AspVal: 3.665 ± 4.065
AspTrp: 2.29 ± 1.094
AspTyr: 3.665 ± 2.281
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.787 ± 1.687
GluCys: 0.916 ± 0.438
GluAsp: 8.246 ± 1.801
GluGlu: 12.826 ± 4.56
GluPhe: 3.207 ± 0.848
GluGly: 3.665 ± 1.192
GluHis: 0.916 ± 0.438
GluIle: 4.123 ± 1.97
GluLys: 8.246 ± 2.712
GluLeu: 8.246 ± 1.801
GluMet: 1.374 ± 1.298
GluAsn: 3.207 ± 1.157
GluPro: 3.665 ± 1.751
GluGln: 4.123 ± 2.825
GluArg: 5.955 ± 1.739
GluSer: 4.581 ± 1.369
GluThr: 3.207 ± 1.532
GluVal: 7.329 ± 2.037
GluTrp: 0.916 ± 0.438
GluTyr: 2.749 ± 1.313
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.749 ± 2.597
PheCys: 0.458 ± 0.219
PheAsp: 2.749 ± 1.313
PheGlu: 3.665 ± 0.847
PhePhe: 1.832 ± 0.875
PheGly: 0.916 ± 1.537
PheHis: 1.832 ± 1.14
PheIle: 3.207 ± 0.848
PheLys: 1.374 ± 0.657
PheLeu: 1.832 ± 0.875
PheMet: 0.458 ± 0.219
PheAsn: 0.458 ± 0.219
PhePro: 0.916 ± 0.438
PheGln: 2.29 ± 1.208
PheArg: 1.832 ± 0.875
PheSer: 0.916 ± 0.438
PheThr: 2.749 ± 0.904
PheVal: 1.832 ± 1.29
PheTrp: 0.916 ± 1.472
PheTyr: 1.832 ± 0.875
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.29 ± 1.094
GlyCys: 0.916 ± 0.438
GlyAsp: 1.832 ± 0.875
GlyGlu: 6.413 ± 0.315
GlyPhe: 2.749 ± 1.162
GlyGly: 1.832 ± 0.875
GlyHis: 0.916 ± 0.438
GlyIle: 1.832 ± 0.875
GlyLys: 3.207 ± 1.532
GlyLeu: 4.123 ± 1.97
GlyMet: 1.374 ± 0.873
GlyAsn: 1.832 ± 1.14
GlyPro: 0.916 ± 0.438
GlyGln: 0.458 ± 0.219
GlyArg: 3.665 ± 1.192
GlySer: 2.29 ± 1.208
GlyThr: 5.497 ± 2.325
GlyVal: 3.665 ± 1.751
GlyTrp: 0.916 ± 0.438
GlyTyr: 1.832 ± 0.875
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.458 ± 0.219
HisCys: 0.458 ± 0.219
HisAsp: 0.458 ± 0.219
HisGlu: 2.29 ± 2.029
HisPhe: 0.458 ± 0.219
HisGly: 0.916 ± 0.438
HisHis: 1.374 ± 0.657
HisIle: 2.29 ± 1.094
HisLys: 1.374 ± 0.657
HisLeu: 2.29 ± 1.094
HisMet: 0.0 ± 0.0
HisAsn: 0.916 ± 1.472
HisPro: 1.374 ± 0.657
HisGln: 1.374 ± 0.657
HisArg: 2.29 ± 1.094
HisSer: 0.916 ± 1.472
HisThr: 1.374 ± 0.657
HisVal: 2.29 ± 1.094
HisTrp: 0.458 ± 0.219
HisTyr: 0.916 ± 0.438
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.207 ± 1.532
IleCys: 1.832 ± 0.875
IleAsp: 4.581 ± 2.011
IleGlu: 7.787 ± 1.687
IlePhe: 1.374 ± 0.657
IleGly: 2.749 ± 1.313
IleHis: 2.29 ± 1.094
IleIle: 5.497 ± 2.626
IleLys: 5.039 ± 1.498
IleLeu: 3.207 ± 1.532
IleMet: 1.832 ± 0.875
IleAsn: 3.665 ± 1.192
IlePro: 2.749 ± 1.313
IleGln: 5.039 ± 9.367
IleArg: 4.123 ± 0.901
IleSer: 5.039 ± 1.134
IleThr: 2.749 ± 0.904
IleVal: 4.581 ± 2.189
IleTrp: 0.0 ± 0.0
IleTyr: 2.749 ± 1.313
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.207 ± 2.434
LysCys: 1.374 ± 0.657
LysAsp: 5.039 ± 4.213
LysGlu: 8.246 ± 1.801
LysPhe: 4.123 ± 0.901
LysGly: 2.29 ± 1.094
LysHis: 2.749 ± 1.313
LysIle: 5.955 ± 2.309
LysLys: 5.955 ± 2.845
LysLeu: 6.413 ± 3.598
LysMet: 2.749 ± 1.313
LysAsn: 3.665 ± 1.751
LysPro: 1.832 ± 0.875
LysGln: 1.832 ± 5.683
LysArg: 3.665 ± 2.281
LysSer: 5.039 ± 3.842
LysThr: 3.665 ± 1.192
LysVal: 5.497 ± 2.421
LysTrp: 1.374 ± 0.657
LysTyr: 1.832 ± 0.875
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.581 ± 2.011
LeuCys: 1.374 ± 1.298
LeuAsp: 4.581 ± 3.731
LeuGlu: 7.787 ± 2.8
LeuPhe: 1.374 ± 1.298
LeuGly: 6.413 ± 1.696
LeuHis: 0.916 ± 1.537
LeuIle: 4.581 ± 1.369
LeuLys: 9.62 ± 3.906
LeuLeu: 5.955 ± 2.251
LeuMet: 1.374 ± 2.107
LeuAsn: 6.871 ± 3.396
LeuPro: 2.749 ± 1.313
LeuGln: 2.749 ± 1.813
LeuArg: 3.665 ± 3.041
LeuSer: 5.039 ± 1.134
LeuThr: 2.749 ± 2.804
LeuVal: 5.955 ± 1.739
LeuTrp: 0.458 ± 0.219
LeuTyr: 2.29 ± 1.094
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.374 ± 0.657
MetCys: 0.0 ± 0.0
MetAsp: 2.29 ± 1.094
MetGlu: 2.749 ± 1.313
MetPhe: 0.916 ± 0.438
MetGly: 0.916 ± 0.438
MetHis: 0.916 ± 0.438
MetIle: 2.29 ± 1.094
MetLys: 3.665 ± 1.751
MetLeu: 1.832 ± 1.14
MetMet: 0.458 ± 0.219
MetAsn: 0.916 ± 0.438
MetPro: 1.832 ± 0.875
MetGln: 0.0 ± 0.0
MetArg: 0.916 ± 1.472
MetSer: 0.458 ± 0.219
MetThr: 2.29 ± 2.029
MetVal: 0.916 ± 0.438
MetTrp: 0.0 ± 0.0
MetTyr: 0.458 ± 0.219
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.665 ± 1.751
AsnCys: 0.916 ± 1.472
AsnAsp: 1.374 ± 0.657
AsnGlu: 3.207 ± 1.532
AsnPhe: 0.458 ± 0.219
AsnGly: 2.749 ± 1.313
AsnHis: 0.0 ± 0.0
AsnIle: 4.123 ± 1.172
AsnLys: 0.916 ± 3.313
AsnLeu: 4.581 ± 2.611
AsnMet: 0.916 ± 0.438
AsnAsn: 0.916 ± 0.438
AsnPro: 3.207 ± 1.157
AsnGln: 3.665 ± 1.192
AsnArg: 1.374 ± 0.657
AsnSer: 2.29 ± 1.208
AsnThr: 3.665 ± 2.581
AsnVal: 0.916 ± 0.438
AsnTrp: 0.916 ± 0.438
AsnTyr: 4.581 ± 2.189
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.749 ± 1.162
ProCys: 0.0 ± 0.0
ProAsp: 2.29 ± 1.094
ProGlu: 3.665 ± 1.751
ProPhe: 0.916 ± 0.438
ProGly: 1.832 ± 0.875
ProHis: 1.832 ± 0.875
ProIle: 1.832 ± 0.875
ProLys: 5.039 ± 0.759
ProLeu: 2.29 ± 1.006
ProMet: 0.458 ± 0.219
ProAsn: 1.374 ± 0.657
ProPro: 1.832 ± 0.875
ProGln: 0.916 ± 0.438
ProArg: 4.581 ± 2.189
ProSer: 3.665 ± 1.192
ProThr: 1.374 ± 0.657
ProVal: 2.29 ± 1.094
ProTrp: 0.916 ± 0.438
ProTyr: 1.374 ± 1.402
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.581 ± 2.417
GlnCys: 0.0 ± 0.0
GlnAsp: 3.207 ± 3.358
GlnGlu: 4.123 ± 0.901
GlnPhe: 0.916 ± 1.472
GlnGly: 2.29 ± 1.208
GlnHis: 2.29 ± 1.094
GlnIle: 3.207 ± 0.848
GlnLys: 2.749 ± 2.804
GlnLeu: 4.123 ± 4.812
GlnMet: 0.458 ± 0.219
GlnAsn: 3.207 ± 1.598
GlnPro: 1.832 ± 1.14
GlnGln: 1.832 ± 1.29
GlnArg: 2.29 ± 1.006
GlnSer: 0.458 ± 0.219
GlnThr: 0.916 ± 2.68
GlnVal: 4.123 ± 2.97
GlnTrp: 0.916 ± 1.472
GlnTyr: 1.832 ± 1.29
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.29 ± 1.208
ArgCys: 1.832 ± 0.875
ArgAsp: 1.374 ± 0.657
ArgGlu: 5.039 ± 1.134
ArgPhe: 0.458 ± 0.219
ArgGly: 2.29 ± 1.006
ArgHis: 0.0 ± 0.0
ArgIle: 4.581 ± 1.0
ArgLys: 6.413 ± 3.142
ArgLeu: 5.497 ± 0.567
ArgMet: 3.207 ± 1.532
ArgAsn: 2.749 ± 1.313
ArgPro: 4.581 ± 0.963
ArgGln: 3.207 ± 0.848
ArgArg: 2.29 ± 2.029
ArgSer: 5.497 ± 0.567
ArgThr: 5.497 ± 1.291
ArgVal: 3.665 ± 1.384
ArgTrp: 1.374 ± 0.657
ArgTyr: 0.916 ± 0.438
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.832 ± 1.29
SerCys: 0.458 ± 0.219
SerAsp: 2.29 ± 3.756
SerGlu: 7.329 ± 2.352
SerPhe: 2.29 ± 1.006
SerGly: 3.665 ± 1.192
SerHis: 1.832 ± 0.875
SerIle: 2.749 ± 1.313
SerLys: 4.123 ± 4.629
SerLeu: 5.039 ± 1.9
SerMet: 0.916 ± 0.603
SerAsn: 1.832 ± 1.29
SerPro: 3.207 ± 1.532
SerGln: 6.413 ± 3.196
SerArg: 4.581 ± 1.0
SerSer: 3.665 ± 2.581
SerThr: 4.123 ± 1.172
SerVal: 3.207 ± 1.598
SerTrp: 0.0 ± 0.0
SerTyr: 0.916 ± 1.472
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.123 ± 1.264
ThrCys: 0.0 ± 0.0
ThrAsp: 3.665 ± 0.847
ThrGlu: 5.039 ± 2.397
ThrPhe: 3.207 ± 1.157
ThrGly: 3.665 ± 1.751
ThrHis: 0.0 ± 0.0
ThrIle: 4.581 ± 1.369
ThrLys: 3.207 ± 1.157
ThrLeu: 3.207 ± 1.532
ThrMet: 0.916 ± 0.438
ThrAsn: 2.749 ± 1.162
ThrPro: 2.29 ± 1.208
ThrGln: 2.29 ± 2.029
ThrArg: 4.581 ± 1.369
ThrSer: 5.039 ± 5.538
ThrThr: 3.207 ± 2.686
ThrVal: 2.29 ± 1.094
ThrTrp: 0.458 ± 0.219
ThrTyr: 0.458 ± 0.219
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.29 ± 1.006
ValCys: 1.832 ± 0.875
ValAsp: 1.832 ± 0.875
ValGlu: 5.039 ± 1.134
ValPhe: 3.665 ± 1.192
ValGly: 5.039 ± 1.498
ValHis: 2.29 ± 1.094
ValIle: 2.749 ± 0.904
ValLys: 4.123 ± 4.812
ValLeu: 4.581 ± 2.011
ValMet: 1.832 ± 0.875
ValAsn: 3.207 ± 1.532
ValPro: 2.29 ± 1.094
ValGln: 2.749 ± 0.904
ValArg: 4.581 ± 0.963
ValSer: 4.123 ± 2.139
ValThr: 3.665 ± 1.192
ValVal: 3.665 ± 0.847
ValTrp: 0.458 ± 0.219
ValTyr: 1.374 ± 0.657
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.832 ± 0.875
TrpCys: 0.0 ± 0.0
TrpAsp: 0.916 ± 1.472
TrpGlu: 0.916 ± 0.438
TrpPhe: 0.458 ± 0.219
TrpGly: 0.916 ± 0.438
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 2.29 ± 1.006
TrpLeu: 1.832 ± 0.875
TrpMet: 0.0 ± 0.0
TrpAsn: 0.458 ± 0.219
TrpPro: 0.0 ± 0.0
TrpGln: 0.458 ± 0.219
TrpArg: 0.916 ± 0.438
TrpSer: 0.458 ± 0.219
TrpThr: 0.916 ± 0.438
TrpVal: 0.916 ± 0.438
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.832 ± 0.875
TyrCys: 0.458 ± 0.219
TyrAsp: 1.832 ± 0.875
TyrGlu: 1.832 ± 0.875
TyrPhe: 0.916 ± 0.438
TyrGly: 0.916 ± 0.438
TyrHis: 0.458 ± 0.219
TyrIle: 3.665 ± 1.751
TyrLys: 2.749 ± 1.313
TyrLeu: 2.29 ± 2.767
TyrMet: 1.832 ± 0.875
TyrAsn: 2.749 ± 1.162
TyrPro: 1.832 ± 0.875
TyrGln: 1.832 ± 1.29
TyrArg: 3.207 ± 0.848
TyrSer: 1.832 ± 0.875
TyrThr: 0.458 ± 0.219
TyrVal: 0.916 ± 0.438
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.832 ± 0.875
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2184 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
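Protein-level bootstrapping can be sketched as follows. This is an illustration under stated assumptions, not the Proteome-pI implementation: the source only says that the error comes from 100 bootstrap replicates at the protein level, so resampling whole proteins with replacement and reporting the standard deviation of the resampled frequencies are both assumptions, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Spread of a dipeptide's per mille frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.pstdev(estimates)


# Toy input for illustration only, not the Banana streak UA virus proteome.
toy = ["MAAEEK", "MKAAE", "MEEKL"]
print(bootstrap_error(toy, "AA"))
```

With only a handful of proteins, each resample can differ a lot from the original set, which would be consistent with some errors in the table exceeding the values they accompany.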

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski