Amino acid dipeptide frequency for Banana streak UL virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly in proteins, with every amino acid equally likely, each of the 400 dipeptides would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
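The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build this page: the function names, the mapping of non-standard residues to `X`, and the comparison against the uniform 2.5‰ expectation are all assumptions made for the example.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")   # the 20 standard amino acids
UNIFORM_PERMILLE = 1000.0 / 400           # 2.5 per mille if all 400 pairs were equally likely


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Overlapping pairs are counted within each protein; pairs are never
    formed across protein boundaries (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(freq_permille):
    """Classify a frequency relative to the uniform expectation."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "expected"


# Toy input, not real viral sequences.
freqs = dipeptide_permille(["AAAC", "AC"])
```

For the toy input, the pairs are AA, AA, AC, and AC, so both dipeptides come out at 500‰ and are classified as overrepresented. Dipeptides that never occur are simply absent from the returned dictionary and correspond to the 0.0 cells in the table.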

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain a fraction (for example, 2.817‰ = 0.002817).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.817AlaAla: 2.817 ± 1.336
0.0AlaCys: 0.0 ± 0.0
2.817AlaAsp: 2.817 ± 1.879
4.695AlaGlu: 4.695 ± 2.227
4.225AlaPhe: 4.225 ± 2.004
0.939AlaGly: 0.939 ± 0.445
1.408AlaHis: 1.408 ± 3.102
4.225AlaIle: 4.225 ± 2.004
6.103AlaLys: 6.103 ± 1.934
4.225AlaLeu: 4.225 ± 4.598
2.347AlaMet: 2.347 ± 1.113
1.878AlaAsn: 1.878 ± 0.891
2.347AlaPro: 2.347 ± 1.113
1.878AlaGln: 1.878 ± 0.891
5.164AlaArg: 5.164 ± 2.45
3.286AlaSer: 3.286 ± 1.559
1.408AlaThr: 1.408 ± 0.668
0.939AlaVal: 0.939 ± 1.456
0.469AlaTrp: 0.469 ± 0.223
1.408AlaTyr: 1.408 ± 0.668
0.0AlaXaa: 0.0 ± 0.0
Cys
0.469CysAla: 0.469 ± 0.223
0.469CysCys: 0.469 ± 0.223
0.0CysAsp: 0.0 ± 0.0
1.408CysGlu: 1.408 ± 0.668
0.939CysPhe: 0.939 ± 0.445
0.939CysGly: 0.939 ± 0.445
0.939CysHis: 0.939 ± 0.445
0.469CysIle: 0.469 ± 0.223
1.878CysLys: 1.878 ± 0.891
0.0CysLeu: 0.0 ± 0.0
0.469CysMet: 0.469 ± 0.223
1.408CysAsn: 1.408 ± 0.668
0.0CysPro: 0.0 ± 0.0
0.939CysGln: 0.939 ± 0.445
1.408CysArg: 1.408 ± 0.668
1.408CysSer: 1.408 ± 0.668
0.939CysThr: 0.939 ± 0.445
0.0CysVal: 0.0 ± 0.0
0.469CysTrp: 0.469 ± 0.223
1.408CysTyr: 1.408 ± 1.275
0.0CysXaa: 0.0 ± 0.0
Asp
0.939AspAla: 0.939 ± 0.445
0.0AspCys: 0.0 ± 0.0
2.817AspAsp: 2.817 ± 1.318
6.573AspGlu: 6.573 ± 2.104
2.347AspPhe: 2.347 ± 1.113
0.469AspGly: 0.469 ± 0.223
0.939AspHis: 0.939 ± 0.445
4.225AspIle: 4.225 ± 1.415
2.347AspLys: 2.347 ± 1.113
4.225AspLeu: 4.225 ± 2.819
1.408AspMet: 1.408 ± 0.849
3.756AspAsn: 3.756 ± 1.782
2.817AspPro: 2.817 ± 1.336
3.286AspGln: 3.286 ± 0.796
0.469AspArg: 0.469 ± 0.223
2.347AspSer: 2.347 ± 1.113
1.408AspThr: 1.408 ± 0.668
2.817AspVal: 2.817 ± 4.368
1.408AspTrp: 1.408 ± 0.668
4.225AspTyr: 4.225 ± 3.189
0.0AspXaa: 0.0 ± 0.0
Glu
5.634GluAla: 5.634 ± 5.098
0.469GluCys: 0.469 ± 0.223
7.042GluAsp: 7.042 ± 0.559
12.207GluGlu: 12.207 ± 4.195
2.347GluPhe: 2.347 ± 1.113
5.634GluGly: 5.634 ± 2.672
1.878GluHis: 1.878 ± 1.108
8.92GluIle: 8.92 ± 2.68
10.329GluLys: 10.329 ± 1.9
7.512GluLeu: 7.512 ± 2.772
1.878GluMet: 1.878 ± 0.891
3.756GluAsn: 3.756 ± 1.347
0.939GluPro: 0.939 ± 1.665
5.634GluGln: 5.634 ± 3.324
4.695GluArg: 4.695 ± 1.053
1.878GluSer: 1.878 ± 0.891
3.286GluThr: 3.286 ± 0.796
8.451GluVal: 8.451 ± 0.845
2.347GluTrp: 2.347 ± 0.965
1.878GluTyr: 1.878 ± 0.891
0.0GluXaa: 0.0 ± 0.0
Phe
1.408PheAla: 1.408 ± 0.668
0.939PheCys: 0.939 ± 0.445
1.408PheAsp: 1.408 ± 0.668
2.817PheGlu: 2.817 ± 1.336
0.939PhePhe: 0.939 ± 0.445
1.878PheGly: 1.878 ± 1.434
0.939PheHis: 0.939 ± 0.445
4.225PheIle: 4.225 ± 0.857
0.939PheLys: 0.939 ± 0.445
2.347PheLeu: 2.347 ± 0.965
2.347PheMet: 2.347 ± 1.099
0.469PheAsn: 0.469 ± 0.223
1.408PhePro: 1.408 ± 0.668
1.878PheGln: 1.878 ± 0.891
1.408PheArg: 1.408 ± 0.668
2.347PheSer: 2.347 ± 0.965
2.347PheThr: 2.347 ± 1.113
0.939PheVal: 0.939 ± 1.665
0.939PheTrp: 0.939 ± 0.445
1.878PheTyr: 1.878 ± 0.891
0.0PheXaa: 0.0 ± 0.0
Gly
1.408GlyAla: 1.408 ± 0.668
0.469GlyCys: 0.469 ± 0.223
2.347GlyAsp: 2.347 ± 0.965
3.286GlyGlu: 3.286 ± 1.314
2.347GlyPhe: 2.347 ± 1.359
4.225GlyGly: 4.225 ± 1.415
1.408GlyHis: 1.408 ± 0.668
3.286GlyIle: 3.286 ± 0.796
5.634GlyLys: 5.634 ± 1.777
3.286GlyLeu: 3.286 ± 1.559
0.469GlyMet: 0.469 ± 1.288
0.939GlyAsn: 0.939 ± 1.665
1.408GlyPro: 1.408 ± 0.668
1.878GlyGln: 1.878 ± 0.891
2.347GlyArg: 2.347 ± 1.113
4.225GlySer: 4.225 ± 0.857
6.103GlyThr: 6.103 ± 2.623
2.347GlyVal: 2.347 ± 1.359
1.408GlyTrp: 1.408 ± 0.668
3.756GlyTyr: 3.756 ± 0.797
0.0GlyXaa: 0.0 ± 0.0
His
1.408HisAla: 1.408 ± 0.668
0.469HisCys: 0.469 ± 0.223
0.0HisAsp: 0.0 ± 0.0
2.347HisGlu: 2.347 ± 0.965
1.408HisPhe: 1.408 ± 0.668
0.0HisGly: 0.0 ± 0.0
0.469HisHis: 0.469 ± 0.223
2.817HisIle: 2.817 ± 1.336
1.408HisLys: 1.408 ± 0.668
2.347HisLeu: 2.347 ± 0.965
0.469HisMet: 0.469 ± 0.223
1.408HisAsn: 1.408 ± 4.943
0.469HisPro: 0.469 ± 0.223
0.939HisGln: 0.939 ± 0.445
0.939HisArg: 0.939 ± 1.456
0.0HisSer: 0.0 ± 0.0
1.408HisThr: 1.408 ± 2.528
1.408HisVal: 1.408 ± 0.668
0.469HisTrp: 0.469 ± 0.223
0.469HisTyr: 0.469 ± 0.223
0.0HisXaa: 0.0 ± 0.0
Ile
2.817IleAla: 2.817 ± 0.856
2.817IleCys: 2.817 ± 1.336
4.225IleAsp: 4.225 ± 2.004
7.042IleGlu: 7.042 ± 1.494
2.347IlePhe: 2.347 ± 1.359
4.695IleGly: 4.695 ± 2.227
2.347IleHis: 2.347 ± 1.113
4.225IleIle: 4.225 ± 2.004
6.573IleLys: 6.573 ± 1.593
4.695IleLeu: 4.695 ± 2.227
0.939IleMet: 0.939 ± 0.445
2.347IleAsn: 2.347 ± 1.113
1.878IlePro: 1.878 ± 0.891
5.164IleGln: 5.164 ± 2.837
4.225IleArg: 4.225 ± 0.857
5.634IleSer: 5.634 ± 2.155
3.756IleThr: 3.756 ± 1.457
3.286IleVal: 3.286 ± 1.559
0.0IleTrp: 0.0 ± 0.0
1.408IleTyr: 1.408 ± 0.668
0.0IleXaa: 0.0 ± 0.0
Lys
4.225LysAla: 4.225 ± 2.066
0.939LysCys: 0.939 ± 0.445
1.878LysAsp: 1.878 ± 1.434
12.207LysGlu: 12.207 ± 2.178
5.164LysPhe: 5.164 ± 2.376
4.225LysGly: 4.225 ± 0.857
1.408LysHis: 1.408 ± 0.668
4.695LysIle: 4.695 ± 1.93
7.042LysLys: 7.042 ± 2.282
8.451LysLeu: 8.451 ± 0.845
2.347LysMet: 2.347 ± 1.113
5.634LysAsn: 5.634 ± 3.951
3.756LysPro: 3.756 ± 1.782
4.225LysGln: 4.225 ± 1.252
5.164LysArg: 5.164 ± 2.837
7.512LysSer: 7.512 ± 3.563
3.286LysThr: 3.286 ± 1.314
7.512LysVal: 7.512 ± 2.113
0.939LysTrp: 0.939 ± 0.445
0.939LysTyr: 0.939 ± 0.445
0.0LysXaa: 0.0 ± 0.0
Leu
2.817LeuAla: 2.817 ± 1.318
1.878LeuCys: 1.878 ± 0.891
5.164LeuAsp: 5.164 ± 2.376
9.39LeuGlu: 9.39 ± 0.445
0.469LeuPhe: 0.469 ± 1.648
5.634LeuGly: 5.634 ± 0.702
1.408LeuHis: 1.408 ± 0.668
2.817LeuIle: 2.817 ± 1.879
9.859LeuLys: 9.859 ± 5.845
5.164LeuLeu: 5.164 ± 3.485
1.408LeuMet: 1.408 ± 1.275
4.695LeuAsn: 4.695 ± 2.598
4.695LeuPro: 4.695 ± 1.513
5.634LeuGln: 5.634 ± 3.324
4.695LeuArg: 4.695 ± 1.93
7.512LeuSer: 7.512 ± 2.468
3.756LeuThr: 3.756 ± 0.797
5.634LeuVal: 5.634 ± 3.951
0.469LeuTrp: 0.469 ± 0.223
1.878LeuTyr: 1.878 ± 0.891
0.0LeuXaa: 0.0 ± 0.0
Met
4.225MetAla: 4.225 ± 2.004
0.0MetCys: 0.0 ± 0.0
1.408MetAsp: 1.408 ± 0.668
1.408MetGlu: 1.408 ± 0.668
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.878MetIle: 1.878 ± 0.891
3.286MetLys: 3.286 ± 1.559
2.347MetLeu: 2.347 ± 0.965
0.0MetMet: 0.0 ± 0.0
0.469MetAsn: 0.469 ± 0.223
0.469MetPro: 0.469 ± 0.223
0.939MetGln: 0.939 ± 0.445
0.939MetArg: 0.939 ± 1.456
0.939MetSer: 0.939 ± 1.665
3.286MetThr: 3.286 ± 0.796
1.878MetVal: 1.878 ± 0.891
0.469MetTrp: 0.469 ± 0.223
0.939MetTyr: 0.939 ± 0.445
0.0MetXaa: 0.0 ± 0.0
Asn
1.878AsnAla: 1.878 ± 0.891
2.347AsnCys: 2.347 ± 1.113
2.347AsnAsp: 2.347 ± 1.113
2.817AsnGlu: 2.817 ± 1.336
1.878AsnPhe: 1.878 ± 0.891
3.286AsnGly: 3.286 ± 1.559
1.408AsnHis: 1.408 ± 3.102
4.225AsnIle: 4.225 ± 2.819
4.695AsnLys: 4.695 ± 0.966
3.756AsnLeu: 3.756 ± 1.457
0.0AsnMet: 0.0 ± 0.0
1.878AsnAsn: 1.878 ± 0.891
2.347AsnPro: 2.347 ± 1.359
3.286AsnGln: 3.286 ± 0.796
1.408AsnArg: 1.408 ± 1.275
3.286AsnSer: 3.286 ± 0.796
3.286AsnThr: 3.286 ± 4.862
0.469AsnVal: 0.469 ± 0.223
0.469AsnTrp: 0.469 ± 0.223
2.347AsnTyr: 2.347 ± 1.113
0.0AsnXaa: 0.0 ± 0.0
Pro
5.164ProAla: 5.164 ± 2.45
0.0ProCys: 0.0 ± 0.0
0.939ProAsp: 0.939 ± 0.445
3.286ProGlu: 3.286 ± 1.559
0.939ProPhe: 0.939 ± 0.445
0.939ProGly: 0.939 ± 0.445
1.408ProHis: 1.408 ± 0.668
0.939ProIle: 0.939 ± 0.445
2.817ProLys: 2.817 ± 0.856
4.225ProLeu: 4.225 ± 0.857
0.469ProMet: 0.469 ± 0.223
0.939ProAsn: 0.939 ± 0.445
0.939ProPro: 0.939 ± 0.445
1.878ProGln: 1.878 ± 0.891
1.878ProArg: 1.878 ± 0.891
2.817ProSer: 2.817 ± 1.318
1.408ProThr: 1.408 ± 1.538
1.878ProVal: 1.878 ± 1.434
0.469ProTrp: 0.469 ± 0.223
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
4.225GlnAla: 4.225 ± 2.066
0.0GlnCys: 0.0 ± 0.0
3.286GlnAsp: 3.286 ± 1.559
4.225GlnGlu: 4.225 ± 0.857
1.408GlnPhe: 1.408 ± 0.668
3.756GlnGly: 3.756 ± 1.457
0.469GlnHis: 0.469 ± 0.223
2.347GlnIle: 2.347 ± 0.965
3.756GlnLys: 3.756 ± 0.797
6.573GlnLeu: 6.573 ± 3.333
2.817GlnMet: 2.817 ± 0.856
1.878GlnAsn: 1.878 ± 0.891
3.756GlnPro: 3.756 ± 0.797
2.347GlnGln: 2.347 ± 0.965
3.756GlnArg: 3.756 ± 0.797
1.878GlnSer: 1.878 ± 0.891
1.408GlnThr: 1.408 ± 2.528
2.347GlnVal: 2.347 ± 2.094
0.0GlnTrp: 0.0 ± 0.0
1.408GlnTyr: 1.408 ± 0.668
0.0GlnXaa: 0.0 ± 0.0
Arg
1.878ArgAla: 1.878 ± 0.891
1.878ArgCys: 1.878 ± 0.891
0.939ArgAsp: 0.939 ± 1.456
1.878ArgGlu: 1.878 ± 2.912
1.408ArgPhe: 1.408 ± 0.668
2.817ArgGly: 2.817 ± 0.856
0.939ArgHis: 0.939 ± 1.456
4.695ArgIle: 4.695 ± 2.227
3.286ArgLys: 3.286 ± 1.559
6.103ArgLeu: 6.103 ± 4.218
2.817ArgMet: 2.817 ± 1.336
1.878ArgAsn: 1.878 ± 0.891
0.939ArgPro: 0.939 ± 1.456
1.408ArgGln: 1.408 ± 0.668
4.695ArgArg: 4.695 ± 1.93
4.225ArgSer: 4.225 ± 5.638
4.695ArgThr: 4.695 ± 2.227
1.878ArgVal: 1.878 ± 1.108
1.408ArgTrp: 1.408 ± 0.668
2.347ArgTyr: 2.347 ± 1.113
0.0ArgXaa: 0.0 ± 0.0
Ser
5.164SerAla: 5.164 ± 1.635
0.469SerCys: 0.469 ± 0.223
4.225SerAsp: 4.225 ± 0.857
8.92SerGlu: 8.92 ± 3.992
0.939SerPhe: 0.939 ± 0.445
2.817SerGly: 2.817 ± 1.336
0.939SerHis: 0.939 ± 1.456
3.286SerIle: 3.286 ± 1.314
5.634SerLys: 5.634 ± 1.712
5.164SerLeu: 5.164 ± 1.109
1.878SerMet: 1.878 ± 0.891
3.286SerAsn: 3.286 ± 2.966
1.878SerPro: 1.878 ± 0.891
4.695SerGln: 4.695 ± 2.598
2.347SerArg: 2.347 ± 0.965
6.103SerSer: 6.103 ± 2.895
5.164SerThr: 5.164 ± 2.669
2.817SerVal: 2.817 ± 1.879
0.939SerTrp: 0.939 ± 0.445
0.939SerTyr: 0.939 ± 0.445
0.0SerXaa: 0.0 ± 0.0
Thr
1.878ThrAla: 1.878 ± 1.434
0.469ThrCys: 0.469 ± 1.648
2.817ThrAsp: 2.817 ± 1.318
6.103ThrGlu: 6.103 ± 3.172
2.347ThrPhe: 2.347 ± 1.113
5.634ThrGly: 5.634 ± 2.671
0.939ThrHis: 0.939 ± 0.445
2.817ThrIle: 2.817 ± 1.336
6.103ThrLys: 6.103 ± 6.039
5.164ThrLeu: 5.164 ± 2.669
0.469ThrMet: 0.469 ± 0.223
2.347ThrAsn: 2.347 ± 1.113
0.939ThrPro: 0.939 ± 0.445
1.408ThrGln: 1.408 ± 0.668
1.878ThrArg: 1.878 ± 0.891
6.103ThrSer: 6.103 ± 7.318
4.695ThrThr: 4.695 ± 1.513
3.286ThrVal: 3.286 ± 1.559
0.0ThrTrp: 0.0 ± 0.0
1.408ThrTyr: 1.408 ± 0.668
0.0ThrXaa: 0.0 ± 0.0
Val
1.408ValAla: 1.408 ± 0.668
1.878ValCys: 1.878 ± 0.891
3.286ValAsp: 3.286 ± 1.559
3.286ValGlu: 3.286 ± 2.378
1.408ValPhe: 1.408 ± 0.668
3.756ValGly: 3.756 ± 2.869
0.939ValHis: 0.939 ± 1.665
6.103ValIle: 6.103 ± 2.513
4.695ValLys: 4.695 ± 2.598
4.695ValLeu: 4.695 ± 2.598
0.469ValMet: 0.469 ± 0.223
3.286ValAsn: 3.286 ± 0.796
0.469ValPro: 0.469 ± 0.223
2.817ValGln: 2.817 ± 1.879
2.347ValArg: 2.347 ± 1.113
2.817ValSer: 2.817 ± 2.549
3.756ValThr: 3.756 ± 1.457
2.817ValVal: 2.817 ± 1.879
0.469ValTrp: 0.469 ± 1.648
1.408ValTyr: 1.408 ± 0.668
0.0ValXaa: 0.0 ± 0.0
Trp
0.939TrpAla: 0.939 ± 0.445
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.878TrpGlu: 1.878 ± 1.108
0.469TrpPhe: 0.469 ± 0.223
0.469TrpGly: 0.469 ± 0.223
0.0TrpHis: 0.0 ± 0.0
0.469TrpIle: 0.469 ± 0.223
0.939TrpLys: 0.939 ± 1.456
1.408TrpLeu: 1.408 ± 0.668
0.0TrpMet: 0.0 ± 0.0
1.878TrpAsn: 1.878 ± 0.891
0.469TrpPro: 0.469 ± 0.223
0.0TrpGln: 0.0 ± 0.0
1.408TrpArg: 1.408 ± 0.668
1.878TrpSer: 1.878 ± 0.891
0.469TrpThr: 0.469 ± 0.223
0.469TrpVal: 0.469 ± 0.223
0.939TrpTrp: 0.939 ± 0.445
0.469TrpTyr: 0.469 ± 0.223
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.878TyrAla: 1.878 ± 0.891
0.469TyrCys: 0.469 ± 0.223
2.347TyrAsp: 2.347 ± 1.113
0.939TyrGlu: 0.939 ± 0.445
0.469TyrPhe: 0.469 ± 0.223
0.469TyrGly: 0.469 ± 0.223
0.469TyrHis: 0.469 ± 0.223
3.756TyrIle: 3.756 ± 1.782
4.225TyrLys: 4.225 ± 0.857
3.286TyrLeu: 3.286 ± 1.667
1.408TyrMet: 1.408 ± 0.668
3.286TyrAsn: 3.286 ± 1.559
1.408TyrPro: 1.408 ± 0.668
1.408TyrGln: 1.408 ± 0.668
0.939TyrArg: 0.939 ± 1.456
1.408TyrSer: 1.408 ± 0.668
0.939TyrThr: 0.939 ± 1.665
0.939TyrVal: 0.939 ± 0.445
0.469TyrTrp: 0.469 ± 0.223
0.939TyrTyr: 0.939 ± 0.445
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2131 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
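As a rough illustration of what bootstrapping at the protein level can look like, the sketch below resamples whole proteins with replacement, recomputes the frequency of one dipeptide for each replicate, and reports the spread of those estimates. The function names are hypothetical, and using the sample standard deviation as the reported error is an assumption; the page does not state which statistic is used.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the dipeptide frequency over protein-level bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement.
    The sample standard deviation is an assumed choice of error measure.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input: three identical proteins give identical replicates, so no spread.
identical_error = bootstrap_error(["AAC", "AAC", "AAC"], "AA")
# Toy input with differing proteins, so replicates can differ.
varied_error = bootstrap_error(["AAAA", "CCCC", "AACC"], "AA")
```

With only 3 proteins in this proteome, each replicate is drawn from a very small pool, which may explain why some errors in the table are larger than the values they accompany.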

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski