Amino acid dipeptide frequency for Gooseberry vein banding associated virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
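The counting described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used to build this page: the function name `dipeptide_per_mille`, the use of one-letter codes (with `X` standing for Xaa), and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Hypothetical helper: pairs are counted inside each sequence only,
    and any non-standard or unknown residue is folded into 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"  # 21 symbols, so 441 possible pairs
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400

# Toy input (not real viral proteins); 'B' is treated as Xaa.
freqs = dipeptide_per_mille(["MKVLAAGA", "MAAKB"])
```

The returned dictionary always has 441 keys, so dipeptides that never occur show up with a frequency of 0.0, as in the table below.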

All values are presented in per mille (‰); multiply them by 10^-3 to obtain fractions.
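As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical, and the strict comparison against 2.5‰ is an assumption about how "more than expected" is decided; the page does not state the exact rule used for colouring.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 0.25% expressed in per mille


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per mille frequency relative to the expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


ala_ala = 8.333  # AlaAla value taken from the table
ala_his = 0.439  # AlaHis value taken from the table

ala_ala_fraction = per_mille_to_fraction(ala_ala)
labels = (classify(ala_ala), classify(ala_his))
```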

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.333 ± 1.381
AlaCys: 0.0 ± 0.0
AlaAsp: 2.632 ± 1.444
AlaGlu: 7.018 ± 0.469
AlaPhe: 2.632 ± 1.114
AlaGly: 3.947 ± 0.774
AlaHis: 0.439 ± 0.186
AlaIle: 3.947 ± 2.152
AlaLys: 3.947 ± 3.745
AlaLeu: 5.263 ± 0.347
AlaMet: 2.193 ± 0.929
AlaAsn: 1.754 ± 0.743
AlaPro: 4.386 ± 1.01
AlaGln: 2.632 ± 0.956
AlaArg: 3.07 ± 0.682
AlaSer: 4.825 ± 1.104
AlaThr: 3.947 ± 2.783
AlaVal: 6.14 ± 1.924
AlaTrp: 0.877 ± 0.841
AlaTyr: 2.632 ± 1.114
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.877 ± 0.371
CysCys: 0.877 ± 0.371
CysAsp: 0.877 ± 0.371
CysGlu: 0.877 ± 0.841
CysPhe: 0.877 ± 0.371
CysGly: 1.754 ± 0.743
CysHis: 0.439 ± 0.981
CysIle: 0.877 ± 0.371
CysLys: 2.632 ± 1.114
CysLeu: 0.439 ± 0.981
CysMet: 0.439 ± 0.186
CysAsn: 0.439 ± 0.186
CysPro: 1.316 ± 0.557
CysGln: 0.439 ± 0.186
CysArg: 0.877 ± 0.371
CysSer: 0.877 ± 0.371
CysThr: 0.877 ± 0.371
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.877 ± 0.841
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.509 ± 2.258
AspCys: 0.877 ± 0.841
AspAsp: 3.07 ± 2.373
AspGlu: 4.386 ± 1.857
AspPhe: 1.754 ± 0.635
AspGly: 0.439 ± 0.186
AspHis: 1.316 ± 0.557
AspIle: 2.193 ± 0.929
AspLys: 1.754 ± 0.743
AspLeu: 5.702 ± 0.974
AspMet: 1.754 ± 0.743
AspAsn: 2.632 ± 0.956
AspPro: 3.07 ± 1.124
AspGln: 3.07 ± 1.124
AspArg: 2.632 ± 0.956
AspSer: 2.632 ± 0.613
AspThr: 4.386 ± 1.515
AspVal: 2.193 ± 0.929
AspTrp: 0.439 ± 0.186
AspTyr: 2.632 ± 1.114
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.825 ± 1.333
GluCys: 0.877 ± 0.371
GluAsp: 4.386 ± 0.609
GluGlu: 10.965 ± 2.961
GluPhe: 3.509 ± 0.947
GluGly: 6.579 ± 0.512
GluHis: 1.754 ± 0.743
GluIle: 4.386 ± 1.192
GluLys: 3.509 ± 0.79
GluLeu: 8.772 ± 2.6
GluMet: 0.877 ± 0.505
GluAsn: 1.754 ± 0.743
GluPro: 3.07 ± 0.682
GluGln: 4.386 ± 1.072
GluArg: 5.702 ± 1.849
GluSer: 6.14 ± 0.382
GluThr: 2.632 ± 0.956
GluVal: 7.456 ± 2.483
GluTrp: 1.316 ± 0.557
GluTyr: 1.754 ± 0.743
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.877 ± 1.382
PheCys: 1.316 ± 0.557
PheAsp: 3.947 ± 0.923
PheGlu: 2.632 ± 0.613
PhePhe: 0.439 ± 1.527
PheGly: 1.754 ± 0.743
PheHis: 0.877 ± 0.371
PheIle: 3.509 ± 0.79
PheLys: 0.877 ± 0.371
PheLeu: 1.754 ± 0.743
PheMet: 0.439 ± 0.186
PheAsn: 0.439 ± 0.981
PhePro: 1.316 ± 0.722
PheGln: 1.754 ± 0.743
PheArg: 2.632 ± 0.613
PheSer: 0.877 ± 0.371
PheThr: 2.193 ± 0.929
PheVal: 3.509 ± 3.334
PheTrp: 0.0 ± 0.0
PheTyr: 1.754 ± 0.743
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.07 ± 0.682
GlyCys: 1.316 ± 0.557
GlyAsp: 3.07 ± 1.3
GlyGlu: 7.018 ± 1.803
GlyPhe: 3.509 ± 0.947
GlyGly: 6.14 ± 0.382
GlyHis: 0.877 ± 0.371
GlyIle: 3.947 ± 2.152
GlyLys: 4.825 ± 1.978
GlyLeu: 3.509 ± 1.486
GlyMet: 1.754 ± 0.743
GlyAsn: 2.632 ± 0.613
GlyPro: 2.632 ± 1.114
GlyGln: 0.439 ± 1.527
GlyArg: 5.702 ± 1.351
GlySer: 1.316 ± 0.557
GlyThr: 3.509 ± 0.947
GlyVal: 5.702 ± 2.415
GlyTrp: 2.193 ± 0.929
GlyTyr: 3.07 ± 1.3
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.439 ± 0.981
HisCys: 0.439 ± 0.186
HisAsp: 2.193 ± 1.556
HisGlu: 0.877 ± 0.371
HisPhe: 0.439 ± 0.981
HisGly: 2.193 ± 0.929
HisHis: 0.0 ± 0.0
HisIle: 2.193 ± 0.929
HisLys: 1.316 ± 0.557
HisLeu: 1.316 ± 0.557
HisMet: 0.439 ± 0.186
HisAsn: 2.193 ± 0.596
HisPro: 0.0 ± 0.0
HisGln: 1.754 ± 0.743
HisArg: 1.316 ± 0.557
HisSer: 0.0 ± 0.0
HisThr: 0.439 ± 0.981
HisVal: 1.754 ± 0.743
HisTrp: 0.439 ± 0.186
HisTyr: 1.754 ± 0.635
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.509 ± 1.486
IleCys: 0.877 ± 0.371
IleAsp: 2.632 ± 1.114
IleGlu: 5.702 ± 0.314
IlePhe: 0.0 ± 0.0
IleGly: 3.947 ± 2.152
IleHis: 2.193 ± 1.556
IleIle: 2.632 ± 1.114
IleLys: 4.825 ± 1.231
IleLeu: 2.193 ± 0.929
IleMet: 0.0 ± 0.0
IleAsn: 3.07 ± 1.3
IlePro: 3.509 ± 1.271
IleGln: 3.947 ± 1.698
IleArg: 3.07 ± 0.915
IleSer: 4.825 ± 3.684
IleThr: 2.193 ± 0.929
IleVal: 3.509 ± 0.947
IleTrp: 0.0 ± 0.0
IleTyr: 2.193 ± 1.485
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.702 ± 3.312
LysCys: 1.316 ± 0.557
LysAsp: 3.07 ± 2.373
LysGlu: 5.263 ± 2.608
LysPhe: 2.193 ± 1.029
LysGly: 4.386 ± 1.01
LysHis: 0.877 ± 0.841
LysIle: 2.632 ± 1.304
LysLys: 5.263 ± 2.261
LysLeu: 7.456 ± 1.376
LysMet: 1.754 ± 0.743
LysAsn: 2.632 ± 0.956
LysPro: 3.509 ± 0.947
LysGln: 1.316 ± 1.85
LysArg: 1.316 ± 0.722
LysSer: 3.07 ± 1.347
LysThr: 3.07 ± 1.347
LysVal: 5.702 ± 1.284
LysTrp: 1.316 ± 0.557
LysTyr: 0.877 ± 0.371
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.702 ± 1.566
LeuCys: 1.754 ± 0.743
LeuAsp: 2.632 ± 4.147
LeuGlu: 8.333 ± 1.176
LeuPhe: 1.754 ± 0.635
LeuGly: 3.07 ± 1.3
LeuHis: 1.316 ± 0.557
LeuIle: 3.509 ± 0.91
LeuLys: 3.947 ± 4.339
LeuLeu: 2.632 ± 0.613
LeuMet: 1.754 ± 0.743
LeuAsn: 3.947 ± 0.923
LeuPro: 4.386 ± 0.609
LeuGln: 4.386 ± 1.857
LeuArg: 7.895 ± 1.307
LeuSer: 6.579 ± 1.915
LeuThr: 6.14 ± 4.636
LeuVal: 4.386 ± 1.01
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.632 ± 0.956
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.754 ± 0.743
MetCys: 0.0 ± 0.0
MetAsp: 2.632 ± 1.114
MetGlu: 1.754 ± 0.743
MetPhe: 0.439 ± 0.186
MetGly: 0.439 ± 0.186
MetHis: 0.439 ± 0.186
MetIle: 0.877 ± 0.371
MetLys: 0.877 ± 0.371
MetLeu: 3.07 ± 1.3
MetMet: 0.0 ± 0.0
MetAsn: 0.877 ± 0.371
MetPro: 3.07 ± 1.3
MetGln: 1.316 ± 0.557
MetArg: 1.316 ± 0.722
MetSer: 1.316 ± 1.248
MetThr: 1.316 ± 0.557
MetVal: 1.754 ± 0.743
MetTrp: 0.439 ± 0.186
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.07 ± 0.915
AsnCys: 0.877 ± 0.371
AsnAsp: 1.754 ± 1.129
AsnGlu: 1.316 ± 0.557
AsnPhe: 3.07 ± 1.3
AsnGly: 3.07 ± 0.915
AsnHis: 0.877 ± 0.371
AsnIle: 0.439 ± 0.186
AsnLys: 2.632 ± 1.114
AsnLeu: 3.07 ± 0.682
AsnMet: 1.316 ± 0.903
AsnAsn: 3.947 ± 0.943
AsnPro: 0.877 ± 1.382
AsnGln: 4.386 ± 3.113
AsnArg: 2.193 ± 1.029
AsnSer: 2.632 ± 0.613
AsnThr: 3.509 ± 2.611
AsnVal: 2.193 ± 0.929
AsnTrp: 0.439 ± 0.186
AsnTyr: 1.316 ± 0.557
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.386 ± 1.01
ProCys: 0.439 ± 0.981
ProAsp: 3.07 ± 1.3
ProGlu: 6.14 ± 1.924
ProPhe: 1.316 ± 0.557
ProGly: 3.947 ± 1.672
ProHis: 1.316 ± 0.557
ProIle: 1.316 ± 0.722
ProLys: 3.509 ± 2.611
ProLeu: 1.754 ± 0.635
ProMet: 0.877 ± 0.347
ProAsn: 1.754 ± 1.129
ProPro: 4.386 ± 1.857
ProGln: 2.193 ± 0.929
ProArg: 3.509 ± 0.79
ProSer: 3.509 ± 0.79
ProThr: 3.07 ± 0.682
ProVal: 2.632 ± 1.114
ProTrp: 1.316 ± 0.557
ProTyr: 1.316 ± 0.557
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.509 ± 0.947
GlnCys: 0.0 ± 0.0
GlnAsp: 1.316 ± 0.557
GlnGlu: 3.509 ± 0.79
GlnPhe: 1.316 ± 0.722
GlnGly: 3.509 ± 1.486
GlnHis: 1.316 ± 0.722
GlnIle: 3.509 ± 2.96
GlnLys: 1.754 ± 0.743
GlnLeu: 5.263 ± 4.516
GlnMet: 0.877 ± 0.371
GlnAsn: 3.509 ± 2.258
GlnPro: 1.316 ± 0.557
GlnGln: 1.754 ± 0.635
GlnArg: 2.632 ± 0.613
GlnSer: 2.193 ± 0.596
GlnThr: 2.193 ± 0.929
GlnVal: 3.509 ± 0.91
GlnTrp: 0.877 ± 0.371
GlnTyr: 0.877 ± 0.841
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.509 ± 0.947
ArgCys: 0.0 ± 0.0
ArgAsp: 2.632 ± 0.613
ArgGlu: 3.509 ± 1.271
ArgPhe: 3.509 ± 0.79
ArgGly: 3.947 ± 1.672
ArgHis: 1.754 ± 0.635
ArgIle: 3.07 ± 0.915
ArgLys: 5.263 ± 2.261
ArgLeu: 4.386 ± 1.072
ArgMet: 4.825 ± 2.043
ArgAsn: 3.07 ± 1.124
ArgPro: 3.07 ± 0.682
ArgGln: 3.07 ± 0.915
ArgArg: 7.018 ± 0.669
ArgSer: 5.263 ± 0.347
ArgThr: 3.947 ± 0.923
ArgVal: 2.632 ± 1.114
ArgTrp: 1.316 ± 0.557
ArgTyr: 2.193 ± 0.596
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.07 ± 0.915
SerCys: 1.316 ± 0.722
SerAsp: 2.193 ± 0.929
SerGlu: 5.702 ± 0.974
SerPhe: 0.877 ± 0.371
SerGly: 6.14 ± 2.6
SerHis: 0.877 ± 0.371
SerIle: 5.263 ± 1.153
SerLys: 3.07 ± 1.124
SerLeu: 6.579 ± 2.071
SerMet: 0.877 ± 0.371
SerAsn: 0.877 ± 0.371
SerPro: 2.632 ± 1.114
SerGln: 2.193 ± 3.779
SerArg: 5.702 ± 0.974
SerSer: 3.947 ± 2.449
SerThr: 3.947 ± 0.943
SerVal: 2.632 ± 1.114
SerTrp: 1.316 ± 1.248
SerTyr: 2.193 ± 0.929
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.263 ± 3.498
ThrCys: 0.877 ± 0.371
ThrAsp: 2.632 ± 1.114
ThrGlu: 3.07 ± 0.682
ThrPhe: 0.877 ± 0.371
ThrGly: 5.263 ± 2.608
ThrHis: 2.632 ± 1.114
ThrIle: 4.386 ± 1.192
ThrLys: 3.07 ± 2.066
ThrLeu: 5.702 ± 1.861
ThrMet: 0.439 ± 0.186
ThrAsn: 4.386 ± 2.289
ThrPro: 3.509 ± 0.79
ThrGln: 2.193 ± 1.556
ThrArg: 3.509 ± 1.486
ThrSer: 4.825 ± 2.134
ThrThr: 4.825 ± 2.043
ThrVal: 1.754 ± 1.129
ThrTrp: 0.439 ± 0.186
ThrTyr: 1.754 ± 0.635
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.263 ± 1.396
ValCys: 3.509 ± 0.79
ValAsp: 3.947 ± 0.774
ValGlu: 2.193 ± 0.929
ValPhe: 3.509 ± 0.79
ValGly: 3.509 ± 0.91
ValHis: 0.877 ± 0.371
ValIle: 4.386 ± 1.072
ValLys: 4.825 ± 2.788
ValLeu: 4.386 ± 2.066
ValMet: 2.193 ± 0.929
ValAsn: 1.316 ± 0.557
ValPro: 2.632 ± 1.114
ValGln: 1.316 ± 0.557
ValArg: 4.825 ± 1.231
ValSer: 3.07 ± 0.915
ValThr: 4.825 ± 0.46
ValVal: 3.07 ± 0.915
ValTrp: 0.877 ± 1.382
ValTyr: 1.754 ± 0.743
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.439 ± 0.186
TrpCys: 0.0 ± 0.0
TrpAsp: 0.877 ± 0.371
TrpGlu: 1.754 ± 1.667
TrpPhe: 0.0 ± 0.0
TrpGly: 1.316 ± 0.557
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.316 ± 1.248
TrpLeu: 1.754 ± 0.743
TrpMet: 0.439 ± 0.186
TrpAsn: 0.439 ± 0.186
TrpPro: 0.439 ± 0.186
TrpGln: 0.0 ± 0.0
TrpArg: 1.316 ± 0.557
TrpSer: 0.877 ± 0.371
TrpThr: 2.632 ± 1.114
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.509 ± 1.486
TyrCys: 0.0 ± 0.0
TyrAsp: 0.877 ± 0.841
TyrGlu: 3.07 ± 1.3
TyrPhe: 0.877 ± 0.371
TyrGly: 0.877 ± 0.371
TyrHis: 1.316 ± 0.722
TyrIle: 1.754 ± 0.743
TyrLys: 3.509 ± 0.947
TyrLeu: 1.754 ± 0.743
TyrMet: 0.0 ± 0.0
TyrAsn: 1.316 ± 0.722
TyrPro: 2.632 ± 0.613
TyrGln: 2.193 ± 0.929
TyrArg: 1.754 ± 0.743
TyrSer: 2.632 ± 0.613
TyrThr: 1.754 ± 1.129
TyrVal: 1.754 ± 0.743
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.316 ± 0.557
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2281 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
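One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an assumption about the procedure, not a description of the actual Proteome-pI pipeline; the function names, the fixed seed, and the use of the sample standard deviation are all illustrative choices.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences (one-letter codes), not the real proteome.
toy_proteins = ["AAAA", "MKVL", "GGGG"]
toy_error = bootstrap_error(toy_proteins, "AA")
```

With only 3 proteins in this proteome, each resample can differ strongly from the original set, which would be consistent with the comparatively large errors seen for some dipeptides in the table.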

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski