Amino acid dipepetide frequency for Pineapple bacilliform CO virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.083AlaAla: 4.083 ± 1.12
1.815AlaCys: 1.815 ± 1.238
3.176AlaAsp: 3.176 ± 2.606
5.445AlaGlu: 5.445 ± 2.117
2.269AlaPhe: 2.269 ± 1.12
1.361AlaGly: 1.361 ± 0.672
1.361AlaHis: 1.361 ± 1.374
5.898AlaIle: 5.898 ± 2.913
2.269AlaLys: 2.269 ± 1.12
6.806AlaLeu: 6.806 ± 1.684
1.815AlaMet: 1.815 ± 0.896
2.269AlaAsn: 2.269 ± 2.278
2.269AlaPro: 2.269 ± 1.609
2.269AlaGln: 2.269 ± 1.12
3.176AlaArg: 3.176 ± 1.569
4.083AlaSer: 4.083 ± 2.017
2.269AlaThr: 2.269 ± 1.12
4.991AlaVal: 4.991 ± 0.993
0.0AlaTrp: 0.0 ± 0.0
4.083AlaTyr: 4.083 ± 1.409
0.0AlaXaa: 0.0 ± 0.0
Cys
0.454CysAla: 0.454 ± 0.224
0.454CysCys: 0.454 ± 0.224
0.0CysAsp: 0.0 ± 0.0
2.269CysGlu: 2.269 ± 1.13
0.907CysPhe: 0.907 ± 0.448
1.815CysGly: 1.815 ± 0.896
0.454CysHis: 0.454 ± 0.224
0.454CysIle: 0.454 ± 0.224
2.722CysLys: 2.722 ± 1.344
0.907CysLeu: 0.907 ± 0.448
0.454CysMet: 0.454 ± 0.224
1.361CysAsn: 1.361 ± 0.672
0.454CysPro: 0.454 ± 0.224
0.907CysGln: 0.907 ± 0.448
1.361CysArg: 1.361 ± 0.672
0.907CysSer: 0.907 ± 0.448
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.454CysTrp: 0.454 ± 0.224
1.361CysTyr: 1.361 ± 0.672
0.0CysXaa: 0.0 ± 0.0
Asp
2.722AspAla: 2.722 ± 1.344
2.269AspCys: 2.269 ± 1.12
6.352AspAsp: 6.352 ± 1.868
2.269AspGlu: 2.269 ± 1.12
4.083AspPhe: 4.083 ± 2.017
2.722AspGly: 2.722 ± 1.344
0.907AspHis: 0.907 ± 0.448
2.722AspIle: 2.722 ± 2.059
2.269AspLys: 2.269 ± 1.609
4.537AspLeu: 4.537 ± 6.054
1.361AspMet: 1.361 ± 0.672
3.63AspAsn: 3.63 ± 1.054
2.722AspPro: 2.722 ± 1.344
1.815AspGln: 1.815 ± 0.896
3.176AspArg: 3.176 ± 1.569
1.815AspSer: 1.815 ± 0.896
2.269AspThr: 2.269 ± 1.609
1.815AspVal: 1.815 ± 1.238
0.907AspTrp: 0.907 ± 0.448
1.815AspTyr: 1.815 ± 0.896
0.0AspXaa: 0.0 ± 0.0
Glu
6.806GluAla: 6.806 ± 2.161
0.0GluCys: 0.0 ± 0.0
6.806GluAsp: 6.806 ± 2.058
15.426GluGlu: 15.426 ± 1.259
1.361GluPhe: 1.361 ± 0.672
5.898GluGly: 5.898 ± 2.109
4.537GluHis: 4.537 ± 2.259
3.63GluIle: 3.63 ± 6.13
11.343GluLys: 11.343 ± 4.656
10.889GluLeu: 10.889 ± 3.161
1.815GluMet: 1.815 ± 0.896
2.722GluAsn: 2.722 ± 1.344
1.361GluPro: 1.361 ± 0.672
6.352GluGln: 6.352 ± 5.212
3.63GluArg: 3.63 ± 1.469
4.083GluSer: 4.083 ± 1.12
2.269GluThr: 2.269 ± 1.609
4.537GluVal: 4.537 ± 1.225
1.361GluTrp: 1.361 ± 0.672
2.269GluTyr: 2.269 ± 1.13
0.0GluXaa: 0.0 ± 0.0
Phe
2.269PheAla: 2.269 ± 1.12
0.454PheCys: 0.454 ± 0.224
2.722PheAsp: 2.722 ± 1.344
2.269PheGlu: 2.269 ± 1.609
1.361PhePhe: 1.361 ± 0.672
1.815PheGly: 1.815 ± 0.896
0.454PheHis: 0.454 ± 0.224
2.722PheIle: 2.722 ± 1.058
0.907PheLys: 0.907 ± 0.448
3.176PheLeu: 3.176 ± 1.569
0.0PheMet: 0.0 ± 0.0
1.815PheAsn: 1.815 ± 0.896
1.361PhePro: 1.361 ± 0.672
2.269PheGln: 2.269 ± 1.12
2.269PheArg: 2.269 ± 1.609
1.361PheSer: 1.361 ± 0.672
2.722PheThr: 2.722 ± 1.344
1.361PheVal: 1.361 ± 1.838
0.454PheTrp: 0.454 ± 0.224
1.361PheTyr: 1.361 ± 0.672
0.0PheXaa: 0.0 ± 0.0
Gly
3.63GlyAla: 3.63 ± 1.054
1.361GlyCys: 1.361 ± 0.672
1.815GlyAsp: 1.815 ± 0.896
7.26GlyGlu: 7.26 ± 2.331
2.722GlyPhe: 2.722 ± 1.531
3.176GlyGly: 3.176 ± 1.484
0.907GlyHis: 0.907 ± 0.448
3.63GlyIle: 3.63 ± 1.793
3.176GlyLys: 3.176 ± 1.569
4.991GlyLeu: 4.991 ± 0.993
1.815GlyMet: 1.815 ± 0.799
2.722GlyAsn: 2.722 ± 1.344
2.722GlyPro: 2.722 ± 1.344
0.454GlyGln: 0.454 ± 0.224
3.63GlyArg: 3.63 ± 1.054
1.815GlySer: 1.815 ± 1.713
3.176GlyThr: 3.176 ± 1.484
3.176GlyVal: 3.176 ± 1.569
0.454GlyTrp: 0.454 ± 0.224
1.361GlyTyr: 1.361 ± 0.672
0.0GlyXaa: 0.0 ± 0.0
His
0.907HisAla: 0.907 ± 1.532
0.907HisCys: 0.907 ± 0.448
0.454HisAsp: 0.454 ± 0.224
2.722HisGlu: 2.722 ± 1.058
0.907HisPhe: 0.907 ± 0.448
0.907HisGly: 0.907 ± 0.448
0.907HisHis: 0.907 ± 0.448
1.815HisIle: 1.815 ± 0.896
2.269HisLys: 2.269 ± 2.902
3.176HisLeu: 3.176 ± 1.032
0.0HisMet: 0.0 ± 0.0
0.454HisAsn: 0.454 ± 1.704
0.454HisPro: 0.454 ± 0.224
1.815HisGln: 1.815 ± 1.238
1.815HisArg: 1.815 ± 1.238
0.454HisSer: 0.454 ± 0.224
0.907HisThr: 0.907 ± 1.532
2.722HisVal: 2.722 ± 1.344
0.454HisTrp: 0.454 ± 0.224
0.454HisTyr: 0.454 ± 0.224
0.0HisXaa: 0.0 ± 0.0
Ile
2.722IleAla: 2.722 ± 1.344
1.815IleCys: 1.815 ± 0.896
3.63IleAsp: 3.63 ± 1.793
3.176IleGlu: 3.176 ± 1.032
0.907IlePhe: 0.907 ± 0.448
4.991IleGly: 4.991 ± 1.623
1.815IleHis: 1.815 ± 3.063
4.537IleIle: 4.537 ± 1.225
3.63IleLys: 3.63 ± 1.054
6.352IleLeu: 6.352 ± 1.895
0.454IleMet: 0.454 ± 0.224
1.815IleAsn: 1.815 ± 1.238
2.722IlePro: 2.722 ± 1.344
2.722IleGln: 2.722 ± 1.058
4.537IleArg: 4.537 ± 1.225
4.083IleSer: 4.083 ± 8.243
3.63IleThr: 3.63 ± 1.469
5.445IleVal: 5.445 ± 1.73
0.454IleTrp: 0.454 ± 0.224
1.815IleTyr: 1.815 ± 0.896
0.0IleXaa: 0.0 ± 0.0
Lys
5.898LysAla: 5.898 ± 2.109
1.815LysCys: 1.815 ± 0.896
3.176LysAsp: 3.176 ± 5.218
9.982LysGlu: 9.982 ± 0.594
2.722LysPhe: 2.722 ± 1.058
4.083LysGly: 4.083 ± 1.489
2.722LysHis: 2.722 ± 1.058
5.445LysIle: 5.445 ± 2.117
4.083LysLys: 4.083 ± 1.489
5.898LysLeu: 5.898 ± 3.953
2.269LysMet: 2.269 ± 1.022
3.63LysAsn: 3.63 ± 1.793
1.815LysPro: 1.815 ± 0.896
2.722LysGln: 2.722 ± 5.942
1.815LysArg: 1.815 ± 1.713
4.537LysSer: 4.537 ± 1.198
2.722LysThr: 2.722 ± 1.344
6.352LysVal: 6.352 ± 3.681
0.454LysTrp: 0.454 ± 0.224
0.907LysTyr: 0.907 ± 0.448
0.0LysXaa: 0.0 ± 0.0
Leu
4.991LeuAla: 4.991 ± 3.842
0.454LeuCys: 0.454 ± 0.224
4.991LeuAsp: 4.991 ± 1.623
9.982LeuGlu: 9.982 ± 5.304
0.907LeuPhe: 0.907 ± 0.448
4.991LeuGly: 4.991 ± 1.359
1.815LeuHis: 1.815 ± 0.896
3.63LeuIle: 3.63 ± 2.476
9.528LeuLys: 9.528 ± 2.189
7.713LeuLeu: 7.713 ± 2.95
0.454LeuMet: 0.454 ± 0.404
4.991LeuAsn: 4.991 ± 0.993
4.537LeuPro: 4.537 ± 2.241
9.074LeuGln: 9.074 ± 2.015
6.352LeuArg: 6.352 ± 6.84
6.806LeuSer: 6.806 ± 3.464
4.083LeuThr: 4.083 ± 2.359
5.898LeuVal: 5.898 ± 2.079
0.907LeuTrp: 0.907 ± 1.981
3.176LeuTyr: 3.176 ± 1.484
0.0LeuXaa: 0.0 ± 0.0
Met
1.361MetAla: 1.361 ± 0.672
0.0MetCys: 0.0 ± 0.0
1.815MetAsp: 1.815 ± 0.896
1.815MetGlu: 1.815 ± 0.896
0.0MetPhe: 0.0 ± 0.0
0.907MetGly: 0.907 ± 0.448
0.454MetHis: 0.454 ± 0.224
1.361MetIle: 1.361 ± 0.672
0.454MetLys: 0.454 ± 0.224
0.907MetLeu: 0.907 ± 0.448
1.361MetMet: 1.361 ± 0.672
0.907MetAsn: 0.907 ± 0.448
1.361MetPro: 1.361 ± 0.672
0.907MetGln: 0.907 ± 0.448
0.907MetArg: 0.907 ± 0.448
2.722MetSer: 2.722 ± 2.059
1.815MetThr: 1.815 ± 0.896
1.361MetVal: 1.361 ± 0.672
0.454MetTrp: 0.454 ± 0.224
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
0.907AsnAla: 0.907 ± 0.448
0.454AsnCys: 0.454 ± 0.224
0.907AsnAsp: 0.907 ± 0.448
3.176AsnGlu: 3.176 ± 1.569
1.815AsnPhe: 1.815 ± 0.896
3.176AsnGly: 3.176 ± 1.569
0.454AsnHis: 0.454 ± 0.224
1.815AsnIle: 1.815 ± 1.713
2.722AsnLys: 2.722 ± 2.059
5.898AsnLeu: 5.898 ± 5.382
0.454AsnMet: 0.454 ± 0.224
1.815AsnAsn: 1.815 ± 1.238
3.176AsnPro: 3.176 ± 1.484
3.176AsnGln: 3.176 ± 1.569
1.815AsnArg: 1.815 ± 1.238
2.722AsnSer: 2.722 ± 1.058
4.537AsnThr: 4.537 ± 1.198
0.907AsnVal: 0.907 ± 0.448
0.0AsnTrp: 0.0 ± 0.0
1.815AsnTyr: 1.815 ± 0.896
0.0AsnXaa: 0.0 ± 0.0
Pro
4.083ProAla: 4.083 ± 1.12
0.454ProCys: 0.454 ± 0.224
2.269ProAsp: 2.269 ± 1.12
3.63ProGlu: 3.63 ± 1.793
1.815ProPhe: 1.815 ± 1.713
2.269ProGly: 2.269 ± 1.12
1.815ProHis: 1.815 ± 0.896
1.815ProIle: 1.815 ± 0.896
3.176ProLys: 3.176 ± 1.032
5.898ProLeu: 5.898 ± 0.623
0.454ProMet: 0.454 ± 0.224
1.815ProAsn: 1.815 ± 0.896
2.722ProPro: 2.722 ± 1.531
2.269ProGln: 2.269 ± 1.609
1.815ProArg: 1.815 ± 0.896
2.722ProSer: 2.722 ± 1.344
3.176ProThr: 3.176 ± 1.569
1.815ProVal: 1.815 ± 0.896
0.454ProTrp: 0.454 ± 0.224
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
5.898GlnAla: 5.898 ± 1.686
0.0GlnCys: 0.0 ± 0.0
2.722GlnAsp: 2.722 ± 1.058
5.445GlnGlu: 5.445 ± 0.798
2.269GlnPhe: 2.269 ± 1.12
0.0GlnGly: 0.0 ± 0.0
0.454GlnHis: 0.454 ± 0.224
3.63GlnIle: 3.63 ± 1.793
4.083GlnLys: 4.083 ± 3.653
4.083GlnLeu: 4.083 ± 3.653
0.907GlnMet: 0.907 ± 0.448
1.815GlnAsn: 1.815 ± 2.499
2.722GlnPro: 2.722 ± 1.058
4.083GlnGln: 4.083 ± 4.123
4.083GlnArg: 4.083 ± 1.12
0.907GlnSer: 0.907 ± 1.532
3.63GlnThr: 3.63 ± 1.624
2.722GlnVal: 2.722 ± 1.344
0.0GlnTrp: 0.0 ± 0.0
1.815GlnTyr: 1.815 ± 0.896
0.0GlnXaa: 0.0 ± 0.0
Arg
1.361ArgAla: 1.361 ± 0.672
0.454ArgCys: 0.454 ± 0.224
1.815ArgAsp: 1.815 ± 0.896
4.083ArgGlu: 4.083 ± 2.359
0.907ArgPhe: 0.907 ± 0.448
3.176ArgGly: 3.176 ± 1.569
0.907ArgHis: 0.907 ± 0.448
4.991ArgIle: 4.991 ± 5.83
6.806ArgLys: 6.806 ± 2.074
4.083ArgLeu: 4.083 ± 1.489
2.722ArgMet: 2.722 ± 1.344
3.176ArgAsn: 3.176 ± 4.042
2.269ArgPro: 2.269 ± 1.13
1.361ArgGln: 1.361 ± 2.719
1.361ArgArg: 1.361 ± 0.672
3.176ArgSer: 3.176 ± 1.569
3.63ArgThr: 3.63 ± 1.793
4.991ArgVal: 4.991 ± 0.993
2.269ArgTrp: 2.269 ± 1.12
3.176ArgTyr: 3.176 ± 1.032
0.0ArgXaa: 0.0 ± 0.0
Ser
0.907SerAla: 0.907 ± 0.448
1.361SerCys: 1.361 ± 0.672
1.815SerAsp: 1.815 ± 0.896
4.537SerGlu: 4.537 ± 6.673
1.361SerPhe: 1.361 ± 0.672
4.537SerGly: 4.537 ± 1.541
1.361SerHis: 1.361 ± 3.232
1.361SerIle: 1.361 ± 0.672
4.991SerLys: 4.991 ± 4.337
6.352SerLeu: 6.352 ± 0.489
0.454SerMet: 0.454 ± 0.224
2.722SerAsn: 2.722 ± 1.531
2.722SerPro: 2.722 ± 1.344
2.269SerGln: 2.269 ± 1.13
4.991SerArg: 4.991 ± 0.993
2.722SerSer: 2.722 ± 1.058
4.991SerThr: 4.991 ± 1.623
3.176SerVal: 3.176 ± 1.032
0.907SerTrp: 0.907 ± 1.532
2.269SerTyr: 2.269 ± 1.12
0.0SerXaa: 0.0 ± 0.0
Thr
5.445ThrAla: 5.445 ± 0.798
0.907ThrCys: 0.907 ± 0.448
2.722ThrAsp: 2.722 ± 1.344
4.083ThrGlu: 4.083 ± 1.12
2.269ThrPhe: 2.269 ± 1.12
1.815ThrGly: 1.815 ± 0.896
1.815ThrHis: 1.815 ± 1.238
5.898ThrIle: 5.898 ± 1.858
3.63ThrLys: 3.63 ± 3.425
3.176ThrLeu: 3.176 ± 1.569
0.907ThrMet: 0.907 ± 0.448
0.907ThrAsn: 0.907 ± 0.448
4.083ThrPro: 4.083 ± 2.017
2.269ThrGln: 2.269 ± 1.13
2.722ThrArg: 2.722 ± 1.531
3.176ThrSer: 3.176 ± 1.84
4.537ThrThr: 4.537 ± 2.241
2.269ThrVal: 2.269 ± 1.609
0.907ThrTrp: 0.907 ± 0.448
1.361ThrTyr: 1.361 ± 1.374
0.0ThrXaa: 0.0 ± 0.0
Val
4.083ValAla: 4.083 ± 3.653
0.907ValCys: 0.907 ± 0.448
3.176ValAsp: 3.176 ± 1.484
5.898ValGlu: 5.898 ± 2.079
3.176ValPhe: 3.176 ± 1.484
4.083ValGly: 4.083 ± 1.409
0.907ValHis: 0.907 ± 0.448
3.176ValIle: 3.176 ± 1.569
1.361ValLys: 1.361 ± 1.374
7.26ValLeu: 7.26 ± 0.494
1.361ValMet: 1.361 ± 0.672
0.907ValAsn: 0.907 ± 0.448
3.63ValPro: 3.63 ± 1.624
1.815ValGln: 1.815 ± 0.896
4.083ValArg: 4.083 ± 2.017
4.537ValSer: 4.537 ± 1.225
2.722ValThr: 2.722 ± 1.058
4.537ValVal: 4.537 ± 2.241
0.0ValTrp: 0.0 ± 0.0
1.815ValTyr: 1.815 ± 0.896
0.0ValXaa: 0.0 ± 0.0
Trp
0.907TrpAla: 0.907 ± 0.448
0.0TrpCys: 0.0 ± 0.0
0.907TrpAsp: 0.907 ± 1.981
1.361TrpGlu: 1.361 ± 1.374
0.0TrpPhe: 0.0 ± 0.0
1.361TrpGly: 1.361 ± 0.672
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.361TrpLys: 1.361 ± 0.672
1.361TrpLeu: 1.361 ± 0.672
0.0TrpMet: 0.0 ± 0.0
0.454TrpAsn: 0.454 ± 0.224
0.0TrpPro: 0.0 ± 0.0
0.454TrpGln: 0.454 ± 0.224
0.907TrpArg: 0.907 ± 0.448
0.454TrpSer: 0.454 ± 1.704
0.907TrpThr: 0.907 ± 0.448
0.907TrpVal: 0.907 ± 0.448
0.907TrpTrp: 0.907 ± 0.448
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.269TyrAla: 2.269 ± 1.12
1.815TyrCys: 1.815 ± 0.896
0.907TyrAsp: 0.907 ± 0.448
2.269TyrGlu: 2.269 ± 1.12
1.361TyrPhe: 1.361 ± 0.672
0.907TyrGly: 0.907 ± 0.448
0.454TyrHis: 0.454 ± 0.224
2.722TyrIle: 2.722 ± 1.344
2.269TyrLys: 2.269 ± 1.609
2.269TyrLeu: 2.269 ± 2.278
1.361TyrMet: 1.361 ± 0.672
1.815TyrAsn: 1.815 ± 0.896
1.361TyrPro: 1.361 ± 0.672
2.269TyrGln: 2.269 ± 1.12
2.269TyrArg: 2.269 ± 2.902
2.722TyrSer: 2.722 ± 1.344
0.907TyrThr: 0.907 ± 0.448
0.454TyrVal: 0.454 ± 0.224
0.454TyrTrp: 0.454 ± 1.704
1.815TyrTyr: 1.815 ± 0.896
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2205 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski