Amino acid dipepetide frequency for Po-Circo-like virus 21

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.331AlaAla: 2.331 ± 1.246
1.166AlaCys: 1.166 ± 1.071
2.331AlaAsp: 2.331 ± 1.435
2.331AlaGlu: 2.331 ± 1.435
1.166AlaPhe: 1.166 ± 1.22
5.828AlaGly: 5.828 ± 1.802
2.331AlaHis: 2.331 ± 1.0
8.159AlaIle: 8.159 ± 2.452
2.331AlaLys: 2.331 ± 1.246
9.324AlaLeu: 9.324 ± 1.027
1.166AlaMet: 1.166 ± 0.717
0.0AlaAsn: 0.0 ± 0.0
3.497AlaPro: 3.497 ± 1.911
1.166AlaGln: 1.166 ± 1.22
4.662AlaArg: 4.662 ± 1.027
9.324AlaSer: 9.324 ± 4.457
1.166AlaThr: 1.166 ± 1.421
6.993AlaVal: 6.993 ± 2.64
0.0AlaTrp: 0.0 ± 0.0
2.331AlaTyr: 2.331 ± 1.0
0.0AlaXaa: 0.0 ± 0.0
Cys
2.331CysAla: 2.331 ± 1.717
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
2.331CysPhe: 2.331 ± 1.435
1.166CysGly: 1.166 ± 1.421
0.0CysHis: 0.0 ± 0.0
2.331CysIle: 2.331 ± 2.842
3.497CysLys: 3.497 ± 0.797
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.166CysPro: 1.166 ± 1.063
1.166CysGln: 1.166 ± 1.22
0.0CysArg: 0.0 ± 0.0
1.166CysSer: 1.166 ± 1.421
2.331CysThr: 2.331 ± 1.087
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.166CysTyr: 1.166 ± 1.22
0.0CysXaa: 0.0 ± 0.0
Asp
5.828AspAla: 5.828 ± 2.254
0.0AspCys: 0.0 ± 0.0
2.331AspAsp: 2.331 ± 1.246
5.828AspGlu: 5.828 ± 2.524
2.331AspPhe: 2.331 ± 2.126
0.0AspGly: 0.0 ± 0.0
0.0AspHis: 0.0 ± 0.0
1.166AspIle: 1.166 ± 1.22
2.331AspLys: 2.331 ± 1.435
1.166AspLeu: 1.166 ± 1.071
2.331AspMet: 2.331 ± 1.435
2.331AspAsn: 2.331 ± 1.246
6.993AspPro: 6.993 ± 4.177
5.828AspGln: 5.828 ± 1.974
2.331AspArg: 2.331 ± 1.0
1.166AspSer: 1.166 ± 1.063
2.331AspThr: 2.331 ± 1.246
3.497AspVal: 3.497 ± 3.189
1.166AspTrp: 1.166 ± 0.717
1.166AspTyr: 1.166 ± 1.063
0.0AspXaa: 0.0 ± 0.0
Glu
8.159GluAla: 8.159 ± 4.108
0.0GluCys: 0.0 ± 0.0
3.497GluAsp: 3.497 ± 2.152
6.993GluGlu: 6.993 ± 4.304
1.166GluPhe: 1.166 ± 0.717
1.166GluGly: 1.166 ± 0.717
1.166GluHis: 1.166 ± 0.717
2.331GluIle: 2.331 ± 1.435
1.166GluLys: 1.166 ± 0.717
6.993GluLeu: 6.993 ± 1.864
3.497GluMet: 3.497 ± 1.978
2.331GluAsn: 2.331 ± 1.116
3.497GluPro: 3.497 ± 1.944
3.497GluGln: 3.497 ± 3.189
0.0GluArg: 0.0 ± 0.0
2.331GluSer: 2.331 ± 1.0
5.828GluThr: 5.828 ± 1.463
3.497GluVal: 3.497 ± 1.504
0.0GluTrp: 0.0 ± 0.0
1.166GluTyr: 1.166 ± 1.063
0.0GluXaa: 0.0 ± 0.0
Phe
4.662PheAla: 4.662 ± 2.567
2.331PheCys: 2.331 ± 1.534
4.662PheAsp: 4.662 ± 3.296
4.662PheGlu: 4.662 ± 1.544
2.331PhePhe: 2.331 ± 1.717
2.331PheGly: 2.331 ± 1.0
1.166PheHis: 1.166 ± 1.063
1.166PheIle: 1.166 ± 0.717
0.0PheLys: 0.0 ± 0.0
3.497PheLeu: 3.497 ± 2.503
2.331PheMet: 2.331 ± 1.613
1.166PheAsn: 1.166 ± 1.421
0.0PhePro: 0.0 ± 0.0
0.0PheGln: 0.0 ± 0.0
2.331PheArg: 2.331 ± 1.087
9.324PheSer: 9.324 ± 5.46
5.828PheThr: 5.828 ± 3.587
5.828PheVal: 5.828 ± 4.284
0.0PheTrp: 0.0 ± 0.0
2.331PheTyr: 2.331 ± 1.246
0.0PheXaa: 0.0 ± 0.0
Gly
1.166GlyAla: 1.166 ± 1.071
1.166GlyCys: 1.166 ± 1.22
1.166GlyAsp: 1.166 ± 1.071
1.166GlyGlu: 1.166 ± 0.717
4.662GlyPhe: 4.662 ± 2.255
3.497GlyGly: 3.497 ± 2.152
1.166GlyHis: 1.166 ± 0.717
1.166GlyIle: 1.166 ± 1.22
2.331GlyLys: 2.331 ± 1.435
2.331GlyLeu: 2.331 ± 1.0
0.0GlyMet: 0.0 ± 0.0
8.159GlyAsn: 8.159 ± 2.346
2.331GlyPro: 2.331 ± 1.087
1.166GlyGln: 1.166 ± 1.071
2.331GlyArg: 2.331 ± 1.0
2.331GlySer: 2.331 ± 1.087
4.662GlyThr: 4.662 ± 2.0
5.828GlyVal: 5.828 ± 3.442
0.0GlyTrp: 0.0 ± 0.0
1.166GlyTyr: 1.166 ± 0.717
0.0GlyXaa: 0.0 ± 0.0
His
1.166HisAla: 1.166 ± 1.071
0.0HisCys: 0.0 ± 0.0
1.166HisAsp: 1.166 ± 1.063
1.166HisGlu: 1.166 ± 0.717
1.166HisPhe: 1.166 ± 1.421
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
1.166HisIle: 1.166 ± 0.717
0.0HisLys: 0.0 ± 0.0
5.828HisLeu: 5.828 ± 2.681
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
1.166HisGln: 1.166 ± 0.717
1.166HisArg: 1.166 ± 0.717
2.331HisSer: 2.331 ± 2.126
0.0HisThr: 0.0 ± 0.0
0.0HisVal: 0.0 ± 0.0
1.166HisTrp: 1.166 ± 0.717
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
5.828IleAla: 5.828 ± 1.219
1.166IleCys: 1.166 ± 1.421
5.828IleAsp: 5.828 ± 3.57
1.166IleGlu: 1.166 ± 1.071
1.166IlePhe: 1.166 ± 0.717
2.331IleGly: 2.331 ± 1.0
2.331IleHis: 2.331 ± 1.435
2.331IleIle: 2.331 ± 1.33
2.331IleLys: 2.331 ± 1.246
1.166IleLeu: 1.166 ± 1.063
1.166IleMet: 1.166 ± 1.071
1.166IleAsn: 1.166 ± 0.717
1.166IlePro: 1.166 ± 1.22
3.497IleGln: 3.497 ± 2.152
1.166IleArg: 1.166 ± 0.717
8.159IleSer: 8.159 ± 3.678
5.828IleThr: 5.828 ± 4.08
3.497IleVal: 3.497 ± 1.414
2.331IleTrp: 2.331 ± 1.435
2.331IleTyr: 2.331 ± 1.534
0.0IleXaa: 0.0 ± 0.0
Lys
1.166LysAla: 1.166 ± 0.717
2.331LysCys: 2.331 ± 1.0
3.497LysAsp: 3.497 ± 2.152
3.497LysGlu: 3.497 ± 1.504
1.166LysPhe: 1.166 ± 0.717
4.662LysGly: 4.662 ± 2.492
0.0LysHis: 0.0 ± 0.0
1.166LysIle: 1.166 ± 0.717
2.331LysLys: 2.331 ± 1.0
2.331LysLeu: 2.331 ± 1.435
0.0LysMet: 0.0 ± 0.0
5.828LysAsn: 5.828 ± 3.127
0.0LysPro: 0.0 ± 0.0
2.331LysGln: 2.331 ± 1.435
1.166LysArg: 1.166 ± 0.717
1.166LysSer: 1.166 ± 0.717
5.828LysThr: 5.828 ± 1.755
0.0LysVal: 0.0 ± 0.0
1.166LysTrp: 1.166 ± 0.717
6.993LysTyr: 6.993 ± 1.947
0.0LysXaa: 0.0 ± 0.0
Leu
4.662LeuAla: 4.662 ± 2.145
1.166LeuCys: 1.166 ± 1.421
5.828LeuAsp: 5.828 ± 3.074
4.662LeuGlu: 4.662 ± 2.091
4.662LeuPhe: 4.662 ± 3.296
1.166LeuGly: 1.166 ± 0.717
1.166LeuHis: 1.166 ± 1.421
3.497LeuIle: 3.497 ± 0.797
6.993LeuLys: 6.993 ± 2.379
0.0LeuLeu: 0.0 ± 0.0
0.0LeuMet: 0.0 ± 0.0
5.828LeuAsn: 5.828 ± 2.292
3.497LeuPro: 3.497 ± 2.503
2.331LeuGln: 2.331 ± 2.126
4.662LeuArg: 4.662 ± 2.174
5.828LeuSer: 5.828 ± 4.632
6.993LeuThr: 6.993 ± 1.83
5.828LeuVal: 5.828 ± 2.125
1.166LeuTrp: 1.166 ± 0.717
4.662LeuTyr: 4.662 ± 2.0
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
2.331MetAsp: 2.331 ± 1.435
0.0MetGlu: 0.0 ± 0.0
1.166MetPhe: 1.166 ± 1.071
1.166MetGly: 1.166 ± 0.717
0.0MetHis: 0.0 ± 0.0
1.166MetIle: 1.166 ± 0.717
0.0MetLys: 0.0 ± 0.0
3.497MetLeu: 3.497 ± 1.516
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
3.497MetPro: 3.497 ± 0.797
1.166MetGln: 1.166 ± 0.717
1.166MetArg: 1.166 ± 0.717
4.662MetSer: 4.662 ± 2.935
2.331MetThr: 2.331 ± 1.246
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.166MetTyr: 1.166 ± 0.717
0.0MetXaa: 0.0 ± 0.0
Asn
2.331AsnAla: 2.331 ± 1.611
0.0AsnCys: 0.0 ± 0.0
3.497AsnAsp: 3.497 ± 1.414
2.331AsnGlu: 2.331 ± 1.0
3.497AsnPhe: 3.497 ± 1.372
3.497AsnGly: 3.497 ± 1.627
1.166AsnHis: 1.166 ± 1.063
2.331AsnIle: 2.331 ± 1.246
2.331AsnLys: 2.331 ± 1.246
4.662AsnLeu: 4.662 ± 2.0
2.331AsnMet: 2.331 ± 1.435
3.497AsnAsn: 3.497 ± 1.414
4.662AsnPro: 4.662 ± 1.027
3.497AsnGln: 3.497 ± 2.449
1.166AsnArg: 1.166 ± 1.063
3.497AsnSer: 3.497 ± 1.706
0.0AsnThr: 0.0 ± 0.0
2.331AsnVal: 2.331 ± 1.611
1.166AsnTrp: 1.166 ± 1.071
3.497AsnTyr: 3.497 ± 2.449
0.0AsnXaa: 0.0 ± 0.0
Pro
2.331ProAla: 2.331 ± 2.126
1.166ProCys: 1.166 ± 1.421
2.331ProAsp: 2.331 ± 1.116
2.331ProGlu: 2.331 ± 1.435
6.993ProPhe: 6.993 ± 1.594
2.331ProGly: 2.331 ± 1.435
1.166ProHis: 1.166 ± 0.717
1.166ProIle: 1.166 ± 1.421
1.166ProLys: 1.166 ± 0.717
1.166ProLeu: 1.166 ± 1.22
1.166ProMet: 1.166 ± 1.071
1.166ProAsn: 1.166 ± 1.071
3.497ProPro: 3.497 ± 1.26
4.662ProGln: 4.662 ± 1.544
1.166ProArg: 1.166 ± 1.071
10.49ProSer: 10.49 ± 1.92
4.662ProThr: 4.662 ± 3.296
2.331ProVal: 2.331 ± 1.534
0.0ProTrp: 0.0 ± 0.0
1.166ProTyr: 1.166 ± 1.063
0.0ProXaa: 0.0 ± 0.0
Gln
2.331GlnAla: 2.331 ± 1.246
2.331GlnCys: 2.331 ± 1.087
2.331GlnAsp: 2.331 ± 2.126
4.662GlnGlu: 4.662 ± 2.091
3.497GlnPhe: 3.497 ± 1.414
6.993GlnGly: 6.993 ± 2.501
1.166GlnHis: 1.166 ± 0.717
3.497GlnIle: 3.497 ± 1.911
1.166GlnLys: 1.166 ± 0.717
2.331GlnLeu: 2.331 ± 1.246
1.166GlnMet: 1.166 ± 0.717
3.497GlnAsn: 3.497 ± 3.212
0.0GlnPro: 0.0 ± 0.0
2.331GlnGln: 2.331 ± 1.087
2.331GlnArg: 2.331 ± 1.435
2.331GlnSer: 2.331 ± 1.534
0.0GlnThr: 0.0 ± 0.0
3.497GlnVal: 3.497 ± 1.398
0.0GlnTrp: 0.0 ± 0.0
3.497GlnTyr: 3.497 ± 1.516
0.0GlnXaa: 0.0 ± 0.0
Arg
4.662ArgAla: 4.662 ± 2.091
2.331ArgCys: 2.331 ± 2.842
0.0ArgAsp: 0.0 ± 0.0
3.497ArgGlu: 3.497 ± 2.027
3.497ArgPhe: 3.497 ± 1.516
0.0ArgGly: 0.0 ± 0.0
1.166ArgHis: 1.166 ± 0.717
3.497ArgIle: 3.497 ± 1.414
1.166ArgLys: 1.166 ± 1.071
3.497ArgLeu: 3.497 ± 0.797
0.0ArgMet: 0.0 ± 0.0
2.331ArgAsn: 2.331 ± 1.0
0.0ArgPro: 0.0 ± 0.0
2.331ArgGln: 2.331 ± 1.435
5.828ArgArg: 5.828 ± 1.802
4.662ArgSer: 4.662 ± 1.591
0.0ArgThr: 0.0 ± 0.0
3.497ArgVal: 3.497 ± 1.372
2.331ArgTrp: 2.331 ± 1.435
3.497ArgTyr: 3.497 ± 1.944
0.0ArgXaa: 0.0 ± 0.0
Ser
6.993SerAla: 6.993 ± 3.014
0.0SerCys: 0.0 ± 0.0
4.662SerAsp: 4.662 ± 2.265
4.662SerGlu: 4.662 ± 1.027
8.159SerPhe: 8.159 ± 7.068
8.159SerGly: 8.159 ± 3.513
1.166SerHis: 1.166 ± 1.421
5.828SerIle: 5.828 ± 1.025
2.331SerLys: 2.331 ± 1.0
11.655SerLeu: 11.655 ± 6.489
2.331SerMet: 2.331 ± 1.659
4.662SerAsn: 4.662 ± 1.171
1.166SerPro: 1.166 ± 1.22
5.828SerGln: 5.828 ± 2.927
5.828SerArg: 5.828 ± 4.08
15.152SerSer: 15.152 ± 13.873
4.662SerThr: 4.662 ± 2.793
4.662SerVal: 4.662 ± 3.296
0.0SerTrp: 0.0 ± 0.0
1.166SerTyr: 1.166 ± 0.717
0.0SerXaa: 0.0 ± 0.0
Thr
6.993ThrAla: 6.993 ± 2.315
0.0ThrCys: 0.0 ± 0.0
2.331ThrAsp: 2.331 ± 2.126
2.331ThrGlu: 2.331 ± 1.0
0.0ThrPhe: 0.0 ± 0.0
1.166ThrGly: 1.166 ± 1.071
0.0ThrHis: 0.0 ± 0.0
3.497ThrIle: 3.497 ± 1.26
6.993ThrLys: 6.993 ± 3.479
6.993ThrLeu: 6.993 ± 1.594
0.0ThrMet: 0.0 ± 0.0
3.497ThrAsn: 3.497 ± 2.359
6.993ThrPro: 6.993 ± 2.599
1.166ThrGln: 1.166 ± 1.063
4.662ThrArg: 4.662 ± 2.89
5.828ThrSer: 5.828 ± 2.69
1.166ThrThr: 1.166 ± 1.22
6.993ThrVal: 6.993 ± 1.595
2.331ThrTrp: 2.331 ± 1.33
2.331ThrTyr: 2.331 ± 1.0
0.0ThrXaa: 0.0 ± 0.0
Val
1.166ValAla: 1.166 ± 0.717
0.0ValCys: 0.0 ± 0.0
2.331ValAsp: 2.331 ± 1.246
3.497ValGlu: 3.497 ± 1.504
4.662ValPhe: 4.662 ± 2.45
0.0ValGly: 0.0 ± 0.0
0.0ValHis: 0.0 ± 0.0
6.993ValIle: 6.993 ± 2.527
8.159ValLys: 8.159 ± 3.658
5.828ValLeu: 5.828 ± 2.421
3.497ValMet: 3.497 ± 1.739
1.166ValAsn: 1.166 ± 1.22
3.497ValPro: 3.497 ± 1.706
3.497ValGln: 3.497 ± 1.427
4.662ValArg: 4.662 ± 1.56
3.497ValSer: 3.497 ± 1.736
5.828ValThr: 5.828 ± 2.901
2.331ValVal: 2.331 ± 1.246
1.166ValTrp: 1.166 ± 1.071
1.166ValTyr: 1.166 ± 1.071
0.0ValXaa: 0.0 ± 0.0
Trp
1.166TrpAla: 1.166 ± 0.717
1.166TrpCys: 1.166 ± 0.717
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
1.166TrpGly: 1.166 ± 0.717
0.0TrpHis: 0.0 ± 0.0
1.166TrpIle: 1.166 ± 0.717
0.0TrpLys: 0.0 ± 0.0
1.166TrpLeu: 1.166 ± 0.717
0.0TrpMet: 0.0 ± 0.0
2.331TrpAsn: 2.331 ± 1.435
2.331TrpPro: 2.331 ± 1.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
2.331TrpSer: 2.331 ± 1.717
1.166TrpThr: 1.166 ± 0.717
0.0TrpVal: 0.0 ± 0.0
1.166TrpTrp: 1.166 ± 0.717
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.331TyrAla: 2.331 ± 2.142
2.331TyrCys: 2.331 ± 1.087
0.0TyrAsp: 0.0 ± 0.0
4.662TyrGlu: 4.662 ± 1.027
2.331TyrPhe: 2.331 ± 1.513
1.166TyrGly: 1.166 ± 1.063
2.331TyrHis: 2.331 ± 1.087
2.331TyrIle: 2.331 ± 2.439
0.0TyrLys: 0.0 ± 0.0
1.166TyrLeu: 1.166 ± 0.717
1.166TyrMet: 1.166 ± 0.717
2.331TyrAsn: 2.331 ± 1.0
4.662TyrPro: 4.662 ± 2.086
2.331TyrGln: 2.331 ± 1.246
1.166TyrArg: 1.166 ± 0.717
3.497TyrSer: 3.497 ± 1.911
4.662TyrThr: 4.662 ± 3.42
2.331TyrVal: 2.331 ± 1.611
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (859 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski