Amino acid dipeptide frequency for Ashy storm petrel gyrovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
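As an illustration of what such a calculation can look like, here is a minimal Python sketch. It is not the code used by Proteome-pI: the function name `dipeptide_permille`, the use of one-letter codes (with `X` standing for Xaa), and the choice not to count pairs that span two proteins are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumption: input uses one-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins here.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Two toy sequences (hypothetical, not taken from the proteome).
freqs = dipeptide_permille(["MKRRA", "RRU"])
print(freqs)
```

In the toy input, `U` is not one of the 20 standard residues, so the pair `RU` is counted as `RX`.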

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
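The uniform expectation and the over/under comparison can be sketched in a few lines of Python. The function names are hypothetical, and the exact rule the page uses to pick red or blue is not stated beyond the comparison with the expected value, so the plain greater-than/less-than test below is an assumption.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_permille, alphabet_size=20):
    """Label a dipeptide frequency relative to the uniform expectation."""
    expected = uniform_expectation_permille(alphabet_size)
    if freq_permille > expected:
        return "over"    # would be marked red
    if freq_permille < expected:
        return "under"   # would be marked blue
    return "expected"


print(uniform_expectation_permille(20))  # 400 possible dipeptides
print(uniform_expectation_permille(21))  # 441 possible dipeptides, including Xaa
print(classify(19.585))  # ArgArg value from the table
print(classify(1.152))   # AlaCys value from the table
```

With 20 amino acids the expectation is 2.5‰, so ArgArg (19.585‰) falls well above it and AlaCys (1.152‰) below it.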

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
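A short Python sketch of the unit conversion, using the ArgArg value from the table as input. The helper names are made up for this example.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_permille / 10.0


arg_arg = 19.585  # ArgArg frequency in per mille, from the table
print(permille_to_fraction(arg_arg))
print(permille_to_percent(arg_arg))
```

In other words, ArgArg accounts for roughly 2% of all dipeptides in this proteome.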

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.608 ± 2.287
AlaCys: 1.152 ± 1.123
AlaAsp: 2.304 ± 2.882
AlaGlu: 0.0 ± 0.0
AlaPhe: 2.304 ± 1.965
AlaGly: 2.304 ± 2.061
AlaHis: 2.304 ± 1.223
AlaIle: 5.76 ± 2.256
AlaLys: 2.304 ± 1.356
AlaLeu: 6.912 ± 5.022
AlaMet: 2.304 ± 1.356
AlaAsn: 4.608 ± 1.705
AlaPro: 5.76 ± 5.855
AlaGln: 4.608 ± 2.287
AlaArg: 3.456 ± 1.596
AlaSer: 5.76 ± 3.624
AlaThr: 3.456 ± 1.323
AlaVal: 2.304 ± 1.356
AlaTrp: 3.456 ± 2.033
AlaTyr: 1.152 ± 1.47
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.152 ± 2.176
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.152 ± 0.678
CysLys: 1.152 ± 0.678
CysLeu: 1.152 ± 1.123
CysMet: 0.0 ± 0.0
CysAsn: 1.152 ± 0.678
CysPro: 4.608 ± 2.605
CysGln: 0.0 ± 0.0
CysArg: 2.304 ± 2.245
CysSer: 1.152 ± 1.123
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.304 ± 1.781
AspCys: 2.304 ± 0.944
AspAsp: 6.912 ± 5.274
AspGlu: 2.304 ± 1.781
AspPhe: 2.304 ± 2.245
AspGly: 4.608 ± 1.888
AspHis: 1.152 ± 1.47
AspIle: 1.152 ± 1.123
AspLys: 1.152 ± 0.678
AspLeu: 2.304 ± 0.944
AspMet: 0.0 ± 0.0
AspAsn: 3.456 ± 1.2
AspPro: 3.456 ± 1.596
AspGln: 2.304 ± 1.356
AspArg: 1.152 ± 1.123
AspSer: 0.0 ± 0.0
AspThr: 9.217 ± 4.288
AspVal: 2.304 ± 1.356
AspTrp: 0.0 ± 0.0
AspTyr: 1.152 ± 0.678
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.304 ± 2.245
GluCys: 1.152 ± 0.678
GluAsp: 2.304 ± 2.245
GluGlu: 4.608 ± 2.605
GluPhe: 1.152 ± 0.678
GluGly: 0.0 ± 0.0
GluHis: 1.152 ± 1.47
GluIle: 3.456 ± 1.2
GluLys: 0.0 ± 0.0
GluLeu: 1.152 ± 1.123
GluMet: 2.304 ± 1.356
GluAsn: 2.304 ± 1.223
GluPro: 3.456 ± 1.596
GluGln: 1.152 ± 1.123
GluArg: 2.304 ± 2.245
GluSer: 0.0 ± 0.0
GluThr: 4.608 ± 2.178
GluVal: 0.0 ± 0.0
GluTrp: 1.152 ± 0.678
GluTyr: 1.152 ± 1.47
GluXaa: 0.0 ± 0.0
Phe
PheAla: 6.912 ± 4.085
PheCys: 0.0 ± 0.0
PheAsp: 3.456 ± 3.368
PheGlu: 0.0 ± 0.0
PhePhe: 1.152 ± 0.678
PheGly: 0.0 ± 0.0
PheHis: 0.0 ± 0.0
PheIle: 0.0 ± 0.0
PheLys: 3.456 ± 1.2
PheLeu: 3.456 ± 1.2
PheMet: 1.152 ± 0.678
PheAsn: 1.152 ± 0.678
PhePro: 4.608 ± 2.207
PheGln: 1.152 ± 0.678
PheArg: 2.304 ± 0.944
PheSer: 5.76 ± 2.388
PheThr: 1.152 ± 0.678
PheVal: 0.0 ± 0.0
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.456 ± 1.2
GlyCys: 0.0 ± 0.0
GlyAsp: 1.152 ± 0.678
GlyGlu: 1.152 ± 1.123
GlyPhe: 0.0 ± 0.0
GlyGly: 4.608 ± 1.705
GlyHis: 0.0 ± 0.0
GlyIle: 8.065 ± 3.896
GlyLys: 2.304 ± 0.944
GlyLeu: 2.304 ± 0.944
GlyMet: 0.0 ± 0.0
GlyAsn: 0.0 ± 0.0
GlyPro: 5.76 ± 2.251
GlyGln: 4.608 ± 1.709
GlyArg: 9.217 ± 5.841
GlySer: 3.456 ± 1.323
GlyThr: 5.76 ± 1.378
GlyVal: 1.152 ± 0.678
GlyTrp: 1.152 ± 0.678
GlyTyr: 2.304 ± 1.356
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.152 ± 0.678
HisPhe: 3.456 ± 2.589
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 3.456 ± 1.296
HisLeu: 1.152 ± 1.47
HisMet: 0.0 ± 0.0
HisAsn: 1.152 ± 0.678
HisPro: 3.456 ± 1.596
HisGln: 1.152 ± 1.47
HisArg: 3.456 ± 3.067
HisSer: 2.304 ± 2.245
HisThr: 2.304 ± 0.944
HisVal: 2.304 ± 2.061
HisTrp: 0.0 ± 0.0
HisTyr: 1.152 ± 0.678
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.152 ± 1.47
IleCys: 2.304 ± 0.944
IleAsp: 2.304 ± 1.356
IleGlu: 0.0 ± 0.0
IlePhe: 1.152 ± 0.678
IleGly: 3.456 ± 1.2
IleHis: 0.0 ± 0.0
IleIle: 3.456 ± 1.323
IleLys: 1.152 ± 0.678
IleLeu: 8.065 ± 3.296
IleMet: 2.304 ± 1.356
IleAsn: 2.304 ± 2.061
IlePro: 3.456 ± 1.2
IleGln: 2.304 ± 1.356
IleArg: 4.608 ± 3.616
IleSer: 3.456 ± 1.977
IleThr: 3.456 ± 1.596
IleVal: 4.608 ± 1.328
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.456 ± 2.506
LysCys: 0.0 ± 0.0
LysAsp: 2.304 ± 1.781
LysGlu: 1.152 ± 0.678
LysPhe: 2.304 ± 1.356
LysGly: 2.304 ± 1.356
LysHis: 1.152 ± 0.678
LysIle: 5.76 ± 1.203
LysLys: 0.0 ± 0.0
LysLeu: 2.304 ± 1.223
LysMet: 1.152 ± 0.678
LysAsn: 1.152 ± 0.678
LysPro: 5.76 ± 2.301
LysGln: 2.304 ± 2.245
LysArg: 4.608 ± 2.094
LysSer: 2.304 ± 0.944
LysThr: 1.152 ± 0.678
LysVal: 3.456 ± 1.596
LysTrp: 1.152 ± 0.678
LysTyr: 1.152 ± 1.123
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.912 ± 1.718
LeuCys: 2.304 ± 0.944
LeuAsp: 2.304 ± 0.944
LeuGlu: 3.456 ± 3.067
LeuPhe: 4.608 ± 1.328
LeuGly: 4.608 ± 3.931
LeuHis: 2.304 ± 0.944
LeuIle: 2.304 ± 1.356
LeuLys: 4.608 ± 1.051
LeuLeu: 10.369 ± 2.155
LeuMet: 1.152 ± 0.678
LeuAsn: 3.456 ± 1.2
LeuPro: 5.76 ± 1.726
LeuGln: 3.456 ± 1.96
LeuArg: 9.217 ± 4.795
LeuSer: 5.76 ± 3.04
LeuThr: 1.152 ± 0.678
LeuVal: 0.0 ± 0.0
LeuTrp: 1.152 ± 1.123
LeuTyr: 1.152 ± 1.123
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.0 ± 0.0
MetCys: 0.0 ± 0.0
MetAsp: 1.152 ± 2.176
MetGlu: 0.0 ± 0.0
MetPhe: 1.152 ± 0.678
MetGly: 2.304 ± 1.356
MetHis: 1.152 ± 0.678
MetIle: 0.0 ± 0.0
MetLys: 1.152 ± 0.678
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 1.152 ± 0.678
MetPro: 1.152 ± 1.47
MetGln: 2.304 ± 1.356
MetArg: 0.0 ± 0.0
MetSer: 3.456 ± 1.596
MetThr: 3.456 ± 2.033
MetVal: 1.152 ± 0.678
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.304 ± 2.94
AsnCys: 0.0 ± 0.0
AsnAsp: 1.152 ± 0.678
AsnGlu: 2.304 ± 0.944
AsnPhe: 0.0 ± 0.0
AsnGly: 2.304 ± 0.944
AsnHis: 4.608 ± 2.094
AsnIle: 1.152 ± 0.678
AsnLys: 2.304 ± 0.944
AsnLeu: 1.152 ± 0.678
AsnMet: 0.0 ± 0.0
AsnAsn: 0.0 ± 0.0
AsnPro: 5.76 ± 2.05
AsnGln: 2.304 ± 1.356
AsnArg: 3.456 ± 1.596
AsnSer: 2.304 ± 1.356
AsnThr: 6.912 ± 2.832
AsnVal: 3.456 ± 2.033
AsnTrp: 2.304 ± 2.245
AsnTyr: 1.152 ± 0.678
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.152 ± 1.123
ProCys: 2.304 ± 2.882
ProAsp: 3.456 ± 1.296
ProGlu: 1.152 ± 1.123
ProPhe: 1.152 ± 1.123
ProGly: 5.76 ± 1.378
ProHis: 2.304 ± 2.061
ProIle: 2.304 ± 1.965
ProLys: 6.912 ± 3.92
ProLeu: 5.76 ± 2.05
ProMet: 1.152 ± 0.678
ProAsn: 3.456 ± 1.296
ProPro: 9.217 ± 1.723
ProGln: 6.912 ± 2.933
ProArg: 6.912 ± 8.183
ProSer: 6.912 ± 1.58
ProThr: 9.217 ± 3.419
ProVal: 4.608 ± 2.207
ProTrp: 2.304 ± 1.356
ProTyr: 5.76 ± 2.301
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.152 ± 0.678
GlnCys: 0.0 ± 0.0
GlnAsp: 4.608 ± 2.711
GlnGlu: 2.304 ± 2.245
GlnPhe: 1.152 ± 1.47
GlnGly: 3.456 ± 1.323
GlnHis: 0.0 ± 0.0
GlnIle: 1.152 ± 0.678
GlnLys: 1.152 ± 0.678
GlnLeu: 2.304 ± 0.944
GlnMet: 3.456 ± 1.347
GlnAsn: 2.304 ± 0.944
GlnPro: 4.608 ± 2.207
GlnGln: 1.152 ± 0.678
GlnArg: 3.456 ± 2.752
GlnSer: 2.304 ± 1.356
GlnThr: 9.217 ± 3.411
GlnVal: 1.152 ± 0.678
GlnTrp: 0.0 ± 0.0
GlnTyr: 4.608 ± 1.705
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.369 ± 7.739
ArgCys: 1.152 ± 1.123
ArgAsp: 3.456 ± 3.368
ArgGlu: 1.152 ± 2.176
ArgPhe: 5.76 ± 3.884
ArgGly: 5.76 ± 1.462
ArgHis: 2.304 ± 1.223
ArgIle: 1.152 ± 0.678
ArgLys: 3.456 ± 3.067
ArgLeu: 6.912 ± 3.668
ArgMet: 0.0 ± 0.0
ArgAsn: 6.912 ± 1.58
ArgPro: 4.608 ± 1.888
ArgGln: 2.304 ± 1.965
ArgArg: 19.585 ± 4.527
ArgSer: 4.608 ± 6.217
ArgThr: 4.608 ± 2.094
ArgVal: 1.152 ± 0.678
ArgTrp: 6.912 ± 3.091
ArgTyr: 9.217 ± 4.099
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.912 ± 5.024
SerCys: 1.152 ± 1.47
SerAsp: 3.456 ± 1.96
SerGlu: 4.608 ± 1.051
SerPhe: 4.608 ± 1.705
SerGly: 0.0 ± 0.0
SerHis: 1.152 ± 1.123
SerIle: 3.456 ± 4.892
SerLys: 4.608 ± 1.937
SerLeu: 4.608 ± 2.207
SerMet: 0.0 ± 1.266
SerAsn: 5.76 ± 2.301
SerPro: 4.608 ± 1.888
SerGln: 1.152 ± 0.678
SerArg: 6.912 ± 3.219
SerSer: 8.065 ± 4.428
SerThr: 4.608 ± 2.605
SerVal: 2.304 ± 2.245
SerTrp: 0.0 ± 0.0
SerTyr: 1.152 ± 1.123
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.456 ± 2.512
ThrCys: 0.0 ± 0.0
ThrAsp: 5.76 ± 2.05
ThrGlu: 6.912 ± 1.202
ThrPhe: 2.304 ± 1.356
ThrGly: 9.217 ± 3.75
ThrHis: 2.304 ± 2.245
ThrIle: 5.76 ± 1.378
ThrLys: 1.152 ± 0.678
ThrLeu: 10.369 ± 1.113
ThrMet: 1.152 ± 0.678
ThrAsn: 1.152 ± 1.47
ThrPro: 8.065 ± 2.212
ThrGln: 4.608 ± 1.888
ThrArg: 5.76 ± 3.389
ThrSer: 4.608 ± 1.051
ThrThr: 5.76 ± 1.203
ThrVal: 3.456 ± 1.2
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.152 ± 0.678
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.608 ± 2.711
ValCys: 0.0 ± 0.0
ValAsp: 2.304 ± 2.245
ValGlu: 1.152 ± 0.678
ValPhe: 0.0 ± 0.0
ValGly: 2.304 ± 1.356
ValHis: 1.152 ± 2.176
ValIle: 0.0 ± 0.0
ValLys: 2.304 ± 1.965
ValLeu: 3.456 ± 1.96
ValMet: 1.152 ± 2.176
ValAsn: 1.152 ± 0.678
ValPro: 2.304 ± 0.944
ValGln: 3.456 ± 1.977
ValArg: 2.304 ± 1.356
ValSer: 2.304 ± 1.356
ValThr: 3.456 ± 1.2
ValVal: 0.0 ± 0.0
ValTrp: 0.0 ± 0.0
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.456 ± 2.033
TrpCys: 0.0 ± 0.0
TrpAsp: 1.152 ± 0.678
TrpGlu: 2.304 ± 0.944
TrpPhe: 1.152 ± 1.123
TrpGly: 1.152 ± 2.176
TrpHis: 1.152 ± 1.123
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 1.152 ± 0.678
TrpAsn: 0.0 ± 0.0
TrpPro: 1.152 ± 0.678
TrpGln: 0.0 ± 0.0
TrpArg: 3.456 ± 2.033
TrpSer: 3.456 ± 2.033
TrpThr: 1.152 ± 0.678
TrpVal: 0.0 ± 0.0
TrpTrp: 1.152 ± 0.678
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.152 ± 0.678
TyrCys: 0.0 ± 0.0
TyrAsp: 0.0 ± 0.0
TyrGlu: 0.0 ± 0.0
TyrPhe: 0.0 ± 0.0
TyrGly: 2.304 ± 0.944
TyrHis: 2.304 ± 1.781
TyrIle: 3.456 ± 2.033
TyrLys: 2.304 ± 1.356
TyrLeu: 2.304 ± 1.356
TyrMet: 0.0 ± 0.0
TyrAsn: 2.304 ± 0.944
TyrPro: 1.152 ± 0.678
TyrGln: 2.304 ± 0.944
TyrArg: 6.912 ± 2.933
TyrSer: 2.304 ± 2.94
TyrThr: 2.304 ± 1.356
TyrVal: 0.0 ± 0.0
TyrTrp: 1.152 ± 0.678
TyrTyr: 1.152 ± 1.47
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (869 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
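The page does not describe the bootstrap procedure in detail. One plausible reading, sketched below in Python, is to resample whole proteins with replacement, recompute the dipeptide frequency for each replicate, and report the spread of those values. The function names, the fixed seed, and the use of the population standard deviation are assumptions for illustration, not the database's actual implementation.

```python
import random
import statistics
from collections import Counter


def permille(proteins):
    """Per-mille dipeptide frequencies over a list of sequences (one-letter codes)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Spread of one dipeptide's frequency when proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Resample at the protein level: draw len(proteins) whole sequences.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(permille(sample).get(dipeptide, 0.0))
    return statistics.pstdev(values)


# Toy sequences (hypothetical, not taken from the proteome).
toy = ["RRRR", "AAAA", "RRAA"]
print(bootstrap_error(toy, "RR"))
print(bootstrap_error(toy, "WW"))  # a dipeptide that never occurs
```

Under this reading, a dipeptide that is absent from every protein has the same frequency (zero) in every replicate, so its estimated error is zero.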

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski