Amino acid dipeptide frequency for Seal anellovirus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be expected at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
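The counting described above can be illustrated with a minimal Python sketch. It is not the code used by Proteome-pI: the function name `dipeptide_permille`, the toy sequences, and the choice to normalize by the number of overlapping pairs are assumptions made for illustration, and the database may normalize differently.

```python
from collections import Counter
from itertools import product

# Three-letter codes in the column order of the table; anything else maps to Xaa.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption: frequencies are normalized by the total number of
    overlapping pairs found in the given sequences.
    """
    counts = Counter()
    for seq in proteins:
        residues = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        # Pairs never span two proteins: each sequence is scanned separately.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


# Hypothetical toy sequences, not the Seal anellovirus 2 proteome.
toy = ["MARRGG", "GGPK"]
freqs = dipeptide_permille(toy)

# Uniform expectations for comparison with the table (values in per mille).
uniform_400 = 1000.0 / 400  # 20 x 20 standard dipeptides
uniform_441 = 1000.0 / 441  # 21 x 21 when Xaa is included
```

The result always has 441 keys, so dipeptides that never occur appear with a frequency of 0.0, as in the table.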

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
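As a small worked example of the unit conversion, the sketch below turns a few values from the table into fractions and compares them with the uniform expectation. The helper names are hypothetical, and using 1/400 (2.5‰) as the baseline for "more than expected" is an assumption; the page does not state which baseline drives the colouring.

```python
# Uniform expectation over 400 dipeptides, in per mille (assumed baseline).
EXPECTED_PERMILLE = 1000.0 / 400


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def representation(value, expected=EXPECTED_PERMILLE):
    """Classify a per mille value relative to the expected frequency."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values copied from the table (per mille).
table_values = {"ArgArg": 19.711, "GlyGly": 15.769, "AlaAla": 3.942, "AlaCys": 0.0}

summary = {
    name: (permille_to_fraction(value), representation(value))
    for name, value in table_values.items()
}
```

For instance, ArgArg at 19.711‰ corresponds to a fraction of about 0.0197, roughly eight times the uniform expectation of 0.0025.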

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.942 ± 2.384
AlaCys: 0.0 ± 0.0
AlaAsp: 2.628 ± 2.132
AlaGlu: 1.314 ± 0.795
AlaPhe: 0.0 ± 0.0
AlaGly: 6.57 ± 3.935
AlaHis: 0.0 ± 0.0
AlaIle: 0.0 ± 0.0
AlaLys: 9.198 ± 3.958
AlaLeu: 3.942 ± 2.08
AlaMet: 2.628 ± 0.908
AlaAsn: 1.314 ± 0.795
AlaPro: 2.628 ± 1.016
AlaGln: 1.314 ± 0.795
AlaArg: 3.942 ± 2.06
AlaSer: 3.942 ± 1.03
AlaThr: 2.628 ± 1.016
AlaVal: 1.314 ± 0.795
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.314 ± 0.795
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.314 ± 0.795
CysCys: 1.314 ± 0.795
CysAsp: 1.314 ± 2.472
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 2.628 ± 2.132
CysHis: 1.314 ± 2.472
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 1.314 ± 1.505
CysMet: 0.0 ± 0.0
CysAsn: 2.628 ± 2.132
CysPro: 1.314 ± 0.795
CysGln: 2.628 ± 1.589
CysArg: 1.314 ± 1.505
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 1.314 ± 0.795
CysTrp: 1.314 ± 0.795
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.628 ± 2.132
AspCys: 2.628 ± 2.132
AspAsp: 3.942 ± 2.384
AspGlu: 3.942 ± 2.442
AspPhe: 1.314 ± 1.505
AspGly: 1.314 ± 2.472
AspHis: 2.628 ± 2.132
AspIle: 2.628 ± 1.016
AspLys: 1.314 ± 0.795
AspLeu: 5.256 ± 2.939
AspMet: 2.628 ± 2.132
AspAsn: 1.314 ± 1.505
AspPro: 7.884 ± 2.954
AspGln: 1.314 ± 0.795
AspArg: 2.628 ± 1.589
AspSer: 6.57 ± 4.851
AspThr: 3.942 ± 2.06
AspVal: 1.314 ± 0.795
AspTrp: 1.314 ± 0.795
AspTyr: 5.256 ± 1.534
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.256 ± 1.469
GluCys: 0.0 ± 0.0
GluAsp: 1.314 ± 0.795
GluGlu: 2.628 ± 3.01
GluPhe: 2.628 ± 2.132
GluGly: 3.942 ± 2.06
GluHis: 1.314 ± 0.795
GluIle: 0.0 ± 0.0
GluLys: 3.942 ± 4.515
GluLeu: 9.198 ± 4.805
GluMet: 2.628 ± 1.198
GluAsn: 3.942 ± 2.442
GluPro: 2.628 ± 1.589
GluGln: 0.0 ± 0.0
GluArg: 2.628 ± 1.589
GluSer: 3.942 ± 2.442
GluThr: 5.256 ± 3.927
GluVal: 2.628 ± 1.589
GluTrp: 0.0 ± 0.0
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 1.314 ± 2.472
PheAsp: 0.0 ± 0.0
PheGlu: 0.0 ± 0.0
PhePhe: 2.628 ± 1.589
PheGly: 1.314 ± 0.795
PheHis: 3.942 ± 1.03
PheIle: 1.314 ± 1.505
PheLys: 5.256 ± 2.281
PheLeu: 2.628 ± 2.132
PheMet: 0.0 ± 0.0
PheAsn: 1.314 ± 0.795
PhePro: 2.628 ± 1.589
PheGln: 2.628 ± 1.589
PheArg: 2.628 ± 1.589
PheSer: 2.628 ± 1.589
PheThr: 2.628 ± 1.589
PheVal: 1.314 ± 0.795
PheTrp: 1.314 ± 0.795
PheTyr: 1.314 ± 0.795
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.0 ± 0.0
GlyCys: 1.314 ± 2.472
GlyAsp: 10.512 ± 5.629
GlyGlu: 5.256 ± 1.469
GlyPhe: 2.628 ± 1.589
GlyGly: 15.769 ± 18.325
GlyHis: 2.628 ± 1.589
GlyIle: 3.942 ± 2.08
GlyLys: 2.628 ± 1.016
GlyLeu: 3.942 ± 2.08
GlyMet: 0.0 ± 0.0
GlyAsn: 1.314 ± 0.795
GlyPro: 9.198 ± 1.799
GlyGln: 2.628 ± 1.016
GlyArg: 2.628 ± 1.016
GlySer: 2.628 ± 2.132
GlyThr: 5.256 ± 1.469
GlyVal: 2.628 ± 1.016
GlyTrp: 3.942 ± 2.384
GlyTyr: 1.314 ± 0.795
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.314 ± 0.795
HisCys: 0.0 ± 0.0
HisAsp: 1.314 ± 0.795
HisGlu: 0.0 ± 0.0
HisPhe: 2.628 ± 2.132
HisGly: 2.628 ± 1.589
HisHis: 2.628 ± 1.589
HisIle: 1.314 ± 0.795
HisLys: 0.0 ± 0.0
HisLeu: 0.0 ± 0.0
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 2.628 ± 2.132
HisGln: 0.0 ± 0.0
HisArg: 6.57 ± 1.12
HisSer: 2.628 ± 1.016
HisThr: 3.942 ± 1.03
HisVal: 0.0 ± 0.0
HisTrp: 1.314 ± 0.795
HisTyr: 1.314 ± 0.795
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.0 ± 0.0
IleCys: 3.942 ± 1.03
IleAsp: 1.314 ± 0.795
IleGlu: 1.314 ± 1.505
IlePhe: 1.314 ± 0.795
IleGly: 1.314 ± 2.472
IleHis: 0.0 ± 0.0
IleIle: 1.314 ± 0.795
IleLys: 2.628 ± 1.016
IleLeu: 6.57 ± 1.886
IleMet: 1.314 ± 0.795
IleAsn: 0.0 ± 0.0
IlePro: 2.628 ± 1.589
IleGln: 0.0 ± 0.0
IleArg: 3.942 ± 2.442
IleSer: 3.942 ± 4.515
IleThr: 2.628 ± 2.132
IleVal: 1.314 ± 0.795
IleTrp: 1.314 ± 0.795
IleTyr: 1.314 ± 1.505
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.628 ± 1.016
LysCys: 1.314 ± 0.795
LysAsp: 2.628 ± 1.589
LysGlu: 7.884 ± 1.27
LysPhe: 1.314 ± 0.795
LysGly: 2.628 ± 1.589
LysHis: 0.0 ± 0.0
LysIle: 2.628 ± 1.589
LysLys: 1.314 ± 1.505
LysLeu: 3.942 ± 2.08
LysMet: 0.0 ± 0.0
LysAsn: 2.628 ± 1.589
LysPro: 1.314 ± 0.795
LysGln: 2.628 ± 2.132
LysArg: 2.628 ± 1.016
LysSer: 3.942 ± 1.03
LysThr: 9.198 ± 4.421
LysVal: 3.942 ± 2.384
LysTrp: 3.942 ± 5.047
LysTyr: 5.256 ± 1.469
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.256 ± 2.032
LeuCys: 2.628 ± 2.132
LeuAsp: 6.57 ± 1.12
LeuGlu: 6.57 ± 3.424
LeuPhe: 1.314 ± 0.795
LeuGly: 1.314 ± 0.795
LeuHis: 1.314 ± 2.472
LeuIle: 3.942 ± 4.515
LeuLys: 3.942 ± 1.03
LeuLeu: 6.57 ± 1.12
LeuMet: 1.314 ± 1.505
LeuAsn: 3.942 ± 2.442
LeuPro: 7.884 ± 1.27
LeuGln: 9.198 ± 4.421
LeuArg: 2.628 ± 2.132
LeuSer: 7.884 ± 3.403
LeuThr: 1.314 ± 0.795
LeuVal: 2.628 ± 3.01
LeuTrp: 2.628 ± 1.589
LeuTyr: 1.314 ± 2.472
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.314 ± 0.795
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 1.314 ± 2.472
MetPhe: 1.314 ± 0.795
MetGly: 0.0 ± 0.0
MetHis: 1.314 ± 0.795
MetIle: 1.314 ± 1.505
MetLys: 0.0 ± 0.0
MetLeu: 1.314 ± 1.505
MetMet: 1.314 ± 1.505
MetAsn: 1.314 ± 0.795
MetPro: 0.0 ± 0.0
MetGln: 1.314 ± 0.795
MetArg: 0.0 ± 0.0
MetSer: 2.628 ± 2.785
MetThr: 1.314 ± 1.505
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 1.314 ± 0.795
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.0 ± 0.0
AsnCys: 0.0 ± 0.0
AsnAsp: 0.0 ± 0.0
AsnGlu: 0.0 ± 0.0
AsnPhe: 2.628 ± 2.785
AsnGly: 1.314 ± 0.795
AsnHis: 0.0 ± 0.0
AsnIle: 2.628 ± 1.589
AsnLys: 2.628 ± 1.589
AsnLeu: 2.628 ± 1.016
AsnMet: 1.314 ± 1.505
AsnAsn: 2.628 ± 2.132
AsnPro: 3.942 ± 1.03
AsnGln: 1.314 ± 0.795
AsnArg: 1.314 ± 0.795
AsnSer: 1.314 ± 1.505
AsnThr: 5.256 ± 1.534
AsnVal: 1.314 ± 1.505
AsnTrp: 1.314 ± 0.795
AsnTyr: 3.942 ± 1.03
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.256 ± 3.179
ProCys: 0.0 ± 0.0
ProAsp: 1.314 ± 2.472
ProGlu: 2.628 ± 1.016
ProPhe: 2.628 ± 1.589
ProGly: 14.455 ± 1.979
ProHis: 2.628 ± 1.016
ProIle: 1.314 ± 0.795
ProLys: 6.57 ± 1.886
ProLeu: 3.942 ± 2.442
ProMet: 0.0 ± 0.0
ProAsn: 1.314 ± 0.795
ProPro: 3.942 ± 2.442
ProGln: 3.942 ± 2.06
ProArg: 5.256 ± 1.534
ProSer: 5.256 ± 1.534
ProThr: 5.256 ± 1.534
ProVal: 0.0 ± 0.0
ProTrp: 3.942 ± 2.384
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.314 ± 0.795
GlnCys: 0.0 ± 0.0
GlnAsp: 2.628 ± 2.132
GlnGlu: 5.256 ± 3.927
GlnPhe: 1.314 ± 0.795
GlnGly: 0.0 ± 0.0
GlnHis: 1.314 ± 0.795
GlnIle: 1.314 ± 0.795
GlnLys: 2.628 ± 3.01
GlnLeu: 3.942 ± 2.08
GlnMet: 0.0 ± 0.0
GlnAsn: 3.942 ± 2.384
GlnPro: 3.942 ± 1.03
GlnGln: 2.628 ± 2.132
GlnArg: 3.942 ± 2.442
GlnSer: 2.628 ± 1.589
GlnThr: 1.314 ± 1.505
GlnVal: 2.628 ± 2.132
GlnTrp: 2.628 ± 1.589
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.57 ± 1.886
ArgCys: 0.0 ± 0.0
ArgAsp: 1.314 ± 1.505
ArgGlu: 3.942 ± 2.06
ArgPhe: 5.256 ± 3.179
ArgGly: 9.198 ± 0.574
ArgHis: 5.256 ± 3.179
ArgIle: 2.628 ± 1.016
ArgLys: 1.314 ± 0.795
ArgLeu: 6.57 ± 3.973
ArgMet: 0.0 ± 1.78
ArgAsn: 1.314 ± 0.795
ArgPro: 7.884 ± 2.061
ArgGln: 1.314 ± 0.795
ArgArg: 19.711 ± 9.997
ArgSer: 2.628 ± 1.589
ArgThr: 3.942 ± 2.442
ArgVal: 2.628 ± 1.016
ArgTrp: 1.314 ± 0.795
ArgTyr: 3.942 ± 2.384
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.628 ± 4.944
SerCys: 2.628 ± 1.016
SerAsp: 13.141 ± 6.848
SerGlu: 2.628 ± 3.01
SerPhe: 1.314 ± 0.795
SerGly: 3.942 ± 1.03
SerHis: 1.314 ± 0.795
SerIle: 2.628 ± 2.132
SerLys: 5.256 ± 1.469
SerLeu: 5.256 ± 2.032
SerMet: 0.0 ± 0.0
SerAsn: 0.0 ± 0.0
SerPro: 0.0 ± 0.0
SerGln: 3.942 ± 3.733
SerArg: 5.256 ± 1.534
SerSer: 10.512 ± 7.855
SerThr: 3.942 ± 2.08
SerVal: 3.942 ± 2.384
SerTrp: 1.314 ± 0.795
SerTyr: 1.314 ± 1.505
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.57 ± 4.117
ThrCys: 1.314 ± 0.795
ThrAsp: 6.57 ± 1.12
ThrGlu: 3.942 ± 2.442
ThrPhe: 2.628 ± 1.589
ThrGly: 7.884 ± 1.355
ThrHis: 1.314 ± 0.795
ThrIle: 3.942 ± 2.442
ThrLys: 5.256 ± 3.179
ThrLeu: 5.256 ± 3.927
ThrMet: 0.0 ± 0.0
ThrAsn: 2.628 ± 1.016
ThrPro: 5.256 ± 3.927
ThrGln: 2.628 ± 1.016
ThrArg: 1.314 ± 1.505
ThrSer: 2.628 ± 1.016
ThrThr: 1.314 ± 2.472
ThrVal: 0.0 ± 0.0
ThrTrp: 3.942 ± 1.03
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.0 ± 0.0
ValCys: 0.0 ± 0.0
ValAsp: 0.0 ± 0.0
ValGlu: 0.0 ± 0.0
ValPhe: 0.0 ± 0.0
ValGly: 1.314 ± 0.795
ValHis: 0.0 ± 0.0
ValIle: 2.628 ± 1.016
ValLys: 2.628 ± 1.589
ValLeu: 2.628 ± 1.016
ValMet: 1.314 ± 0.795
ValAsn: 1.314 ± 1.505
ValPro: 2.628 ± 1.016
ValGln: 2.628 ± 1.016
ValArg: 7.884 ± 4.768
ValSer: 1.314 ± 2.472
ValThr: 0.0 ± 0.0
ValVal: 2.628 ± 1.589
ValTrp: 1.314 ± 0.795
ValTyr: 1.314 ± 0.795
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.314 ± 0.795
TrpCys: 1.314 ± 0.795
TrpAsp: 3.942 ± 1.03
TrpGlu: 3.942 ± 2.384
TrpPhe: 2.628 ± 1.589
TrpGly: 1.314 ± 0.795
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.314 ± 2.472
TrpLeu: 2.628 ± 1.589
TrpMet: 1.314 ± 0.795
TrpAsn: 1.314 ± 0.795
TrpPro: 1.314 ± 2.472
TrpGln: 0.0 ± 0.0
TrpArg: 7.884 ± 4.768
TrpSer: 2.628 ± 1.016
TrpThr: 5.256 ± 1.534
TrpVal: 0.0 ± 0.0
TrpTrp: 1.314 ± 0.795
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.314 ± 0.795
TyrCys: 0.0 ± 0.0
TyrAsp: 1.314 ± 0.795
TyrGlu: 1.314 ± 0.795
TyrPhe: 1.314 ± 0.795
TyrGly: 1.314 ± 1.505
TyrHis: 1.314 ± 1.505
TyrIle: 2.628 ± 1.589
TyrLys: 3.942 ± 4.548
TyrLeu: 2.628 ± 1.589
TyrMet: 0.0 ± 0.0
TyrAsn: 1.314 ± 1.505
TyrPro: 0.0 ± 0.0
TyrGln: 1.314 ± 1.505
TyrArg: 3.942 ± 2.384
TyrSer: 1.314 ± 0.795
TyrThr: 0.0 ± 0.0
TyrVal: 0.0 ± 0.0
TyrTrp: 5.256 ± 1.534
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (762 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
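Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is only an illustration under stated assumptions. The function names and toy sequences are hypothetical, one-letter codes are used for brevity, and taking the standard deviation over 100 resamples as the error is an assumption, since the page does not say which statistic is reported.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one two-letter pair among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each resample draws len(proteins) whole proteins with
    replacement, and the error is the population standard deviation.
    """
    rng = random.Random(seed)
    estimates = [
        pair_permille(rng.choices(proteins, k=len(proteins)), pair)
        for _ in range(n_resamples)
    ]
    return statistics.pstdev(estimates)


# Hypothetical toy sequences, not the Seal anellovirus 2 proteome.
toy = ["MARRGG", "GGPK", "RRRA"]
err_rr = bootstrap_error(toy, "RR")      # pair present in some proteins
err_absent = bootstrap_error(toy, "WW")  # pair present in none
```

Under this scheme a dipeptide that occurs in none of the proteins has zero frequency in every resample and therefore zero error. With only three proteins, each resample can differ strongly from the original set, which is one reason an error can be as large as the value itself.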

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski