Amino acid dipepetide frequency for Sea otter polyomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.488AlaAla: 5.488 ± 2.577
1.098AlaCys: 1.098 ± 0.507
4.94AlaAsp: 4.94 ± 1.303
2.195AlaGlu: 2.195 ± 0.934
3.293AlaPhe: 3.293 ± 0.686
1.647AlaGly: 1.647 ± 0.879
0.0AlaHis: 0.0 ± 0.0
4.391AlaIle: 4.391 ± 1.478
2.195AlaLys: 2.195 ± 0.604
7.684AlaLeu: 7.684 ± 3.524
0.0AlaMet: 0.0 ± 0.0
2.195AlaAsn: 2.195 ± 0.712
4.94AlaPro: 4.94 ± 2.585
2.744AlaGln: 2.744 ± 1.964
7.684AlaArg: 7.684 ± 1.454
2.195AlaSer: 2.195 ± 1.011
2.195AlaThr: 2.195 ± 0.785
4.94AlaVal: 4.94 ± 0.9
1.098AlaTrp: 1.098 ± 0.729
2.195AlaTyr: 2.195 ± 1.018
0.0AlaXaa: 0.0 ± 0.0
Cys
1.647CysAla: 1.647 ± 0.942
0.0CysCys: 0.0 ± 0.0
1.647CysAsp: 1.647 ± 0.741
0.549CysGlu: 0.549 ± 0.51
1.098CysPhe: 1.098 ± 0.729
2.195CysGly: 2.195 ± 1.538
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
3.293CysLys: 3.293 ± 1.1
2.195CysLeu: 2.195 ± 1.145
1.098CysMet: 1.098 ± 0.729
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
1.098CysGln: 1.098 ± 1.129
0.0CysArg: 0.0 ± 0.0
1.098CysSer: 1.098 ± 0.729
2.744CysThr: 2.744 ± 2.328
0.0CysVal: 0.0 ± 0.0
0.549CysTrp: 0.549 ± 0.51
2.195CysTyr: 2.195 ± 3.059
0.0CysXaa: 0.0 ± 0.0
Asp
1.647AspAla: 1.647 ± 0.689
2.195AspCys: 2.195 ± 2.193
4.391AspAsp: 4.391 ± 0.728
4.391AspGlu: 4.391 ± 1.983
2.744AspPhe: 2.744 ± 0.547
3.293AspGly: 3.293 ± 0.783
0.0AspHis: 0.0 ± 0.0
1.647AspIle: 1.647 ± 0.741
3.842AspLys: 3.842 ± 0.453
7.684AspLeu: 7.684 ± 1.858
3.293AspMet: 3.293 ± 1.297
3.293AspAsn: 3.293 ± 0.42
3.842AspPro: 3.842 ± 1.491
1.098AspGln: 1.098 ± 1.129
1.098AspArg: 1.098 ± 0.507
5.488AspSer: 5.488 ± 1.223
3.842AspThr: 3.842 ± 1.631
1.098AspVal: 1.098 ± 0.507
1.647AspTrp: 1.647 ± 1.014
3.293AspTyr: 3.293 ± 1.297
0.0AspXaa: 0.0 ± 0.0
Glu
5.488GluAla: 5.488 ± 1.776
1.647GluCys: 1.647 ± 1.063
8.233GluAsp: 8.233 ± 1.483
7.684GluGlu: 7.684 ± 1.15
1.098GluPhe: 1.098 ± 0.769
2.744GluGly: 2.744 ± 1.077
1.098GluHis: 1.098 ± 0.507
1.647GluIle: 1.647 ± 0.741
1.647GluLys: 1.647 ± 1.063
7.684GluLeu: 7.684 ± 1.529
1.098GluMet: 1.098 ± 0.794
2.195GluAsn: 2.195 ± 1.018
2.744GluPro: 2.744 ± 1.424
1.098GluGln: 1.098 ± 1.129
1.098GluArg: 1.098 ± 0.729
2.744GluSer: 2.744 ± 1.422
1.647GluThr: 1.647 ± 0.879
7.684GluVal: 7.684 ± 1.872
2.195GluTrp: 2.195 ± 0.712
2.195GluTyr: 2.195 ± 1.538
0.0GluXaa: 0.0 ± 0.0
Phe
1.647PheAla: 1.647 ± 0.741
1.098PheCys: 1.098 ± 0.769
1.098PheAsp: 1.098 ± 0.729
2.195PheGlu: 2.195 ± 1.458
0.0PhePhe: 0.0 ± 0.0
3.842PheGly: 3.842 ± 1.079
1.098PheHis: 1.098 ± 0.507
2.195PheIle: 2.195 ± 0.604
2.195PheLys: 2.195 ± 1.066
3.293PheLeu: 3.293 ± 1.252
1.647PheMet: 1.647 ± 1.103
1.098PheAsn: 1.098 ± 0.507
2.744PhePro: 2.744 ± 0.943
1.098PheGln: 1.098 ± 0.794
2.744PheArg: 2.744 ± 0.547
3.842PheSer: 3.842 ± 1.494
1.647PheThr: 1.647 ± 0.888
0.549PheVal: 0.549 ± 0.765
0.549PheTrp: 0.549 ± 1.158
0.549PheTyr: 0.549 ± 0.51
0.0PheXaa: 0.0 ± 0.0
Gly
3.293GlyAla: 3.293 ± 1.378
0.549GlyCys: 0.549 ± 0.385
4.94GlyAsp: 4.94 ± 1.968
2.744GlyGlu: 2.744 ± 1.414
2.195GlyPhe: 2.195 ± 0.9
7.135GlyGly: 7.135 ± 1.177
1.098GlyHis: 1.098 ± 0.507
6.586GlyIle: 6.586 ± 2.68
2.195GlyLys: 2.195 ± 1.1
6.586GlyLeu: 6.586 ± 1.758
1.647GlyMet: 1.647 ± 1.063
2.744GlyAsn: 2.744 ± 1.251
7.684GlyPro: 7.684 ± 3.678
4.391GlyGln: 4.391 ± 1.654
2.744GlyArg: 2.744 ± 1.569
3.293GlySer: 3.293 ± 0.812
2.744GlyThr: 2.744 ± 1.378
4.94GlyVal: 4.94 ± 1.303
1.098GlyTrp: 1.098 ± 0.729
2.195GlyTyr: 2.195 ± 1.587
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
3.293HisGly: 3.293 ± 0.728
0.549HisHis: 0.549 ± 0.385
2.195HisIle: 2.195 ± 0.785
1.098HisLys: 1.098 ± 0.507
2.195HisLeu: 2.195 ± 1.4
1.098HisMet: 1.098 ± 1.021
0.549HisAsn: 0.549 ± 0.385
1.098HisPro: 1.098 ± 0.729
1.098HisGln: 1.098 ± 1.21
1.647HisArg: 1.647 ± 0.677
4.391HisSer: 4.391 ± 0.728
0.0HisThr: 0.0 ± 0.0
0.549HisVal: 0.549 ± 0.385
0.549HisTrp: 0.549 ± 0.385
1.647HisTyr: 1.647 ± 0.648
0.0HisXaa: 0.0 ± 0.0
Ile
2.744IleAla: 2.744 ± 0.951
1.647IleCys: 1.647 ± 1.154
6.037IleAsp: 6.037 ± 1.057
1.098IleGlu: 1.098 ± 1.021
1.098IlePhe: 1.098 ± 0.769
2.195IleGly: 2.195 ± 1.306
0.549IleHis: 0.549 ± 0.385
0.549IleIle: 0.549 ± 0.765
1.098IleLys: 1.098 ± 1.16
3.842IleLeu: 3.842 ± 0.657
0.0IleMet: 0.0 ± 0.0
2.744IleAsn: 2.744 ± 0.987
4.94IlePro: 4.94 ± 1.411
1.647IleGln: 1.647 ± 0.648
2.744IleArg: 2.744 ± 1.408
3.293IleSer: 3.293 ± 1.344
0.549IleThr: 0.549 ± 0.385
2.744IleVal: 2.744 ± 0.784
0.549IleTrp: 0.549 ± 0.385
0.549IleTyr: 0.549 ± 0.385
0.0IleXaa: 0.0 ± 0.0
Lys
6.586LysAla: 6.586 ± 1.075
0.549LysCys: 0.549 ± 1.158
1.647LysAsp: 1.647 ± 0.723
6.037LysGlu: 6.037 ± 1.386
1.647LysPhe: 1.647 ± 0.648
4.94LysGly: 4.94 ± 1.737
3.842LysHis: 3.842 ± 0.453
1.647LysIle: 1.647 ± 0.648
6.037LysLys: 6.037 ± 2.133
7.135LysLeu: 7.135 ± 2.838
0.549LysMet: 0.549 ± 0.385
2.195LysAsn: 2.195 ± 0.604
0.549LysPro: 0.549 ± 0.385
1.647LysGln: 1.647 ± 2.255
8.782LysArg: 8.782 ± 1.449
2.195LysSer: 2.195 ± 1.427
1.098LysThr: 1.098 ± 0.507
1.098LysVal: 1.098 ± 0.507
0.0LysTrp: 0.0 ± 0.0
1.647LysTyr: 1.647 ± 0.741
0.0LysXaa: 0.0 ± 0.0
Leu
4.391LeuAla: 4.391 ± 1.404
2.744LeuCys: 2.744 ± 0.987
8.782LeuAsp: 8.782 ± 1.0
6.586LeuGlu: 6.586 ± 1.542
5.488LeuPhe: 5.488 ± 1.294
2.744LeuGly: 2.744 ± 0.951
2.744LeuHis: 2.744 ± 0.708
3.293LeuIle: 3.293 ± 0.42
5.488LeuLys: 5.488 ± 3.014
13.172LeuLeu: 13.172 ± 3.298
5.488LeuMet: 5.488 ± 2.774
5.488LeuAsn: 5.488 ± 0.975
6.586LeuPro: 6.586 ± 0.785
3.293LeuGln: 3.293 ± 0.783
6.037LeuArg: 6.037 ± 0.949
7.684LeuSer: 7.684 ± 0.985
3.293LeuThr: 3.293 ± 1.034
4.94LeuVal: 4.94 ± 2.546
0.549LeuTrp: 0.549 ± 0.765
8.233LeuTyr: 8.233 ± 1.744
0.0LeuXaa: 0.0 ± 0.0
Met
2.744MetAla: 2.744 ± 0.666
1.098MetCys: 1.098 ± 0.729
2.195MetAsp: 2.195 ± 1.145
2.195MetGlu: 2.195 ± 1.145
1.098MetPhe: 1.098 ± 0.507
3.293MetGly: 3.293 ± 1.078
0.0MetHis: 0.0 ± 0.0
0.549MetIle: 0.549 ± 0.385
1.647MetLys: 1.647 ± 0.879
2.744MetLeu: 2.744 ± 0.547
2.195MetMet: 2.195 ± 1.458
0.549MetAsn: 0.549 ± 0.385
0.549MetPro: 0.549 ± 0.385
0.549MetGln: 0.549 ± 0.51
1.098MetArg: 1.098 ± 0.729
0.549MetSer: 0.549 ± 1.158
1.098MetThr: 1.098 ± 1.021
3.293MetVal: 3.293 ± 0.42
0.549MetTrp: 0.549 ± 0.51
0.549MetTyr: 0.549 ± 0.385
0.0MetXaa: 0.0 ± 0.0
Asn
2.195AsnAla: 2.195 ± 1.014
0.549AsnCys: 0.549 ± 1.158
0.549AsnAsp: 0.549 ± 0.385
2.744AsnGlu: 2.744 ± 0.987
1.647AsnPhe: 1.647 ± 0.741
1.098AsnGly: 1.098 ± 0.794
0.549AsnHis: 0.549 ± 0.385
2.744AsnIle: 2.744 ± 1.0
3.293AsnLys: 3.293 ± 0.783
4.94AsnLeu: 4.94 ± 1.731
1.647AsnMet: 1.647 ± 1.058
1.098AsnAsn: 1.098 ± 0.769
2.744AsnPro: 2.744 ± 1.574
2.195AsnGln: 2.195 ± 0.604
2.195AsnArg: 2.195 ± 0.785
2.195AsnSer: 2.195 ± 1.362
3.842AsnThr: 3.842 ± 1.386
2.195AsnVal: 2.195 ± 1.458
1.098AsnTrp: 1.098 ± 0.442
1.647AsnTyr: 1.647 ± 0.723
0.0AsnXaa: 0.0 ± 0.0
Pro
2.195ProAla: 2.195 ± 0.604
0.549ProCys: 0.549 ± 0.385
4.94ProAsp: 4.94 ± 1.015
3.293ProGlu: 3.293 ± 1.288
2.744ProPhe: 2.744 ± 0.943
3.842ProGly: 3.842 ± 1.22
1.098ProHis: 1.098 ± 0.507
0.549ProIle: 0.549 ± 0.385
5.488ProLys: 5.488 ± 2.395
8.782ProLeu: 8.782 ± 3.274
1.098ProMet: 1.098 ± 1.465
1.647ProAsn: 1.647 ± 0.852
8.233ProPro: 8.233 ± 2.929
3.842ProGln: 3.842 ± 1.276
2.195ProArg: 2.195 ± 0.604
6.037ProSer: 6.037 ± 2.149
1.098ProThr: 1.098 ± 0.507
5.488ProVal: 5.488 ± 2.246
0.0ProTrp: 0.0 ± 0.0
0.549ProTyr: 0.549 ± 0.51
0.0ProXaa: 0.0 ± 0.0
Gln
2.744GlnAla: 2.744 ± 0.943
0.0GlnCys: 0.0 ± 0.0
1.098GlnAsp: 1.098 ± 1.129
2.744GlnGlu: 2.744 ± 1.446
2.744GlnPhe: 2.744 ± 0.911
2.195GlnGly: 2.195 ± 1.227
1.647GlnHis: 1.647 ± 1.104
3.293GlnIle: 3.293 ± 1.895
3.842GlnLys: 3.842 ± 1.144
2.195GlnLeu: 2.195 ± 1.641
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
2.195GlnPro: 2.195 ± 1.1
0.549GlnGln: 0.549 ± 0.385
3.842GlnArg: 3.842 ± 1.641
2.195GlnSer: 2.195 ± 0.712
0.549GlnThr: 0.549 ± 0.477
2.744GlnVal: 2.744 ± 0.911
0.0GlnTrp: 0.0 ± 0.0
1.647GlnTyr: 1.647 ± 1.104
0.0GlnXaa: 0.0 ± 0.0
Arg
5.488ArgAla: 5.488 ± 1.789
1.647ArgCys: 1.647 ± 0.879
3.293ArgAsp: 3.293 ± 1.297
4.391ArgGlu: 4.391 ± 1.943
1.647ArgPhe: 1.647 ± 0.741
3.293ArgGly: 3.293 ± 2.208
0.549ArgHis: 0.549 ± 0.765
1.647ArgIle: 1.647 ± 0.741
3.842ArgLys: 3.842 ± 1.386
6.037ArgLeu: 6.037 ± 1.046
1.098ArgMet: 1.098 ± 0.997
4.94ArgAsn: 4.94 ± 1.52
3.842ArgPro: 3.842 ± 1.282
1.098ArgGln: 1.098 ± 0.794
1.647ArgArg: 1.647 ± 0.648
1.647ArgSer: 1.647 ± 0.741
7.135ArgThr: 7.135 ± 2.094
4.391ArgVal: 4.391 ± 1.173
1.098ArgTrp: 1.098 ± 1.021
2.195ArgTyr: 2.195 ± 1.427
0.0ArgXaa: 0.0 ± 0.0
Ser
3.842SerAla: 3.842 ± 1.276
2.195SerCys: 2.195 ± 0.783
1.098SerAsp: 1.098 ± 0.769
2.195SerGlu: 2.195 ± 1.538
2.195SerPhe: 2.195 ± 0.785
9.33SerGly: 9.33 ± 3.627
2.195SerHis: 2.195 ± 1.587
2.744SerIle: 2.744 ± 0.784
2.744SerLys: 2.744 ± 0.547
4.94SerLeu: 4.94 ± 1.945
1.098SerMet: 1.098 ± 0.794
2.744SerAsn: 2.744 ± 1.359
0.549SerPro: 0.549 ± 0.385
4.94SerGln: 4.94 ± 1.185
4.94SerArg: 4.94 ± 0.882
6.037SerSer: 6.037 ± 0.975
4.94SerThr: 4.94 ± 2.118
2.744SerVal: 2.744 ± 0.943
2.195SerTrp: 2.195 ± 0.785
1.098SerTyr: 1.098 ± 1.16
0.0SerXaa: 0.0 ± 0.0
Thr
2.195ThrAla: 2.195 ± 0.934
1.098ThrCys: 1.098 ± 1.021
2.195ThrAsp: 2.195 ± 2.259
3.842ThrGlu: 3.842 ± 0.832
0.549ThrPhe: 0.549 ± 0.765
3.842ThrGly: 3.842 ± 1.494
0.0ThrHis: 0.0 ± 0.0
2.195ThrIle: 2.195 ± 0.635
1.098ThrLys: 1.098 ± 0.769
3.293ThrLeu: 3.293 ± 1.211
3.842ThrMet: 3.842 ± 1.631
2.195ThrAsn: 2.195 ± 0.604
4.94ThrPro: 4.94 ± 0.903
1.647ThrGln: 1.647 ± 0.492
4.391ThrArg: 4.391 ± 2.65
1.647ThrSer: 1.647 ± 0.492
3.293ThrThr: 3.293 ± 1.063
3.293ThrVal: 3.293 ± 1.363
1.647ThrTrp: 1.647 ± 1.419
1.647ThrTyr: 1.647 ± 0.648
0.0ThrXaa: 0.0 ± 0.0
Val
3.842ValAla: 3.842 ± 1.771
0.0ValCys: 0.0 ± 0.0
1.647ValAsp: 1.647 ± 0.648
6.586ValGlu: 6.586 ± 1.46
1.647ValPhe: 1.647 ± 0.723
4.94ValGly: 4.94 ± 2.548
3.293ValHis: 3.293 ± 1.297
0.549ValIle: 0.549 ± 0.385
2.744ValLys: 2.744 ± 1.0
4.391ValLeu: 4.391 ± 1.901
0.0ValMet: 0.0 ± 0.0
3.842ValAsn: 3.842 ± 0.842
2.744ValPro: 2.744 ± 1.21
0.549ValGln: 0.549 ± 0.385
3.842ValArg: 3.842 ± 1.968
3.842ValSer: 3.842 ± 0.657
5.488ValThr: 5.488 ± 1.303
2.195ValVal: 2.195 ± 1.009
1.647ValTrp: 1.647 ± 1.014
2.195ValTyr: 2.195 ± 1.145
0.0ValXaa: 0.0 ± 0.0
Trp
2.744TrpAla: 2.744 ± 1.652
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.647TrpGlu: 1.647 ± 0.741
0.549TrpPhe: 0.549 ± 0.765
0.549TrpGly: 0.549 ± 0.51
1.098TrpHis: 1.098 ± 0.729
1.098TrpIle: 1.098 ± 0.853
3.293TrpLys: 3.293 ± 0.825
2.195TrpLeu: 2.195 ± 0.712
0.0TrpMet: 0.0 ± 0.0
1.647TrpAsn: 1.647 ± 1.396
0.0TrpPro: 0.0 ± 0.0
0.549TrpGln: 0.549 ± 0.477
0.0TrpArg: 0.0 ± 0.0
0.549TrpSer: 0.549 ± 1.158
0.0TrpThr: 0.0 ± 0.0
0.549TrpVal: 0.549 ± 0.51
0.549TrpTrp: 0.549 ± 0.385
1.098TrpTyr: 1.098 ± 0.794
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.195TyrAla: 2.195 ± 0.712
2.744TyrCys: 2.744 ± 1.569
1.098TyrAsp: 1.098 ± 0.769
0.549TyrGlu: 0.549 ± 0.385
1.647TyrPhe: 1.647 ± 0.942
4.391TyrGly: 4.391 ± 0.999
0.549TyrHis: 0.549 ± 0.477
1.098TyrIle: 1.098 ± 0.794
2.744TyrLys: 2.744 ± 0.708
6.037TyrLeu: 6.037 ± 3.285
1.098TyrMet: 1.098 ± 0.729
0.0TyrAsn: 0.0 ± 0.0
3.293TyrPro: 3.293 ± 1.589
1.098TyrGln: 1.098 ± 0.794
2.195TyrArg: 2.195 ± 1.018
3.842TyrSer: 3.842 ± 1.144
1.647TyrThr: 1.647 ± 0.942
0.549TyrVal: 0.549 ± 1.158
0.549TyrTrp: 0.549 ± 0.385
1.098TyrTyr: 1.098 ± 0.794
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1823 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski