Amino acid dipepetide frequency for Trichechus manatus papillomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.261AlaAla: 5.261 ± 1.391
1.214AlaCys: 1.214 ± 0.535
4.452AlaAsp: 4.452 ± 0.886
6.07AlaGlu: 6.07 ± 1.102
3.642AlaPhe: 3.642 ± 2.334
1.214AlaGly: 1.214 ± 0.881
0.405AlaHis: 0.405 ± 0.371
1.619AlaIle: 1.619 ± 0.648
3.642AlaLys: 3.642 ± 1.863
5.666AlaLeu: 5.666 ± 1.404
0.405AlaMet: 0.405 ± 0.371
3.642AlaAsn: 3.642 ± 1.687
4.047AlaPro: 4.047 ± 0.828
3.238AlaGln: 3.238 ± 0.993
4.452AlaArg: 4.452 ± 0.987
2.833AlaSer: 2.833 ± 1.063
5.666AlaThr: 5.666 ± 2.035
3.642AlaVal: 3.642 ± 1.059
2.023AlaTrp: 2.023 ± 0.714
2.023AlaTyr: 2.023 ± 0.826
0.0AlaXaa: 0.0 ± 0.0
Cys
2.428CysAla: 2.428 ± 1.559
0.405CysCys: 0.405 ± 0.329
1.214CysAsp: 1.214 ± 0.88
0.405CysGlu: 0.405 ± 0.329
2.023CysPhe: 2.023 ± 0.737
0.809CysGly: 0.809 ± 0.641
0.405CysHis: 0.405 ± 0.596
1.214CysIle: 1.214 ± 0.689
0.809CysLys: 0.809 ± 0.451
2.023CysLeu: 2.023 ± 1.14
0.405CysMet: 0.405 ± 0.329
0.809CysAsn: 0.809 ± 0.512
2.023CysPro: 2.023 ± 0.771
0.0CysGln: 0.0 ± 0.0
1.619CysArg: 1.619 ± 0.615
0.809CysSer: 0.809 ± 0.512
1.214CysThr: 1.214 ± 0.7
1.214CysVal: 1.214 ± 0.637
0.809CysTrp: 0.809 ± 0.404
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.238AspAla: 3.238 ± 1.001
0.0AspCys: 0.0 ± 0.0
2.428AspAsp: 2.428 ± 1.067
4.047AspGlu: 4.047 ± 1.508
2.023AspPhe: 2.023 ± 0.683
3.642AspGly: 3.642 ± 1.047
1.214AspHis: 1.214 ± 0.95
5.261AspIle: 5.261 ± 1.836
3.238AspLys: 3.238 ± 0.601
7.689AspLeu: 7.689 ± 1.617
1.619AspMet: 1.619 ± 0.859
2.428AspAsn: 2.428 ± 1.034
4.452AspPro: 4.452 ± 1.855
2.428AspGln: 2.428 ± 0.795
2.023AspArg: 2.023 ± 0.491
3.642AspSer: 3.642 ± 1.48
5.666AspThr: 5.666 ± 1.293
1.619AspVal: 1.619 ± 0.922
1.214AspTrp: 1.214 ± 0.683
2.833AspTyr: 2.833 ± 1.53
0.0AspXaa: 0.0 ± 0.0
Glu
4.856GluAla: 4.856 ± 1.097
2.023GluCys: 2.023 ± 0.893
4.047GluAsp: 4.047 ± 1.224
4.452GluGlu: 4.452 ± 1.661
1.214GluPhe: 1.214 ± 0.395
4.856GluGly: 4.856 ± 1.563
0.809GluHis: 0.809 ± 0.451
1.214GluIle: 1.214 ± 0.402
2.023GluLys: 2.023 ± 0.691
5.666GluLeu: 5.666 ± 1.361
1.214GluMet: 1.214 ± 0.638
5.261GluAsn: 5.261 ± 1.277
4.452GluPro: 4.452 ± 1.458
2.023GluGln: 2.023 ± 0.671
2.023GluArg: 2.023 ± 0.97
3.642GluSer: 3.642 ± 1.867
3.238GluThr: 3.238 ± 1.274
3.642GluVal: 3.642 ± 1.092
0.809GluTrp: 0.809 ± 0.659
2.023GluTyr: 2.023 ± 0.889
0.0GluXaa: 0.0 ± 0.0
Phe
2.833PheAla: 2.833 ± 1.312
0.809PheCys: 0.809 ± 0.512
2.833PheAsp: 2.833 ± 0.574
0.809PheGlu: 0.809 ± 0.741
1.619PhePhe: 1.619 ± 0.549
3.642PheGly: 3.642 ± 1.49
1.214PheHis: 1.214 ± 0.629
1.214PheIle: 1.214 ± 0.636
2.833PheLys: 2.833 ± 1.207
4.047PheLeu: 4.047 ± 0.711
0.405PheMet: 0.405 ± 0.371
1.619PheAsn: 1.619 ± 0.694
1.619PhePro: 1.619 ± 0.978
1.619PheGln: 1.619 ± 0.664
2.428PheArg: 2.428 ± 0.88
2.428PheSer: 2.428 ± 0.608
2.833PheThr: 2.833 ± 0.779
1.619PheVal: 1.619 ± 0.768
1.214PheTrp: 1.214 ± 0.683
0.809PheTyr: 0.809 ± 0.408
0.0PheXaa: 0.0 ± 0.0
Gly
4.856GlyAla: 4.856 ± 0.736
1.619GlyCys: 1.619 ± 1.195
5.666GlyAsp: 5.666 ± 1.596
3.642GlyGlu: 3.642 ± 0.591
0.405GlyPhe: 0.405 ± 0.371
7.285GlyGly: 7.285 ± 2.487
2.833GlyHis: 2.833 ± 0.905
4.856GlyIle: 4.856 ± 1.622
1.619GlyLys: 1.619 ± 0.939
6.88GlyLeu: 6.88 ± 1.422
0.809GlyMet: 0.809 ± 0.604
2.833GlyAsn: 2.833 ± 0.807
3.642GlyPro: 3.642 ± 0.958
1.214GlyGln: 1.214 ± 0.535
4.452GlyArg: 4.452 ± 0.992
6.475GlySer: 6.475 ± 2.273
5.666GlyThr: 5.666 ± 1.1
3.642GlyVal: 3.642 ± 1.223
0.0GlyTrp: 0.0 ± 0.0
3.238GlyTyr: 3.238 ± 1.491
0.0GlyXaa: 0.0 ± 0.0
His
2.023HisAla: 2.023 ± 0.927
0.405HisCys: 0.405 ± 0.5
0.405HisAsp: 0.405 ± 0.359
0.809HisGlu: 0.809 ± 0.82
0.809HisPhe: 0.809 ± 0.408
1.214HisGly: 1.214 ± 0.653
0.0HisHis: 0.0 ± 0.0
1.619HisIle: 1.619 ± 0.753
0.405HisLys: 0.405 ± 0.329
1.214HisLeu: 1.214 ± 1.033
0.405HisMet: 0.405 ± 0.568
1.214HisAsn: 1.214 ± 0.395
4.452HisPro: 4.452 ± 3.215
0.405HisGln: 0.405 ± 0.329
1.619HisArg: 1.619 ± 1.016
0.405HisSer: 0.405 ± 0.359
2.023HisThr: 2.023 ± 0.896
2.023HisVal: 2.023 ± 0.619
1.214HisTrp: 1.214 ± 0.684
1.214HisTyr: 1.214 ± 0.71
0.0HisXaa: 0.0 ± 0.0
Ile
2.428IleAla: 2.428 ± 0.95
0.809IleCys: 0.809 ± 0.82
2.023IleAsp: 2.023 ± 1.254
2.428IleGlu: 2.428 ± 0.763
2.428IlePhe: 2.428 ± 0.725
3.238IleGly: 3.238 ± 1.843
1.619IleHis: 1.619 ± 0.572
1.214IleIle: 1.214 ± 0.599
1.214IleLys: 1.214 ± 0.683
5.666IleLeu: 5.666 ± 1.116
0.809IleMet: 0.809 ± 0.436
1.214IleAsn: 1.214 ± 0.535
4.856IlePro: 4.856 ± 1.71
1.619IleGln: 1.619 ± 0.765
2.833IleArg: 2.833 ± 1.616
6.475IleSer: 6.475 ± 1.701
2.023IleThr: 2.023 ± 0.776
2.023IleVal: 2.023 ± 0.886
0.405IleTrp: 0.405 ± 0.329
2.428IleTyr: 2.428 ± 0.643
0.0IleXaa: 0.0 ± 0.0
Lys
3.238LysAla: 3.238 ± 1.145
1.619LysCys: 1.619 ± 0.807
2.428LysAsp: 2.428 ± 1.211
2.428LysGlu: 2.428 ± 0.542
2.428LysPhe: 2.428 ± 1.228
1.619LysGly: 1.619 ± 0.754
1.619LysHis: 1.619 ± 1.317
2.833LysIle: 2.833 ± 0.807
3.238LysLys: 3.238 ± 0.878
1.619LysLeu: 1.619 ± 0.619
0.809LysMet: 0.809 ± 0.451
0.405LysAsn: 0.405 ± 0.371
0.405LysPro: 0.405 ± 0.596
2.428LysGln: 2.428 ± 0.995
4.047LysArg: 4.047 ± 1.158
3.238LysSer: 3.238 ± 1.414
2.023LysThr: 2.023 ± 0.892
2.428LysVal: 2.428 ± 1.355
0.809LysTrp: 0.809 ± 0.441
2.023LysTyr: 2.023 ± 0.97
0.0LysXaa: 0.0 ± 0.0
Leu
7.285LeuAla: 7.285 ± 1.703
1.619LeuCys: 1.619 ± 1.136
7.689LeuAsp: 7.689 ± 1.226
5.666LeuGlu: 5.666 ± 1.568
5.261LeuPhe: 5.261 ± 1.096
6.475LeuGly: 6.475 ± 1.152
2.023LeuHis: 2.023 ± 0.97
4.047LeuIle: 4.047 ± 1.664
4.047LeuLys: 4.047 ± 1.666
10.117LeuLeu: 10.117 ± 3.778
1.214LeuMet: 1.214 ± 0.681
2.023LeuAsn: 2.023 ± 0.455
6.07LeuPro: 6.07 ± 2.382
4.047LeuGln: 4.047 ± 1.033
4.856LeuArg: 4.856 ± 0.947
5.666LeuSer: 5.666 ± 0.521
6.07LeuThr: 6.07 ± 1.682
4.047LeuVal: 4.047 ± 2.001
1.214LeuTrp: 1.214 ± 0.667
4.452LeuTyr: 4.452 ± 1.448
0.0LeuXaa: 0.0 ± 0.0
Met
0.809MetAla: 0.809 ± 0.741
0.809MetCys: 0.809 ± 0.718
0.809MetAsp: 0.809 ± 0.44
1.214MetGlu: 1.214 ± 0.92
0.405MetPhe: 0.405 ± 0.329
0.809MetGly: 0.809 ± 0.44
0.405MetHis: 0.405 ± 0.596
0.405MetIle: 0.405 ± 0.329
0.809MetLys: 0.809 ± 0.512
2.023MetLeu: 2.023 ± 0.566
0.405MetMet: 0.405 ± 0.661
1.619MetAsn: 1.619 ± 1.09
0.0MetPro: 0.0 ± 0.0
0.809MetGln: 0.809 ± 0.659
0.405MetArg: 0.405 ± 0.329
0.405MetSer: 0.405 ± 0.371
1.214MetThr: 1.214 ± 0.683
1.214MetVal: 1.214 ± 0.395
0.0MetTrp: 0.0 ± 0.0
0.405MetTyr: 0.405 ± 0.359
0.0MetXaa: 0.0 ± 0.0
Asn
2.023AsnAla: 2.023 ± 0.893
0.809AsnCys: 0.809 ± 0.404
2.833AsnAsp: 2.833 ± 1.02
0.809AsnGlu: 0.809 ± 0.598
1.214AsnPhe: 1.214 ± 0.689
2.023AsnGly: 2.023 ± 0.747
0.809AsnHis: 0.809 ± 0.404
2.023AsnIle: 2.023 ± 0.947
0.809AsnLys: 0.809 ± 0.441
2.833AsnLeu: 2.833 ± 1.256
0.809AsnMet: 0.809 ± 0.741
0.809AsnAsn: 0.809 ± 0.741
2.023AsnPro: 2.023 ± 0.733
2.428AsnGln: 2.428 ± 1.487
2.428AsnArg: 2.428 ± 1.166
3.238AsnSer: 3.238 ± 1.478
4.452AsnThr: 4.452 ± 1.156
1.619AsnVal: 1.619 ± 0.516
0.809AsnTrp: 0.809 ± 0.404
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
3.642ProAla: 3.642 ± 1.077
0.809ProCys: 0.809 ± 0.608
3.642ProAsp: 3.642 ± 0.887
4.856ProGlu: 4.856 ± 1.387
2.428ProPhe: 2.428 ± 1.272
4.047ProGly: 4.047 ± 1.52
2.428ProHis: 2.428 ± 1.647
3.642ProIle: 3.642 ± 1.404
3.238ProLys: 3.238 ± 1.466
6.475ProLeu: 6.475 ± 1.775
0.0ProMet: 0.0 ± 0.0
1.214ProAsn: 1.214 ± 1.112
9.713ProPro: 9.713 ± 2.335
2.428ProGln: 2.428 ± 1.282
3.642ProArg: 3.642 ± 1.678
7.689ProSer: 7.689 ± 3.112
5.666ProThr: 5.666 ± 1.335
4.047ProVal: 4.047 ± 1.714
0.0ProTrp: 0.0 ± 0.0
2.833ProTyr: 2.833 ± 1.463
0.0ProXaa: 0.0 ± 0.0
Gln
2.428GlnAla: 2.428 ± 0.641
1.214GlnCys: 1.214 ± 0.441
2.428GlnAsp: 2.428 ± 1.174
3.238GlnGlu: 3.238 ± 1.473
2.023GlnPhe: 2.023 ± 0.626
2.833GlnGly: 2.833 ± 0.767
1.214GlnHis: 1.214 ± 0.866
0.405GlnIle: 0.405 ± 0.359
0.809GlnLys: 0.809 ± 0.608
5.261GlnLeu: 5.261 ± 1.815
1.214GlnMet: 1.214 ± 0.683
2.023GlnAsn: 2.023 ± 1.164
1.619GlnPro: 1.619 ± 0.33
1.619GlnGln: 1.619 ± 0.33
2.023GlnArg: 2.023 ± 1.068
3.642GlnSer: 3.642 ± 0.72
1.619GlnThr: 1.619 ± 0.854
2.023GlnVal: 2.023 ± 0.671
0.405GlnTrp: 0.405 ± 0.329
2.428GlnTyr: 2.428 ± 1.05
0.0GlnXaa: 0.0 ± 0.0
Arg
4.047ArgAla: 4.047 ± 1.336
2.428ArgCys: 2.428 ± 1.041
2.833ArgAsp: 2.833 ± 1.0
2.023ArgGlu: 2.023 ± 0.834
2.833ArgPhe: 2.833 ± 0.901
5.666ArgGly: 5.666 ± 1.464
2.023ArgHis: 2.023 ± 0.892
2.428ArgIle: 2.428 ± 0.862
3.238ArgLys: 3.238 ± 1.273
7.689ArgLeu: 7.689 ± 1.581
0.0ArgMet: 0.0 ± 0.0
1.214ArgAsn: 1.214 ± 0.795
4.856ArgPro: 4.856 ± 0.846
2.023ArgGln: 2.023 ± 0.491
5.261ArgArg: 5.261 ± 2.212
2.833ArgSer: 2.833 ± 0.489
5.261ArgThr: 5.261 ± 2.3
2.023ArgVal: 2.023 ± 0.491
0.809ArgTrp: 0.809 ± 0.512
1.619ArgTyr: 1.619 ± 0.602
0.0ArgXaa: 0.0 ± 0.0
Ser
4.452SerAla: 4.452 ± 1.816
0.809SerCys: 0.809 ± 0.66
4.452SerAsp: 4.452 ± 1.571
4.047SerGlu: 4.047 ± 1.508
2.023SerPhe: 2.023 ± 0.491
9.308SerGly: 9.308 ± 2.709
0.405SerHis: 0.405 ± 0.661
4.452SerIle: 4.452 ± 1.012
2.428SerLys: 2.428 ± 0.86
4.856SerLeu: 4.856 ± 1.694
1.619SerMet: 1.619 ± 0.939
2.023SerAsn: 2.023 ± 1.16
3.642SerPro: 3.642 ± 1.952
3.642SerGln: 3.642 ± 0.871
3.642SerArg: 3.642 ± 1.185
7.285SerSer: 7.285 ± 2.392
7.285SerThr: 7.285 ± 2.086
5.261SerVal: 5.261 ± 1.504
0.405SerTrp: 0.405 ± 0.661
1.214SerTyr: 1.214 ± 0.636
0.0SerXaa: 0.0 ± 0.0
Thr
4.047ThrAla: 4.047 ± 1.194
1.214ThrCys: 1.214 ± 0.636
4.452ThrAsp: 4.452 ± 1.085
4.856ThrGlu: 4.856 ± 0.923
2.023ThrPhe: 2.023 ± 1.077
6.07ThrGly: 6.07 ± 1.242
2.023ThrHis: 2.023 ± 0.908
4.856ThrIle: 4.856 ± 1.274
1.619ThrLys: 1.619 ± 0.902
6.475ThrLeu: 6.475 ± 1.943
2.023ThrMet: 2.023 ± 0.748
1.619ThrAsn: 1.619 ± 0.619
6.88ThrPro: 6.88 ± 1.815
2.428ThrGln: 2.428 ± 0.46
6.88ThrArg: 6.88 ± 1.648
6.475ThrSer: 6.475 ± 1.909
6.475ThrThr: 6.475 ± 1.904
2.833ThrVal: 2.833 ± 1.061
0.809ThrTrp: 0.809 ± 0.451
2.428ThrTyr: 2.428 ± 0.947
0.0ThrXaa: 0.0 ± 0.0
Val
2.023ValAla: 2.023 ± 1.028
1.619ValCys: 1.619 ± 0.899
3.642ValAsp: 3.642 ± 1.575
4.452ValGlu: 4.452 ± 1.368
1.214ValPhe: 1.214 ± 0.743
3.238ValGly: 3.238 ± 1.024
1.214ValHis: 1.214 ± 0.629
1.214ValIle: 1.214 ± 0.683
2.428ValLys: 2.428 ± 0.744
2.023ValLeu: 2.023 ± 1.307
0.0ValMet: 0.0 ± 0.0
1.619ValAsn: 1.619 ± 0.693
4.047ValPro: 4.047 ± 1.257
3.238ValGln: 3.238 ± 0.661
3.238ValArg: 3.238 ± 1.293
5.261ValSer: 5.261 ± 2.326
5.666ValThr: 5.666 ± 1.658
4.856ValVal: 4.856 ± 1.816
0.405ValTrp: 0.405 ± 0.371
0.809ValTyr: 0.809 ± 0.408
0.0ValXaa: 0.0 ± 0.0
Trp
0.809TrpAla: 0.809 ± 0.647
0.405TrpCys: 0.405 ± 0.359
0.405TrpAsp: 0.405 ± 0.329
1.214TrpGlu: 1.214 ± 0.947
0.0TrpPhe: 0.0 ± 0.0
0.809TrpGly: 0.809 ± 0.44
0.405TrpHis: 0.405 ± 0.371
1.214TrpIle: 1.214 ± 0.636
2.023TrpLys: 2.023 ± 1.022
2.023TrpLeu: 2.023 ± 0.892
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.809TrpGln: 0.809 ± 0.741
1.214TrpArg: 1.214 ± 0.535
0.405TrpSer: 0.405 ± 0.359
0.405TrpThr: 0.405 ± 0.359
0.809TrpVal: 0.809 ± 0.44
0.0TrpTrp: 0.0 ± 0.0
0.809TrpTyr: 0.809 ± 0.647
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.428TyrAla: 2.428 ± 0.542
0.0TyrCys: 0.0 ± 0.0
2.023TyrAsp: 2.023 ± 0.948
2.023TyrGlu: 2.023 ± 0.697
2.428TyrPhe: 2.428 ± 0.624
2.833TyrGly: 2.833 ± 0.527
1.214TyrHis: 1.214 ± 0.684
2.023TyrIle: 2.023 ± 1.026
1.214TyrLys: 1.214 ± 0.683
3.642TyrLeu: 3.642 ± 0.48
0.405TyrMet: 0.405 ± 0.329
1.214TyrAsn: 1.214 ± 0.643
3.642TyrPro: 3.642 ± 1.865
2.023TyrGln: 2.023 ± 0.761
2.023TyrArg: 2.023 ± 0.842
0.405TyrSer: 0.405 ± 0.329
2.023TyrThr: 2.023 ± 1.419
1.619TyrVal: 1.619 ± 0.33
0.405TyrTrp: 0.405 ± 0.371
3.642TyrTyr: 3.642 ± 1.169
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2472 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski