Amino acid dipepetide frequency for Tursiops truncatus papillomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.048AlaAla: 6.048 ± 1.077
0.806AlaCys: 0.806 ± 0.359
2.419AlaAsp: 2.419 ± 0.723
6.048AlaGlu: 6.048 ± 1.219
2.016AlaPhe: 2.016 ± 0.428
3.629AlaGly: 3.629 ± 0.962
0.806AlaHis: 0.806 ± 0.47
2.016AlaIle: 2.016 ± 0.642
2.419AlaLys: 2.419 ± 0.393
3.629AlaLeu: 3.629 ± 1.304
1.613AlaMet: 1.613 ± 0.572
4.032AlaAsn: 4.032 ± 1.526
3.226AlaPro: 3.226 ± 0.961
1.613AlaGln: 1.613 ± 0.94
2.823AlaArg: 2.823 ± 0.961
5.645AlaSer: 5.645 ± 1.921
3.226AlaThr: 3.226 ± 0.974
4.435AlaVal: 4.435 ± 1.878
0.0AlaTrp: 0.0 ± 0.0
1.613AlaTyr: 1.613 ± 0.52
0.0AlaXaa: 0.0 ± 0.0
Cys
1.21CysAla: 1.21 ± 0.706
1.21CysCys: 1.21 ± 1.146
0.403CysAsp: 0.403 ± 0.393
2.419CysGlu: 2.419 ± 0.806
0.806CysPhe: 0.806 ± 0.461
0.403CysGly: 0.403 ± 0.393
0.403CysHis: 0.403 ± 0.393
1.613CysIle: 1.613 ± 1.251
1.613CysLys: 1.613 ± 0.571
2.419CysLeu: 2.419 ± 1.284
0.806CysMet: 0.806 ± 0.622
0.806CysAsn: 0.806 ± 0.612
1.613CysPro: 1.613 ± 0.718
0.806CysGln: 0.806 ± 0.642
1.613CysArg: 1.613 ± 1.051
2.419CysSer: 2.419 ± 1.573
2.419CysThr: 2.419 ± 1.136
1.613CysVal: 1.613 ± 0.906
1.21CysTrp: 1.21 ± 0.668
0.806CysTyr: 0.806 ± 1.135
0.0CysXaa: 0.0 ± 0.0
Asp
3.629AspAla: 3.629 ± 1.077
2.016AspCys: 2.016 ± 0.823
3.629AspAsp: 3.629 ± 2.04
2.016AspGlu: 2.016 ± 0.926
2.823AspPhe: 2.823 ± 1.237
4.435AspGly: 4.435 ± 1.608
0.0AspHis: 0.0 ± 0.0
4.435AspIle: 4.435 ± 1.402
1.613AspLys: 1.613 ± 0.52
6.452AspLeu: 6.452 ± 2.646
1.613AspMet: 1.613 ± 0.711
2.823AspAsn: 2.823 ± 1.128
3.629AspPro: 3.629 ± 0.875
0.403AspGln: 0.403 ± 0.567
1.613AspArg: 1.613 ± 1.033
5.645AspSer: 5.645 ± 1.32
5.242AspThr: 5.242 ± 1.591
3.226AspVal: 3.226 ± 1.454
1.613AspTrp: 1.613 ± 0.976
1.21AspTyr: 1.21 ± 0.706
0.0AspXaa: 0.0 ± 0.0
Glu
2.823GluAla: 2.823 ± 0.968
0.403GluCys: 0.403 ± 0.393
5.645GluAsp: 5.645 ± 1.745
3.226GluGlu: 3.226 ± 0.939
0.806GluPhe: 0.806 ± 0.47
5.242GluGly: 5.242 ± 1.889
0.0GluHis: 0.0 ± 0.0
2.419GluIle: 2.419 ± 1.002
4.435GluLys: 4.435 ± 1.927
4.435GluLeu: 4.435 ± 1.089
1.21GluMet: 1.21 ± 0.706
4.032GluAsn: 4.032 ± 0.789
4.435GluPro: 4.435 ± 1.715
4.032GluGln: 4.032 ± 0.707
2.823GluArg: 2.823 ± 0.828
4.839GluSer: 4.839 ± 1.662
3.629GluThr: 3.629 ± 1.553
2.419GluVal: 2.419 ± 1.214
0.403GluTrp: 0.403 ± 0.393
1.21GluTyr: 1.21 ± 0.638
0.0GluXaa: 0.0 ± 0.0
Phe
2.419PheAla: 2.419 ± 0.513
0.806PheCys: 0.806 ± 0.652
2.823PheAsp: 2.823 ± 1.38
3.226PheGlu: 3.226 ± 1.154
1.613PhePhe: 1.613 ± 1.08
2.419PheGly: 2.419 ± 1.262
0.403PheHis: 0.403 ± 0.567
0.806PheIle: 0.806 ± 0.612
2.016PheLys: 2.016 ± 0.678
4.032PheLeu: 4.032 ± 1.614
0.403PheMet: 0.403 ± 0.296
0.403PheAsn: 0.403 ± 0.389
1.21PhePro: 1.21 ± 0.765
1.613PheGln: 1.613 ± 0.517
2.419PheArg: 2.419 ± 0.657
2.016PheSer: 2.016 ± 1.112
1.613PheThr: 1.613 ± 1.148
3.226PheVal: 3.226 ± 1.091
2.016PheTrp: 2.016 ± 0.631
1.613PheTyr: 1.613 ± 0.694
0.0PheXaa: 0.0 ± 0.0
Gly
2.016GlyAla: 2.016 ± 0.678
1.613GlyCys: 1.613 ± 0.571
5.645GlyAsp: 5.645 ± 0.886
6.048GlyGlu: 6.048 ± 1.261
1.613GlyPhe: 1.613 ± 1.242
5.242GlyGly: 5.242 ± 1.731
1.613GlyHis: 1.613 ± 0.587
2.016GlyIle: 2.016 ± 0.611
4.435GlyLys: 4.435 ± 1.291
3.629GlyLeu: 3.629 ± 0.606
0.806GlyMet: 0.806 ± 0.359
2.419GlyAsn: 2.419 ± 0.793
4.839GlyPro: 4.839 ± 2.121
2.016GlyGln: 2.016 ± 0.517
4.839GlyArg: 4.839 ± 1.813
5.645GlySer: 5.645 ± 0.934
5.645GlyThr: 5.645 ± 1.837
4.435GlyVal: 4.435 ± 1.179
1.21GlyTrp: 1.21 ± 0.603
0.806GlyTyr: 0.806 ± 0.411
0.0GlyXaa: 0.0 ± 0.0
His
1.21HisAla: 1.21 ± 0.362
0.403HisCys: 0.403 ± 0.389
0.0HisAsp: 0.0 ± 0.0
1.21HisGlu: 1.21 ± 0.822
1.613HisPhe: 1.613 ± 0.636
0.403HisGly: 0.403 ± 0.348
4.032HisHis: 4.032 ± 3.732
2.016HisIle: 2.016 ± 1.373
0.403HisLys: 0.403 ± 0.339
2.419HisLeu: 2.419 ± 0.933
0.0HisMet: 0.0 ± 0.0
0.806HisAsn: 0.806 ± 0.58
2.016HisPro: 2.016 ± 0.912
2.016HisGln: 2.016 ± 1.385
1.613HisArg: 1.613 ± 0.718
1.21HisSer: 1.21 ± 0.579
1.21HisThr: 1.21 ± 0.499
1.613HisVal: 1.613 ± 0.795
0.403HisTrp: 0.403 ± 0.389
0.403HisTyr: 0.403 ± 0.339
0.0HisXaa: 0.0 ± 0.0
Ile
4.032IleAla: 4.032 ± 0.661
2.016IleCys: 2.016 ± 1.235
4.032IleAsp: 4.032 ± 1.594
2.419IleGlu: 2.419 ± 1.044
1.613IlePhe: 1.613 ± 0.7
2.419IleGly: 2.419 ± 1.012
1.21IleHis: 1.21 ± 1.045
1.613IleIle: 1.613 ± 0.858
1.613IleLys: 1.613 ± 1.11
3.226IleLeu: 3.226 ± 0.952
2.016IleMet: 2.016 ± 0.848
1.21IleAsn: 1.21 ± 0.561
4.032IlePro: 4.032 ± 1.022
0.806IleGln: 0.806 ± 0.678
1.21IleArg: 1.21 ± 0.668
5.242IleSer: 5.242 ± 1.122
2.419IleThr: 2.419 ± 0.637
3.226IleVal: 3.226 ± 0.959
0.403IleTrp: 0.403 ± 0.348
1.613IleTyr: 1.613 ± 0.728
0.0IleXaa: 0.0 ± 0.0
Lys
3.629LysAla: 3.629 ± 0.904
2.016LysCys: 2.016 ± 0.787
1.613LysAsp: 1.613 ± 0.669
1.613LysGlu: 1.613 ± 0.866
2.419LysPhe: 2.419 ± 0.793
2.016LysGly: 2.016 ± 0.517
1.613LysHis: 1.613 ± 0.572
1.21LysIle: 1.21 ± 0.668
2.823LysLys: 2.823 ± 1.554
2.419LysLeu: 2.419 ± 0.714
0.806LysMet: 0.806 ± 0.359
2.419LysAsn: 2.419 ± 0.797
2.016LysPro: 2.016 ± 1.318
2.823LysGln: 2.823 ± 0.999
3.226LysArg: 3.226 ± 1.232
4.839LysSer: 4.839 ± 1.219
3.629LysThr: 3.629 ± 0.592
5.242LysVal: 5.242 ± 1.283
2.016LysTrp: 2.016 ± 0.729
2.016LysTyr: 2.016 ± 0.793
0.0LysXaa: 0.0 ± 0.0
Leu
2.016LeuAla: 2.016 ± 0.8
1.21LeuCys: 1.21 ± 0.953
5.645LeuAsp: 5.645 ± 0.915
4.839LeuGlu: 4.839 ± 1.58
4.435LeuPhe: 4.435 ± 1.611
6.452LeuGly: 6.452 ± 1.144
2.016LeuHis: 2.016 ± 0.699
3.629LeuIle: 3.629 ± 1.482
4.435LeuLys: 4.435 ± 1.597
10.484LeuLeu: 10.484 ± 3.131
1.21LeuMet: 1.21 ± 0.669
2.823LeuAsn: 2.823 ± 0.768
5.242LeuPro: 5.242 ± 1.659
5.242LeuGln: 5.242 ± 0.941
5.645LeuArg: 5.645 ± 1.037
7.661LeuSer: 7.661 ± 1.748
5.242LeuThr: 5.242 ± 0.761
5.242LeuVal: 5.242 ± 0.891
0.403LeuTrp: 0.403 ± 0.389
4.032LeuTyr: 4.032 ± 1.192
0.0LeuXaa: 0.0 ± 0.0
Met
0.806MetAla: 0.806 ± 0.652
0.806MetCys: 0.806 ± 0.359
2.419MetAsp: 2.419 ± 0.837
0.806MetGlu: 0.806 ± 0.787
2.016MetPhe: 2.016 ± 0.428
0.0MetGly: 0.0 ± 0.0
0.806MetHis: 0.806 ± 0.717
0.806MetIle: 0.806 ± 0.359
0.403MetLys: 0.403 ± 0.339
2.419MetLeu: 2.419 ± 0.837
0.806MetMet: 0.806 ± 0.509
0.806MetAsn: 0.806 ± 0.568
1.21MetPro: 1.21 ± 0.668
1.613MetGln: 1.613 ± 0.878
0.403MetArg: 0.403 ± 0.348
1.613MetSer: 1.613 ± 0.52
0.0MetThr: 0.0 ± 0.0
1.21MetVal: 1.21 ± 0.579
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.823AsnAla: 2.823 ± 1.278
0.806AsnCys: 0.806 ± 0.612
0.806AsnAsp: 0.806 ± 0.642
2.016AsnGlu: 2.016 ± 0.553
0.403AsnPhe: 0.403 ± 0.393
4.032AsnGly: 4.032 ± 1.27
0.0AsnHis: 0.0 ± 0.0
2.823AsnIle: 2.823 ± 0.765
2.419AsnLys: 2.419 ± 1.133
2.823AsnLeu: 2.823 ± 1.449
0.403AsnMet: 0.403 ± 0.339
2.419AsnAsn: 2.419 ± 1.794
4.032AsnPro: 4.032 ± 1.003
2.016AsnGln: 2.016 ± 1.1
2.016AsnArg: 2.016 ± 0.6
1.21AsnSer: 1.21 ± 0.806
3.629AsnThr: 3.629 ± 1.047
3.226AsnVal: 3.226 ± 0.345
0.403AsnTrp: 0.403 ± 0.339
1.613AsnTyr: 1.613 ± 1.033
0.0AsnXaa: 0.0 ± 0.0
Pro
8.065ProAla: 8.065 ± 2.004
2.016ProCys: 2.016 ± 1.042
2.823ProAsp: 2.823 ± 1.938
4.435ProGlu: 4.435 ± 1.499
2.419ProPhe: 2.419 ± 0.393
3.226ProGly: 3.226 ± 1.188
2.419ProHis: 2.419 ± 1.014
2.419ProIle: 2.419 ± 0.99
4.435ProLys: 4.435 ± 1.152
8.065ProLeu: 8.065 ± 2.098
0.403ProMet: 0.403 ± 0.348
1.613ProAsn: 1.613 ± 0.777
9.677ProPro: 9.677 ± 2.628
1.613ProGln: 1.613 ± 0.718
2.823ProArg: 2.823 ± 0.885
6.048ProSer: 6.048 ± 2.517
3.226ProThr: 3.226 ± 1.577
3.629ProVal: 3.629 ± 1.797
0.403ProTrp: 0.403 ± 0.473
1.613ProTyr: 1.613 ± 0.774
0.0ProXaa: 0.0 ± 0.0
Gln
2.419GlnAla: 2.419 ± 0.621
0.0GlnCys: 0.0 ± 0.0
1.613GlnAsp: 1.613 ± 0.809
2.016GlnGlu: 2.016 ± 0.952
1.21GlnPhe: 1.21 ± 0.617
0.806GlnGly: 0.806 ± 0.509
1.613GlnHis: 1.613 ± 0.856
1.613GlnIle: 1.613 ± 0.52
1.613GlnLys: 1.613 ± 0.572
2.823GlnLeu: 2.823 ± 0.85
2.016GlnMet: 2.016 ± 0.888
2.419GlnAsn: 2.419 ± 1.446
3.226GlnPro: 3.226 ± 0.914
2.823GlnGln: 2.823 ± 1.05
3.226GlnArg: 3.226 ± 1.153
1.613GlnSer: 1.613 ± 0.856
2.823GlnThr: 2.823 ± 1.098
1.21GlnVal: 1.21 ± 0.579
2.016GlnTrp: 2.016 ± 1.381
1.21GlnTyr: 1.21 ± 0.789
0.0GlnXaa: 0.0 ± 0.0
Arg
3.226ArgAla: 3.226 ± 1.226
2.823ArgCys: 2.823 ± 2.019
1.613ArgAsp: 1.613 ± 0.91
1.613ArgGlu: 1.613 ± 0.478
1.21ArgPhe: 1.21 ± 0.683
5.242ArgGly: 5.242 ± 1.306
1.613ArgHis: 1.613 ± 0.669
2.419ArgIle: 2.419 ± 1.034
4.032ArgLys: 4.032 ± 0.331
4.435ArgLeu: 4.435 ± 0.848
0.0ArgMet: 0.0 ± 0.0
1.613ArgAsn: 1.613 ± 0.781
3.226ArgPro: 3.226 ± 0.753
1.21ArgGln: 1.21 ± 1.168
5.645ArgArg: 5.645 ± 2.363
4.435ArgSer: 4.435 ± 0.848
4.032ArgThr: 4.032 ± 0.496
2.823ArgVal: 2.823 ± 0.876
0.0ArgTrp: 0.0 ± 0.0
2.419ArgTyr: 2.419 ± 1.22
0.0ArgXaa: 0.0 ± 0.0
Ser
4.435SerAla: 4.435 ± 1.5
1.21SerCys: 1.21 ± 0.926
4.839SerAsp: 4.839 ± 0.887
4.839SerGlu: 4.839 ± 2.021
2.419SerPhe: 2.419 ± 0.531
6.855SerGly: 6.855 ± 0.776
2.823SerHis: 2.823 ± 0.968
4.032SerIle: 4.032 ± 1.165
2.419SerLys: 2.419 ± 1.277
8.468SerLeu: 8.468 ± 1.456
2.016SerMet: 2.016 ± 0.997
2.016SerAsn: 2.016 ± 0.642
8.065SerPro: 8.065 ± 1.566
2.016SerGln: 2.016 ± 0.804
4.032SerArg: 4.032 ± 1.38
10.081SerSer: 10.081 ± 3.622
6.452SerThr: 6.452 ± 1.857
3.226SerVal: 3.226 ± 0.467
1.21SerTrp: 1.21 ± 0.901
1.613SerTyr: 1.613 ± 0.638
0.0SerXaa: 0.0 ± 0.0
Thr
2.419ThrAla: 2.419 ± 0.551
2.419ThrCys: 2.419 ± 0.985
4.435ThrAsp: 4.435 ± 1.533
4.032ThrGlu: 4.032 ± 0.708
2.419ThrPhe: 2.419 ± 0.99
5.242ThrGly: 5.242 ± 1.116
1.613ThrHis: 1.613 ± 1.155
3.629ThrIle: 3.629 ± 0.974
2.823ThrLys: 2.823 ± 0.94
4.435ThrLeu: 4.435 ± 1.629
0.806ThrMet: 0.806 ± 0.461
3.629ThrAsn: 3.629 ± 0.894
3.629ThrPro: 3.629 ± 1.205
2.419ThrGln: 2.419 ± 0.873
3.226ThrArg: 3.226 ± 0.858
6.048ThrSer: 6.048 ± 1.761
4.839ThrThr: 4.839 ± 2.231
7.661ThrVal: 7.661 ± 2.061
2.419ThrTrp: 2.419 ± 1.198
1.21ThrTyr: 1.21 ± 0.763
0.0ThrXaa: 0.0 ± 0.0
Val
1.613ValAla: 1.613 ± 0.318
3.226ValCys: 3.226 ± 1.447
4.032ValAsp: 4.032 ± 1.962
4.032ValGlu: 4.032 ± 2.015
1.613ValPhe: 1.613 ± 0.478
6.048ValGly: 6.048 ± 1.53
1.21ValHis: 1.21 ± 1.045
3.629ValIle: 3.629 ± 1.146
2.016ValLys: 2.016 ± 0.428
6.452ValLeu: 6.452 ± 1.639
0.403ValMet: 0.403 ± 0.339
1.613ValAsn: 1.613 ± 0.718
5.242ValPro: 5.242 ± 1.968
2.823ValGln: 2.823 ± 1.03
2.016ValArg: 2.016 ± 1.19
4.839ValSer: 4.839 ± 1.88
6.452ValThr: 6.452 ± 2.036
4.839ValVal: 4.839 ± 1.171
0.806ValTrp: 0.806 ± 0.652
2.016ValTyr: 2.016 ± 0.86
0.0ValXaa: 0.0 ± 0.0
Trp
1.613TrpAla: 1.613 ± 0.52
0.403TrpCys: 0.403 ± 0.393
1.613TrpAsp: 1.613 ± 0.941
0.0TrpGlu: 0.0 ± 0.0
0.806TrpPhe: 0.806 ± 0.509
0.403TrpGly: 0.403 ± 0.567
0.403TrpHis: 0.403 ± 0.348
1.21TrpIle: 1.21 ± 1.016
2.419TrpLys: 2.419 ± 1.104
2.016TrpLeu: 2.016 ± 0.564
0.0TrpMet: 0.0 ± 0.0
0.806TrpAsn: 0.806 ± 0.779
0.403TrpPro: 0.403 ± 0.393
0.0TrpGln: 0.0 ± 0.0
1.21TrpArg: 1.21 ± 0.638
1.21TrpSer: 1.21 ± 0.584
1.21TrpThr: 1.21 ± 0.684
0.806TrpVal: 0.806 ± 0.461
0.0TrpTrp: 0.0 ± 0.0
0.403TrpTyr: 0.403 ± 0.339
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.613TyrAla: 1.613 ± 0.478
0.403TyrCys: 0.403 ± 0.393
2.016TyrAsp: 2.016 ± 0.631
1.613TyrGlu: 1.613 ± 0.518
2.823TyrPhe: 2.823 ± 0.982
2.016TyrGly: 2.016 ± 0.642
0.403TyrHis: 0.403 ± 0.339
2.016TyrIle: 2.016 ± 1.183
2.016TyrLys: 2.016 ± 0.553
2.823TyrLeu: 2.823 ± 0.596
1.21TyrMet: 1.21 ± 0.362
1.21TyrAsn: 1.21 ± 0.422
0.403TyrPro: 0.403 ± 0.567
0.806TyrGln: 0.806 ± 0.568
1.21TyrArg: 1.21 ± 0.532
0.806TyrSer: 0.806 ± 0.697
2.419TyrThr: 2.419 ± 0.531
1.613TyrVal: 1.613 ± 0.765
0.0TyrTrp: 0.0 ± 0.0
1.613TyrTyr: 1.613 ± 0.721
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2481 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski