Amino acid dipepetide frequency for Phocoena phocoena papillomavirus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.375AlaAla: 4.375 ± 1.113
1.989AlaCys: 1.989 ± 1.023
3.182AlaAsp: 3.182 ± 0.615
3.978AlaGlu: 3.978 ± 0.934
3.182AlaPhe: 3.182 ± 0.624
3.182AlaGly: 3.182 ± 0.845
0.796AlaHis: 0.796 ± 0.54
1.591AlaIle: 1.591 ± 0.512
1.193AlaLys: 1.193 ± 0.5
5.569AlaLeu: 5.569 ± 1.229
1.591AlaMet: 1.591 ± 0.672
3.182AlaAsn: 3.182 ± 0.798
4.375AlaPro: 4.375 ± 0.692
3.58AlaGln: 3.58 ± 1.176
3.58AlaArg: 3.58 ± 0.556
3.978AlaSer: 3.978 ± 0.837
5.967AlaThr: 5.967 ± 2.086
4.773AlaVal: 4.773 ± 0.654
0.398AlaTrp: 0.398 ± 0.332
1.591AlaTyr: 1.591 ± 0.58
0.0AlaXaa: 0.0 ± 0.0
Cys
2.784CysAla: 2.784 ± 0.608
0.398CysCys: 0.398 ± 0.332
1.193CysAsp: 1.193 ± 0.708
1.193CysGlu: 1.193 ± 0.437
0.796CysPhe: 0.796 ± 0.448
0.398CysGly: 0.398 ± 0.332
0.0CysHis: 0.0 ± 0.0
1.193CysIle: 1.193 ± 0.5
4.375CysLys: 4.375 ± 2.064
1.591CysLeu: 1.591 ± 0.903
1.591CysMet: 1.591 ± 0.784
0.398CysAsn: 0.398 ± 0.346
1.989CysPro: 1.989 ± 0.675
1.193CysGln: 1.193 ± 0.437
0.398CysArg: 0.398 ± 0.346
2.784CysSer: 2.784 ± 1.577
0.796CysThr: 0.796 ± 0.448
2.387CysVal: 2.387 ± 1.356
1.591CysTrp: 1.591 ± 0.678
0.796CysTyr: 0.796 ± 0.475
0.0CysXaa: 0.0 ± 0.0
Asp
4.773AspAla: 4.773 ± 0.996
1.989AspCys: 1.989 ± 0.622
6.364AspAsp: 6.364 ± 1.746
2.387AspGlu: 2.387 ± 0.89
1.591AspPhe: 1.591 ± 0.915
5.569AspGly: 5.569 ± 1.563
0.0AspHis: 0.0 ± 0.0
4.773AspIle: 4.773 ± 1.685
3.58AspLys: 3.58 ± 1.095
5.569AspLeu: 5.569 ± 1.465
0.398AspMet: 0.398 ± 0.305
2.784AspAsn: 2.784 ± 0.775
6.364AspPro: 6.364 ± 1.746
0.796AspGln: 0.796 ± 0.683
1.989AspArg: 1.989 ± 0.452
5.569AspSer: 5.569 ± 0.965
4.773AspThr: 4.773 ± 1.629
3.58AspVal: 3.58 ± 0.448
1.193AspTrp: 1.193 ± 1.037
0.796AspTyr: 0.796 ± 0.691
0.0AspXaa: 0.0 ± 0.0
Glu
3.978GluAla: 3.978 ± 0.791
1.193GluCys: 1.193 ± 0.708
6.364GluAsp: 6.364 ± 1.761
4.375GluGlu: 4.375 ± 1.24
1.193GluPhe: 1.193 ± 0.596
5.171GluGly: 5.171 ± 1.813
0.398GluHis: 0.398 ± 0.369
1.193GluIle: 1.193 ± 0.619
4.773GluLys: 4.773 ± 0.773
3.978GluLeu: 3.978 ± 0.876
1.193GluMet: 1.193 ± 0.473
1.591GluAsn: 1.591 ± 0.658
3.182GluPro: 3.182 ± 0.87
1.989GluGln: 1.989 ± 0.975
1.193GluArg: 1.193 ± 0.596
3.58GluSer: 3.58 ± 0.978
2.784GluThr: 2.784 ± 1.03
1.989GluVal: 1.989 ± 0.695
0.0GluTrp: 0.0 ± 0.0
1.193GluTyr: 1.193 ± 0.359
0.0GluXaa: 0.0 ± 0.0
Phe
2.784PheAla: 2.784 ± 0.896
1.591PheCys: 1.591 ± 1.011
3.58PheAsp: 3.58 ± 1.056
1.193PheGlu: 1.193 ± 1.108
0.796PhePhe: 0.796 ± 0.362
1.989PheGly: 1.989 ± 0.444
0.0PheHis: 0.0 ± 0.0
1.989PheIle: 1.989 ± 0.816
0.796PheLys: 0.796 ± 0.448
3.58PheLeu: 3.58 ± 0.983
0.796PheMet: 0.796 ± 0.439
1.989PheAsn: 1.989 ± 1.006
1.193PhePro: 1.193 ± 0.5
2.387PheGln: 2.387 ± 0.745
1.591PheArg: 1.591 ± 0.512
3.182PheSer: 3.182 ± 1.155
2.387PheThr: 2.387 ± 0.721
1.193PheVal: 1.193 ± 0.596
1.591PheTrp: 1.591 ± 0.759
1.989PheTyr: 1.989 ± 0.75
0.0PheXaa: 0.0 ± 0.0
Gly
2.784GlyAla: 2.784 ± 0.535
1.989GlyCys: 1.989 ± 0.669
7.558GlyAsp: 7.558 ± 1.7
1.989GlyGlu: 1.989 ± 1.244
2.387GlyPhe: 2.387 ± 1.084
8.353GlyGly: 8.353 ± 1.595
3.182GlyHis: 3.182 ± 0.634
5.967GlyIle: 5.967 ± 1.033
3.978GlyLys: 3.978 ± 1.203
4.375GlyLeu: 4.375 ± 1.452
0.796GlyMet: 0.796 ± 0.691
1.989GlyAsn: 1.989 ± 1.018
2.387GlyPro: 2.387 ± 0.506
1.193GlyGln: 1.193 ± 0.513
5.171GlyArg: 5.171 ± 0.586
7.558GlySer: 7.558 ± 2.013
5.967GlyThr: 5.967 ± 1.904
3.182GlyVal: 3.182 ± 0.624
0.398GlyTrp: 0.398 ± 0.346
0.796GlyTyr: 0.796 ± 0.952
0.0GlyXaa: 0.0 ± 0.0
His
1.989HisAla: 1.989 ± 0.908
1.193HisCys: 1.193 ± 0.596
0.398HisAsp: 0.398 ± 0.369
0.0HisGlu: 0.0 ± 0.0
1.591HisPhe: 1.591 ± 0.987
1.989HisGly: 1.989 ± 0.945
0.796HisHis: 0.796 ± 0.978
1.989HisIle: 1.989 ± 0.495
1.193HisLys: 1.193 ± 0.5
3.182HisLeu: 3.182 ± 1.52
0.0HisMet: 0.0 ± 0.0
0.398HisAsn: 0.398 ± 0.369
2.784HisPro: 2.784 ± 1.295
2.784HisGln: 2.784 ± 2.406
0.796HisArg: 0.796 ± 0.448
1.193HisSer: 1.193 ± 0.604
0.796HisThr: 0.796 ± 0.611
1.193HisVal: 1.193 ± 1.037
0.398HisTrp: 0.398 ± 0.305
1.193HisTyr: 1.193 ± 0.724
0.0HisXaa: 0.0 ± 0.0
Ile
2.784IleAla: 2.784 ± 1.262
1.193IleCys: 1.193 ± 0.708
4.375IleAsp: 4.375 ± 1.667
3.978IleGlu: 3.978 ± 1.811
1.193IlePhe: 1.193 ± 0.916
4.773IleGly: 4.773 ± 1.084
1.591IleHis: 1.591 ± 1.072
1.193IleIle: 1.193 ± 1.108
1.193IleLys: 1.193 ± 0.681
5.967IleLeu: 5.967 ± 1.606
0.796IleMet: 0.796 ± 0.417
1.591IleAsn: 1.591 ± 0.435
3.58IlePro: 3.58 ± 1.558
1.591IleGln: 1.591 ± 0.588
0.796IleArg: 0.796 ± 0.739
2.784IleSer: 2.784 ± 0.682
3.182IleThr: 3.182 ± 1.041
0.398IleVal: 0.398 ± 0.369
0.398IleTrp: 0.398 ± 0.476
1.591IleTyr: 1.591 ± 0.607
0.0IleXaa: 0.0 ± 0.0
Lys
2.784LysAla: 2.784 ± 1.104
1.591LysCys: 1.591 ± 0.521
2.387LysAsp: 2.387 ± 1.2
1.193LysGlu: 1.193 ± 0.682
2.387LysPhe: 2.387 ± 0.464
2.387LysGly: 2.387 ± 0.937
2.387LysHis: 2.387 ± 1.317
1.193LysIle: 1.193 ± 0.53
4.773LysLys: 4.773 ± 1.807
2.784LysLeu: 2.784 ± 0.631
0.796LysMet: 0.796 ± 0.379
3.58LysAsn: 3.58 ± 0.531
2.784LysPro: 2.784 ± 1.133
3.978LysGln: 3.978 ± 1.771
3.978LysArg: 3.978 ± 0.923
3.978LysSer: 3.978 ± 0.814
3.182LysThr: 3.182 ± 0.676
1.989LysVal: 1.989 ± 0.856
1.591LysTrp: 1.591 ± 0.248
3.182LysTyr: 3.182 ± 1.496
0.0LysXaa: 0.0 ± 0.0
Leu
2.387LeuAla: 2.387 ± 0.984
1.989LeuCys: 1.989 ± 1.032
5.569LeuAsp: 5.569 ± 0.611
4.375LeuGlu: 4.375 ± 1.718
2.387LeuPhe: 2.387 ± 1.317
5.569LeuGly: 5.569 ± 1.673
3.58LeuHis: 3.58 ± 0.912
4.773LeuIle: 4.773 ± 1.21
5.569LeuLys: 5.569 ± 1.788
9.149LeuLeu: 9.149 ± 2.867
1.193LeuMet: 1.193 ± 0.496
1.591LeuAsn: 1.591 ± 0.737
5.967LeuPro: 5.967 ± 2.379
3.978LeuGln: 3.978 ± 0.764
6.762LeuArg: 6.762 ± 1.22
7.16LeuSer: 7.16 ± 1.595
7.558LeuThr: 7.558 ± 1.129
5.171LeuVal: 5.171 ± 0.616
1.193LeuTrp: 1.193 ± 0.555
3.978LeuTyr: 3.978 ± 1.406
0.0LeuXaa: 0.0 ± 0.0
Met
1.591MetAla: 1.591 ± 0.608
0.796MetCys: 0.796 ± 0.448
2.387MetAsp: 2.387 ± 0.498
1.193MetGlu: 1.193 ± 0.708
1.193MetPhe: 1.193 ± 0.644
1.193MetGly: 1.193 ± 0.778
0.398MetHis: 0.398 ± 0.346
0.796MetIle: 0.796 ± 0.691
0.398MetLys: 0.398 ± 0.346
2.387MetLeu: 2.387 ± 0.721
0.0MetMet: 0.0 ± 0.0
0.796MetAsn: 0.796 ± 0.528
0.398MetPro: 0.398 ± 0.305
1.989MetGln: 1.989 ± 0.975
0.398MetArg: 0.398 ± 0.369
0.796MetSer: 0.796 ± 0.691
1.989MetThr: 1.989 ± 0.57
0.398MetVal: 0.398 ± 0.346
0.0MetTrp: 0.0 ± 0.0
0.796MetTyr: 0.796 ± 0.404
0.0MetXaa: 0.0 ± 0.0
Asn
3.978AsnAla: 3.978 ± 1.08
3.182AsnCys: 3.182 ± 1.5
1.193AsnAsp: 1.193 ± 0.728
0.796AsnGlu: 0.796 ± 0.448
0.796AsnPhe: 0.796 ± 0.952
3.182AsnGly: 3.182 ± 1.401
0.0AsnHis: 0.0 ± 0.0
1.989AsnIle: 1.989 ± 0.578
0.796AsnLys: 0.796 ± 0.415
2.387AsnLeu: 2.387 ± 0.721
1.591AsnMet: 1.591 ± 1.056
1.989AsnAsn: 1.989 ± 0.769
2.387AsnPro: 2.387 ± 0.625
1.591AsnGln: 1.591 ± 0.672
1.591AsnArg: 1.591 ± 0.501
2.387AsnSer: 2.387 ± 0.498
3.978AsnThr: 3.978 ± 2.039
3.182AsnVal: 3.182 ± 1.182
0.398AsnTrp: 0.398 ± 0.346
0.796AsnTyr: 0.796 ± 0.565
0.0AsnXaa: 0.0 ± 0.0
Pro
3.182ProAla: 3.182 ± 1.036
0.398ProCys: 0.398 ± 0.346
3.182ProAsp: 3.182 ± 0.813
5.171ProGlu: 5.171 ± 1.211
2.387ProPhe: 2.387 ± 0.857
1.193ProGly: 1.193 ± 0.359
1.193ProHis: 1.193 ± 0.313
2.784ProIle: 2.784 ± 0.712
2.387ProLys: 2.387 ± 0.826
8.751ProLeu: 8.751 ± 1.703
1.193ProMet: 1.193 ± 0.645
2.784ProAsn: 2.784 ± 0.717
8.751ProPro: 8.751 ± 2.168
2.387ProGln: 2.387 ± 1.474
3.978ProArg: 3.978 ± 0.901
5.171ProSer: 5.171 ± 2.404
6.364ProThr: 6.364 ± 2.853
3.978ProVal: 3.978 ± 1.265
0.0ProTrp: 0.0 ± 0.0
1.193ProTyr: 1.193 ± 0.612
0.0ProXaa: 0.0 ± 0.0
Gln
3.182GlnAla: 3.182 ± 1.204
1.591GlnCys: 1.591 ± 0.756
1.989GlnAsp: 1.989 ± 0.691
3.978GlnGlu: 3.978 ± 1.445
1.989GlnPhe: 1.989 ± 0.695
3.182GlnGly: 3.182 ± 1.679
3.182GlnHis: 3.182 ± 2.834
0.398GlnIle: 0.398 ± 0.346
0.0GlnLys: 0.0 ± 0.0
5.569GlnLeu: 5.569 ± 1.265
2.387GlnMet: 2.387 ± 0.836
1.193GlnAsn: 1.193 ± 0.596
2.387GlnPro: 2.387 ± 0.964
1.989GlnGln: 1.989 ± 1.936
1.591GlnArg: 1.591 ± 0.713
2.784GlnSer: 2.784 ± 1.02
1.591GlnThr: 1.591 ± 0.768
2.387GlnVal: 2.387 ± 1.275
1.591GlnTrp: 1.591 ± 1.048
0.398GlnTyr: 0.398 ± 0.305
0.0GlnXaa: 0.0 ± 0.0
Arg
4.773ArgAla: 4.773 ± 1.635
1.193ArgCys: 1.193 ± 0.681
1.193ArgAsp: 1.193 ± 0.658
2.387ArgGlu: 2.387 ± 0.89
0.796ArgPhe: 0.796 ± 0.404
4.773ArgGly: 4.773 ± 0.692
1.193ArgHis: 1.193 ± 0.596
0.796ArgIle: 0.796 ± 0.404
3.58ArgLys: 3.58 ± 0.76
4.375ArgLeu: 4.375 ± 0.875
0.398ArgMet: 0.398 ± 0.402
2.387ArgAsn: 2.387 ± 0.809
3.58ArgPro: 3.58 ± 2.255
1.989ArgGln: 1.989 ± 0.495
4.375ArgArg: 4.375 ± 1.75
2.784ArgSer: 2.784 ± 0.439
2.784ArgThr: 2.784 ± 0.521
3.978ArgVal: 3.978 ± 0.838
1.193ArgTrp: 1.193 ± 0.359
1.193ArgTyr: 1.193 ± 0.406
0.0ArgXaa: 0.0 ± 0.0
Ser
3.58SerAla: 3.58 ± 0.923
0.398SerCys: 0.398 ± 0.332
4.375SerAsp: 4.375 ± 0.621
2.387SerGlu: 2.387 ± 0.466
2.784SerPhe: 2.784 ± 0.649
7.558SerGly: 7.558 ± 1.135
3.978SerHis: 3.978 ± 0.629
2.387SerIle: 2.387 ± 1.762
2.784SerLys: 2.784 ± 0.697
8.353SerLeu: 8.353 ± 1.68
1.591SerMet: 1.591 ± 0.723
4.375SerAsn: 4.375 ± 1.395
4.375SerPro: 4.375 ± 1.79
1.193SerGln: 1.193 ± 0.604
3.182SerArg: 3.182 ± 1.03
8.353SerSer: 8.353 ± 2.968
8.353SerThr: 8.353 ± 1.611
4.773SerVal: 4.773 ± 2.237
0.796SerTrp: 0.796 ± 0.952
1.193SerTyr: 1.193 ± 0.628
0.0SerXaa: 0.0 ± 0.0
Thr
3.978ThrAla: 3.978 ± 0.724
1.989ThrCys: 1.989 ± 0.972
5.171ThrAsp: 5.171 ± 0.763
2.784ThrGlu: 2.784 ± 1.32
4.375ThrPhe: 4.375 ± 1.42
6.762ThrGly: 6.762 ± 1.583
1.591ThrHis: 1.591 ± 0.987
4.773ThrIle: 4.773 ± 1.239
3.58ThrLys: 3.58 ± 2.354
4.375ThrLeu: 4.375 ± 1.68
1.193ThrMet: 1.193 ± 0.437
1.989ThrAsn: 1.989 ± 0.578
4.773ThrPro: 4.773 ± 1.388
3.978ThrGln: 3.978 ± 1.566
3.58ThrArg: 3.58 ± 0.974
5.967ThrSer: 5.967 ± 1.367
5.569ThrThr: 5.569 ± 1.499
6.762ThrVal: 6.762 ± 2.117
1.591ThrTrp: 1.591 ± 0.746
1.591ThrTyr: 1.591 ± 0.594
0.0ThrXaa: 0.0 ± 0.0
Val
3.978ValAla: 3.978 ± 1.239
2.387ValCys: 2.387 ± 0.949
2.784ValAsp: 2.784 ± 0.835
5.171ValGlu: 5.171 ± 1.815
2.784ValPhe: 2.784 ± 1.028
2.387ValGly: 2.387 ± 0.647
1.193ValHis: 1.193 ± 0.645
1.989ValIle: 1.989 ± 1.343
2.784ValLys: 2.784 ± 1.029
3.182ValLeu: 3.182 ± 0.665
0.796ValMet: 0.796 ± 0.448
1.193ValAsn: 1.193 ± 0.604
4.375ValPro: 4.375 ± 0.818
3.182ValGln: 3.182 ± 0.615
1.989ValArg: 1.989 ± 0.618
4.773ValSer: 4.773 ± 1.505
5.569ValThr: 5.569 ± 2.1
3.182ValVal: 3.182 ± 0.993
0.796ValTrp: 0.796 ± 0.565
2.387ValTyr: 2.387 ± 0.552
0.0ValXaa: 0.0 ± 0.0
Trp
0.796TrpAla: 0.796 ± 0.691
0.0TrpCys: 0.0 ± 0.0
0.796TrpAsp: 0.796 ± 0.379
0.796TrpGlu: 0.796 ± 0.448
0.796TrpPhe: 0.796 ± 0.565
0.398TrpGly: 0.398 ± 0.476
0.0TrpHis: 0.0 ± 0.0
1.193TrpIle: 1.193 ± 0.658
2.784TrpLys: 2.784 ± 1.342
0.796TrpLeu: 0.796 ± 0.362
0.796TrpMet: 0.796 ± 0.528
1.193TrpAsn: 1.193 ± 0.916
0.398TrpPro: 0.398 ± 0.369
0.0TrpGln: 0.0 ± 0.0
1.193TrpArg: 1.193 ± 0.784
1.193TrpSer: 1.193 ± 0.608
0.796TrpThr: 0.796 ± 0.557
1.193TrpVal: 1.193 ± 0.728
0.0TrpTrp: 0.0 ± 0.0
0.398TrpTyr: 0.398 ± 0.346
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.591TyrAla: 1.591 ± 0.647
0.796TyrCys: 0.796 ± 0.557
0.796TyrAsp: 0.796 ± 0.362
1.989TyrGlu: 1.989 ± 0.604
1.193TyrPhe: 1.193 ± 0.313
1.989TyrGly: 1.989 ± 0.444
0.398TyrHis: 0.398 ± 0.305
1.989TyrIle: 1.989 ± 0.75
2.387TyrLys: 2.387 ± 0.721
3.978TyrLeu: 3.978 ± 1.513
0.398TyrMet: 0.398 ± 0.332
1.193TyrAsn: 1.193 ± 0.916
0.398TyrPro: 0.398 ± 0.369
1.591TyrGln: 1.591 ± 0.608
1.989TyrArg: 1.989 ± 0.729
0.796TyrSer: 0.796 ± 0.739
1.989TyrThr: 1.989 ± 0.495
1.193TyrVal: 1.193 ± 0.995
0.398TyrTrp: 0.398 ± 0.476
1.591TyrTyr: 1.591 ± 0.756
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2515 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski