Amino acid dipepetide frequency for Psittacus erithacus timneh papillomavirus (PePV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.842AlaAla: 8.842 ± 3.817
0.402AlaCys: 0.402 ± 0.494
5.627AlaAsp: 5.627 ± 1.616
3.215AlaGlu: 3.215 ± 1.225
1.206AlaPhe: 1.206 ± 0.482
3.215AlaGly: 3.215 ± 0.77
0.0AlaHis: 0.0 ± 0.0
2.814AlaIle: 2.814 ± 0.726
1.206AlaLys: 1.206 ± 0.391
7.235AlaLeu: 7.235 ± 1.773
0.804AlaMet: 0.804 ± 0.888
1.206AlaAsn: 1.206 ± 0.717
4.421AlaPro: 4.421 ± 1.728
1.608AlaGln: 1.608 ± 0.668
4.019AlaArg: 4.019 ± 1.588
4.019AlaSer: 4.019 ± 1.337
5.627AlaThr: 5.627 ± 1.016
3.215AlaVal: 3.215 ± 0.775
0.402AlaTrp: 0.402 ± 0.345
2.814AlaTyr: 2.814 ± 0.909
0.0AlaXaa: 0.0 ± 0.0
Cys
0.804CysAla: 0.804 ± 0.632
0.804CysCys: 0.804 ± 0.564
0.402CysAsp: 0.402 ± 0.316
0.402CysGlu: 0.402 ± 0.345
1.206CysPhe: 1.206 ± 0.613
0.804CysGly: 0.804 ± 0.622
0.402CysHis: 0.402 ± 0.473
0.804CysIle: 0.804 ± 0.69
1.206CysLys: 1.206 ± 0.513
0.804CysLeu: 0.804 ± 0.529
0.402CysMet: 0.402 ± 0.316
0.804CysAsn: 0.804 ± 0.596
2.01CysPro: 2.01 ± 0.835
0.804CysGln: 0.804 ± 0.333
1.206CysArg: 1.206 ± 0.513
0.402CysSer: 0.402 ± 0.345
2.412CysThr: 2.412 ± 1.257
0.804CysVal: 0.804 ± 0.632
0.402CysTrp: 0.402 ± 0.316
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.814AspAla: 2.814 ± 1.05
1.608AspCys: 1.608 ± 0.55
7.235AspAsp: 7.235 ± 2.595
3.617AspGlu: 3.617 ± 1.179
2.01AspPhe: 2.01 ± 0.91
4.823AspGly: 4.823 ± 1.333
0.804AspHis: 0.804 ± 0.529
4.823AspIle: 4.823 ± 1.203
3.215AspLys: 3.215 ± 1.4
5.225AspLeu: 5.225 ± 1.368
1.206AspMet: 1.206 ± 0.77
2.814AspAsn: 2.814 ± 0.964
7.637AspPro: 7.637 ± 1.145
0.402AspGln: 0.402 ± 0.316
2.814AspArg: 2.814 ± 1.206
7.235AspSer: 7.235 ± 1.836
5.627AspThr: 5.627 ± 1.254
5.225AspVal: 5.225 ± 0.656
0.804AspTrp: 0.804 ± 0.358
1.206AspTyr: 1.206 ± 0.628
0.0AspXaa: 0.0 ± 0.0
Glu
3.215GluAla: 3.215 ± 1.271
0.0GluCys: 0.0 ± 0.0
3.617GluAsp: 3.617 ± 0.591
5.225GluGlu: 5.225 ± 1.344
0.402GluPhe: 0.402 ± 0.332
5.225GluGly: 5.225 ± 1.649
2.814GluHis: 2.814 ± 0.906
2.412GluIle: 2.412 ± 0.848
0.402GluLys: 0.402 ± 0.316
4.823GluLeu: 4.823 ± 0.945
0.804GluMet: 0.804 ± 0.664
1.608GluAsn: 1.608 ± 1.008
2.412GluPro: 2.412 ± 0.946
1.608GluGln: 1.608 ± 0.882
2.814GluArg: 2.814 ± 0.989
3.215GluSer: 3.215 ± 1.26
5.627GluThr: 5.627 ± 1.66
2.01GluVal: 2.01 ± 0.935
2.01GluTrp: 2.01 ± 0.717
2.814GluTyr: 2.814 ± 0.955
0.0GluXaa: 0.0 ± 0.0
Phe
0.804PheAla: 0.804 ± 0.439
0.0PheCys: 0.0 ± 0.0
1.608PheAsp: 1.608 ± 0.668
2.01PheGlu: 2.01 ± 0.871
1.206PhePhe: 1.206 ± 0.327
2.814PheGly: 2.814 ± 0.764
0.402PheHis: 0.402 ± 0.473
2.01PheIle: 2.01 ± 0.801
0.804PheLys: 0.804 ± 0.69
4.019PheLeu: 4.019 ± 0.929
1.206PheMet: 1.206 ± 1.035
2.412PheAsn: 2.412 ± 0.681
2.01PhePro: 2.01 ± 1.345
0.804PheGln: 0.804 ± 0.664
2.814PheArg: 2.814 ± 0.955
1.206PheSer: 1.206 ± 0.327
3.215PheThr: 3.215 ± 1.01
1.608PheVal: 1.608 ± 0.702
1.206PheTrp: 1.206 ± 0.327
0.804PheTyr: 0.804 ± 0.765
0.0PheXaa: 0.0 ± 0.0
Gly
4.823GlyAla: 4.823 ± 1.595
1.608GlyCys: 1.608 ± 0.715
6.833GlyAsp: 6.833 ± 1.608
4.421GlyGlu: 4.421 ± 1.783
1.206GlyPhe: 1.206 ± 0.628
8.441GlyGly: 8.441 ± 1.596
1.206GlyHis: 1.206 ± 0.744
6.431GlyIle: 6.431 ± 2.156
1.206GlyLys: 1.206 ± 0.58
4.823GlyLeu: 4.823 ± 1.37
0.804GlyMet: 0.804 ± 0.721
2.814GlyAsn: 2.814 ± 0.864
4.421GlyPro: 4.421 ± 1.601
4.421GlyGln: 4.421 ± 0.291
6.029GlyArg: 6.029 ± 0.671
3.215GlySer: 3.215 ± 0.784
6.029GlyThr: 6.029 ± 2.153
5.225GlyVal: 5.225 ± 1.741
0.804GlyTrp: 0.804 ± 0.333
2.01GlyTyr: 2.01 ± 1.218
0.0GlyXaa: 0.0 ± 0.0
His
2.01HisAla: 2.01 ± 1.108
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
1.206HisGlu: 1.206 ± 0.629
0.402HisPhe: 0.402 ± 0.345
2.412HisGly: 2.412 ± 1.278
0.402HisHis: 0.402 ± 0.345
0.804HisIle: 0.804 ± 0.529
1.206HisLys: 1.206 ± 0.744
2.412HisLeu: 2.412 ± 0.982
0.804HisMet: 0.804 ± 0.439
1.608HisAsn: 1.608 ± 0.948
0.0HisPro: 0.0 ± 0.0
0.804HisGln: 0.804 ± 0.596
0.804HisArg: 0.804 ± 0.69
1.608HisSer: 1.608 ± 0.505
1.206HisThr: 1.206 ± 0.699
3.215HisVal: 3.215 ± 1.194
0.804HisTrp: 0.804 ± 0.439
1.608HisTyr: 1.608 ± 1.074
0.0HisXaa: 0.0 ± 0.0
Ile
2.412IleAla: 2.412 ± 0.796
0.804IleCys: 0.804 ± 0.544
3.617IleAsp: 3.617 ± 1.398
1.608IleGlu: 1.608 ± 0.505
0.402IlePhe: 0.402 ± 0.345
4.019IleGly: 4.019 ± 0.931
0.402IleHis: 0.402 ± 0.345
3.215IleIle: 3.215 ± 0.997
0.402IleLys: 0.402 ± 0.494
4.421IleLeu: 4.421 ± 2.452
1.206IleMet: 1.206 ± 0.631
1.608IleAsn: 1.608 ± 0.593
4.823IlePro: 4.823 ± 0.827
1.608IleGln: 1.608 ± 0.91
3.617IleArg: 3.617 ± 1.226
6.833IleSer: 6.833 ± 2.096
2.814IleThr: 2.814 ± 1.016
3.215IleVal: 3.215 ± 0.829
1.206IleTrp: 1.206 ± 0.918
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
0.804LysAla: 0.804 ± 0.632
1.206LysCys: 1.206 ± 0.494
2.412LysAsp: 2.412 ± 0.851
0.804LysGlu: 0.804 ± 0.69
1.608LysPhe: 1.608 ± 0.555
1.608LysGly: 1.608 ± 0.555
1.206LysHis: 1.206 ± 1.035
0.804LysIle: 0.804 ± 0.575
0.804LysLys: 0.804 ± 0.445
1.608LysLeu: 1.608 ± 0.668
0.402LysMet: 0.402 ± 0.345
0.402LysAsn: 0.402 ± 0.345
1.206LysPro: 1.206 ± 0.58
2.01LysGln: 2.01 ± 0.79
4.421LysArg: 4.421 ± 1.639
1.608LysSer: 1.608 ± 1.38
2.412LysThr: 2.412 ± 1.288
1.206LysVal: 1.206 ± 0.662
0.0LysTrp: 0.0 ± 0.0
2.01LysTyr: 2.01 ± 0.734
0.0LysXaa: 0.0 ± 0.0
Leu
3.617LeuAla: 3.617 ± 0.793
1.608LeuCys: 1.608 ± 0.749
6.431LeuAsp: 6.431 ± 0.906
3.215LeuGlu: 3.215 ± 0.951
4.421LeuPhe: 4.421 ± 1.336
6.029LeuGly: 6.029 ± 1.033
1.608LeuHis: 1.608 ± 0.531
4.019LeuIle: 4.019 ± 1.431
3.617LeuLys: 3.617 ± 1.304
9.646LeuLeu: 9.646 ± 2.292
2.01LeuMet: 2.01 ± 0.851
2.412LeuAsn: 2.412 ± 0.654
5.627LeuPro: 5.627 ± 2.314
6.833LeuGln: 6.833 ± 1.796
5.225LeuArg: 5.225 ± 1.734
8.039LeuSer: 8.039 ± 1.114
2.01LeuThr: 2.01 ± 0.961
3.215LeuVal: 3.215 ± 0.697
0.804LeuTrp: 0.804 ± 0.333
3.215LeuTyr: 3.215 ± 1.324
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
1.608MetCys: 1.608 ± 0.668
2.01MetAsp: 2.01 ± 0.533
3.215MetGlu: 3.215 ± 1.878
0.402MetPhe: 0.402 ± 0.345
0.0MetGly: 0.0 ± 0.0
0.402MetHis: 0.402 ± 0.316
1.608MetIle: 1.608 ± 0.89
0.402MetLys: 0.402 ± 0.494
0.804MetLeu: 0.804 ± 0.358
0.0MetMet: 0.0 ± 0.0
0.402MetAsn: 0.402 ± 0.316
0.0MetPro: 0.0 ± 0.0
0.804MetGln: 0.804 ± 0.544
2.01MetArg: 2.01 ± 0.994
1.206MetSer: 1.206 ± 0.628
0.402MetThr: 0.402 ± 0.316
2.412MetVal: 2.412 ± 0.52
0.402MetTrp: 0.402 ± 0.316
0.402MetTyr: 0.402 ± 0.345
0.0MetXaa: 0.0 ± 0.0
Asn
1.608AsnAla: 1.608 ± 0.668
0.804AsnCys: 0.804 ± 0.564
2.814AsnAsp: 2.814 ± 0.813
0.0AsnGlu: 0.0 ± 0.0
0.0AsnPhe: 0.0 ± 0.0
2.01AsnGly: 2.01 ± 0.888
0.804AsnHis: 0.804 ± 0.544
2.01AsnIle: 2.01 ± 0.395
1.608AsnLys: 1.608 ± 0.683
1.206AsnLeu: 1.206 ± 0.677
0.402AsnMet: 0.402 ± 0.345
1.206AsnAsn: 1.206 ± 0.58
3.215AsnPro: 3.215 ± 0.75
1.206AsnGln: 1.206 ± 0.662
2.412AsnArg: 2.412 ± 0.809
3.617AsnSer: 3.617 ± 1.051
3.617AsnThr: 3.617 ± 1.613
2.412AsnVal: 2.412 ± 1.045
0.0AsnTrp: 0.0 ± 0.0
2.01AsnTyr: 2.01 ± 0.823
0.0AsnXaa: 0.0 ± 0.0
Pro
6.431ProAla: 6.431 ± 1.74
0.804ProCys: 0.804 ± 0.358
6.431ProAsp: 6.431 ± 1.783
4.019ProGlu: 4.019 ± 1.335
2.412ProPhe: 2.412 ± 1.991
2.814ProGly: 2.814 ± 0.651
2.814ProHis: 2.814 ± 0.934
2.01ProIle: 2.01 ± 0.533
2.814ProLys: 2.814 ± 1.308
6.833ProLeu: 6.833 ± 1.876
0.0ProMet: 0.0 ± 0.0
3.215ProAsn: 3.215 ± 0.674
7.637ProPro: 7.637 ± 2.31
1.608ProGln: 1.608 ± 0.538
4.421ProArg: 4.421 ± 1.089
4.421ProSer: 4.421 ± 1.681
7.235ProThr: 7.235 ± 2.094
6.029ProVal: 6.029 ± 2.893
1.206ProTrp: 1.206 ± 0.939
3.617ProTyr: 3.617 ± 0.688
0.0ProXaa: 0.0 ± 0.0
Gln
1.608GlnAla: 1.608 ± 0.715
0.804GlnCys: 0.804 ± 0.358
2.412GlnAsp: 2.412 ± 0.972
2.814GlnGlu: 2.814 ± 1.622
0.804GlnPhe: 0.804 ± 0.358
1.608GlnGly: 1.608 ± 1.093
2.412GlnHis: 2.412 ± 0.873
0.804GlnIle: 0.804 ± 0.333
1.206GlnLys: 1.206 ± 0.569
4.823GlnLeu: 4.823 ± 1.045
0.804GlnMet: 0.804 ± 0.358
0.804GlnAsn: 0.804 ± 0.444
2.814GlnPro: 2.814 ± 0.846
0.402GlnGln: 0.402 ± 0.473
2.814GlnArg: 2.814 ± 1.021
0.804GlnSer: 0.804 ± 0.544
2.01GlnThr: 2.01 ± 0.743
2.814GlnVal: 2.814 ± 0.739
1.206GlnTrp: 1.206 ± 0.592
1.206GlnTyr: 1.206 ± 0.327
0.0GlnXaa: 0.0 ± 0.0
Arg
2.814ArgAla: 2.814 ± 0.552
1.608ArgCys: 1.608 ± 0.668
2.412ArgAsp: 2.412 ± 1.055
4.019ArgGlu: 4.019 ± 0.961
2.412ArgPhe: 2.412 ± 0.567
5.627ArgGly: 5.627 ± 1.031
1.608ArgHis: 1.608 ± 0.668
2.814ArgIle: 2.814 ± 0.555
2.01ArgLys: 2.01 ± 0.533
8.842ArgLeu: 8.842 ± 3.461
2.814ArgMet: 2.814 ± 0.914
1.206ArgAsn: 1.206 ± 0.744
5.627ArgPro: 5.627 ± 1.411
2.01ArgGln: 2.01 ± 0.931
11.656ArgArg: 11.656 ± 3.262
5.225ArgSer: 5.225 ± 1.311
6.029ArgThr: 6.029 ± 1.782
5.225ArgVal: 5.225 ± 2.472
0.402ArgTrp: 0.402 ± 0.316
3.215ArgTyr: 3.215 ± 0.613
0.0ArgXaa: 0.0 ± 0.0
Ser
8.039SerAla: 8.039 ± 2.357
0.402SerCys: 0.402 ± 0.316
5.627SerAsp: 5.627 ± 0.875
4.019SerGlu: 4.019 ± 0.531
3.215SerPhe: 3.215 ± 0.975
7.235SerGly: 7.235 ± 0.996
2.412SerHis: 2.412 ± 1.288
2.412SerIle: 2.412 ± 0.945
2.01SerLys: 2.01 ± 0.962
4.019SerLeu: 4.019 ± 1.703
1.206SerMet: 1.206 ± 0.494
3.617SerAsn: 3.617 ± 1.088
6.029SerPro: 6.029 ± 0.693
2.01SerGln: 2.01 ± 1.281
3.617SerArg: 3.617 ± 0.894
5.225SerSer: 5.225 ± 2.264
6.029SerThr: 6.029 ± 1.635
4.019SerVal: 4.019 ± 1.384
0.0SerTrp: 0.0 ± 0.0
2.01SerTyr: 2.01 ± 0.585
0.0SerXaa: 0.0 ± 0.0
Thr
6.029ThrAla: 6.029 ± 1.251
0.0ThrCys: 0.0 ± 0.0
4.019ThrAsp: 4.019 ± 1.463
4.421ThrGlu: 4.421 ± 1.909
4.421ThrPhe: 4.421 ± 1.029
6.029ThrGly: 6.029 ± 1.417
1.206ThrHis: 1.206 ± 0.778
5.225ThrIle: 5.225 ± 1.262
1.206ThrLys: 1.206 ± 0.327
3.215ThrLeu: 3.215 ± 1.241
0.804ThrMet: 0.804 ± 0.544
2.01ThrAsn: 2.01 ± 0.533
6.833ThrPro: 6.833 ± 1.393
2.412ThrGln: 2.412 ± 0.743
7.637ThrArg: 7.637 ± 1.572
5.627ThrSer: 5.627 ± 1.254
4.823ThrThr: 4.823 ± 1.48
6.029ThrVal: 6.029 ± 1.129
1.608ThrTrp: 1.608 ± 1.111
1.206ThrTyr: 1.206 ± 0.498
0.0ThrXaa: 0.0 ± 0.0
Val
1.608ValAla: 1.608 ± 0.428
2.01ValCys: 2.01 ± 1.067
4.019ValAsp: 4.019 ± 1.193
2.814ValGlu: 2.814 ± 1.434
3.215ValPhe: 3.215 ± 0.785
6.833ValGly: 6.833 ± 1.295
0.402ValHis: 0.402 ± 0.345
1.206ValIle: 1.206 ± 0.592
0.804ValLys: 0.804 ± 0.445
5.225ValLeu: 5.225 ± 1.451
0.804ValMet: 0.804 ± 0.358
1.608ValAsn: 1.608 ± 0.428
6.431ValPro: 6.431 ± 2.447
2.01ValGln: 2.01 ± 0.888
4.823ValArg: 4.823 ± 1.293
8.842ValSer: 8.842 ± 1.519
4.019ValThr: 4.019 ± 1.119
4.019ValVal: 4.019 ± 1.756
0.402ValTrp: 0.402 ± 0.316
3.617ValTyr: 3.617 ± 0.944
0.0ValXaa: 0.0 ± 0.0
Trp
0.804TrpAla: 0.804 ± 0.358
0.0TrpCys: 0.0 ± 0.0
1.206TrpAsp: 1.206 ± 0.365
0.804TrpGlu: 0.804 ± 0.765
0.402TrpPhe: 0.402 ± 0.332
2.412TrpGly: 2.412 ± 0.961
0.804TrpHis: 0.804 ± 0.765
1.206TrpIle: 1.206 ± 0.391
0.804TrpLys: 0.804 ± 0.544
1.206TrpLeu: 1.206 ± 0.628
0.402TrpMet: 0.402 ± 0.345
0.402TrpAsn: 0.402 ± 0.473
0.402TrpPro: 0.402 ± 0.345
1.206TrpGln: 1.206 ± 0.77
1.206TrpArg: 1.206 ± 0.365
0.402TrpSer: 0.402 ± 0.383
0.804TrpThr: 0.804 ± 0.439
0.804TrpVal: 0.804 ± 0.333
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.814TyrAla: 2.814 ± 0.775
0.804TyrCys: 0.804 ± 0.596
2.412TyrAsp: 2.412 ± 0.819
1.206TyrGlu: 1.206 ± 0.628
1.608TyrPhe: 1.608 ± 0.468
3.215TyrGly: 3.215 ± 0.775
1.206TyrHis: 1.206 ± 0.613
0.804TyrIle: 0.804 ± 0.632
1.608TyrLys: 1.608 ± 0.686
2.412TyrLeu: 2.412 ± 0.846
1.206TyrMet: 1.206 ± 0.628
0.804TyrAsn: 0.804 ± 0.445
3.215TyrPro: 3.215 ± 1.271
0.402TyrGln: 0.402 ± 0.345
3.215TyrArg: 3.215 ± 0.632
0.402TyrSer: 0.402 ± 0.473
2.814TyrThr: 2.814 ± 0.888
2.01TyrVal: 2.01 ± 0.91
1.608TyrTrp: 1.608 ± 0.686
1.206TyrTyr: 1.206 ± 0.569
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2489 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski