Amino acid dipeptide frequency for Pyrobaculum filamentous virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
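As a rough illustration of the idea, the sketch below counts overlapping residue pairs in a set of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter toy sequences, and the choice to normalise by the total number of dipeptides are all assumptions made for this example; the database's exact counting and normalisation are not stated on this page.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return overlapping dipeptide frequencies in per mille (hypothetical helper)."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 along each protein;
        # pairs never span the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptides observed.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Invented toy sequences (one-letter codes), not taken from the proteome.
toy = ["MAVVA", "AVG"]
freqs = dipeptide_frequencies(toy)
```

Here the toy input yields six dipeptides in total (MA, AV, VV, VA, AV, VG), so `freqs["AV"]` is two sixths of 1000.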

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
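The uniform expectation can be reproduced with a few lines of arithmetic. The sketch below is only illustrative: the names `uniform_expectation_permille` and `classify` are hypothetical, and comparing each value directly against the uniform expectation is an assumption about how over- and underrepresentation could be decided, not a description of the database's colouring code.

```python
def uniform_expectation_permille(n_residues=20):
    """Expected per mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / (n_residues ** 2)


def classify(freq_permille, expected=None):
    """Label a dipeptide frequency relative to the uniform expectation (assumed rule)."""
    if expected is None:
        expected = uniform_expectation_permille()
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


expected_20 = uniform_expectation_permille(20)   # 400 possible dipeptides
expected_21 = uniform_expectation_permille(21)   # 441 possible dipeptides, with Xaa

# Two values taken from the table: ValVal (14.227) and HisHis (0.178).
valval = classify(14.227)
hishis = classify(0.178)
```

With 20 residues the expectation is 2.5‰, so ValVal falls on the "over" side and HisHis on the "under" side.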

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is frequency ± error in ‰.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.379 ± 0.906 | 0.889 ± 0.445 | 2.134 ± 0.563 | 3.379 ± 0.952 | 2.845 ± 0.74 | 4.446 ± 1.126 | 0.889 ± 0.39 | 2.845 ± 0.494 | 3.023 ± 0.72 | 6.402 ± 1.356 | 3.201 ± 0.857 | 1.956 ± 0.788 | 2.134 ± 0.693 | 1.245 ± 0.415 | 3.379 ± 0.789 | 5.157 ± 0.926 | 3.913 ± 0.976 | 9.959 ± 1.578 | 1.067 ± 0.387 | 3.557 ± 0.72 | 0.0 ± 0.0
Cys | 0.534 ± 0.312 | 0.534 ± 0.291 | 1.067 ± 0.421 | 1.601 ± 0.603 | 0.356 ± 0.217 | 1.423 ± 0.631 | 0.356 ± 0.241 | 1.067 ± 0.457 | 0.178 ± 0.193 | 0.889 ± 0.498 | 1.067 ± 0.49 | 0.711 ± 0.385 | 0.178 ± 0.178 | 0.178 ± 0.198 | 2.312 ± 0.643 | 0.711 ± 0.291 | 0.889 ± 0.371 | 2.134 ± 0.681 | 0.356 ± 0.234 | 0.356 ± 0.393 | 0.0 ± 0.0
Asp | 3.557 ± 0.952 | 0.711 ± 0.34 | 2.312 ± 0.585 | 3.735 ± 0.983 | 2.49 ± 0.662 | 6.224 ± 1.143 | 0.356 ± 0.262 | 4.802 ± 0.937 | 4.09 ± 0.873 | 3.023 ± 0.684 | 1.601 ± 0.606 | 1.778 ± 0.537 | 3.201 ± 1.016 | 0.356 ± 0.243 | 4.446 ± 0.852 | 4.446 ± 0.785 | 2.312 ± 0.719 | 6.758 ± 1.202 | 1.956 ± 0.555 | 1.778 ± 0.531 | 0.0 ± 0.0
Glu | 4.446 ± 1.255 | 1.778 ± 0.72 | 3.735 ± 1.14 | 4.446 ± 1.173 | 2.134 ± 0.538 | 2.668 ± 0.623 | 0.534 ± 0.263 | 4.268 ± 1.063 | 5.335 ± 1.239 | 4.98 ± 1.053 | 1.423 ± 0.558 | 2.668 ± 0.748 | 2.49 ± 0.63 | 0.711 ± 0.399 | 1.601 ± 0.505 | 1.601 ± 0.439 | 2.312 ± 0.469 | 6.047 ± 1.495 | 1.245 ± 0.441 | 3.023 ± 0.777 | 0.0 ± 0.0
Phe | 4.446 ± 0.966 | 0.711 ± 0.364 | 3.023 ± 0.669 | 1.601 ± 0.616 | 1.601 ± 0.692 | 3.201 ± 0.63 | 0.356 ± 0.235 | 2.134 ± 0.75 | 1.956 ± 0.731 | 3.023 ± 0.998 | 0.356 ± 0.248 | 0.711 ± 0.311 | 0.711 ± 0.347 | 0.356 ± 0.264 | 3.201 ± 0.783 | 3.023 ± 0.714 | 1.423 ± 0.655 | 4.98 ± 0.777 | 0.534 ± 0.313 | 1.423 ± 0.446 | 0.0 ± 0.0
Gly | 5.157 ± 1.059 | 1.601 ± 0.675 | 4.446 ± 0.887 | 4.624 ± 0.953 | 3.023 ± 0.835 | 9.781 ± 2.841 | 0.889 ± 0.395 | 6.758 ± 1.118 | 3.379 ± 0.907 | 3.735 ± 0.808 | 1.956 ± 0.575 | 3.557 ± 0.919 | 0.889 ± 0.348 | 2.49 ± 0.661 | 3.735 ± 0.753 | 5.513 ± 1.31 | 4.268 ± 1.244 | 6.224 ± 1.106 | 1.245 ± 0.404 | 6.224 ± 1.297 | 0.0 ± 0.0
His | 0.534 ± 0.298 | 0.356 ± 0.268 | 1.067 ± 0.581 | 0.711 ± 0.403 | 0.534 ± 0.303 | 0.178 ± 0.16 | 0.178 ± 0.16 | 0.889 ± 0.442 | 1.067 ± 0.472 | 0.534 ± 0.341 | 0.0 ± 0.0 | 1.067 ± 0.378 | 0.711 ± 0.238 | 0.356 ± 0.228 | 1.245 ± 0.496 | 0.711 ± 0.327 | 0.534 ± 0.266 | 1.423 ± 0.447 | 0.0 ± 0.0 | 0.356 ± 0.249 | 0.0 ± 0.0
Ile | 6.047 ± 0.914 | 0.889 ± 0.369 | 5.157 ± 0.832 | 5.869 ± 0.938 | 2.668 ± 0.791 | 3.023 ± 0.716 | 0.711 ± 0.401 | 4.09 ± 0.879 | 2.312 ± 0.672 | 4.802 ± 0.723 | 1.423 ± 0.393 | 1.956 ± 0.621 | 2.668 ± 0.58 | 0.534 ± 0.385 | 4.446 ± 0.81 | 4.446 ± 0.891 | 3.557 ± 0.741 | 6.936 ± 1.083 | 0.534 ± 0.32 | 3.735 ± 0.771 | 0.0 ± 0.0
Lys | 3.379 ± 0.602 | 0.534 ± 0.263 | 3.023 ± 0.706 | 3.023 ± 1.036 | 2.312 ± 0.624 | 2.312 ± 0.774 | 0.889 ± 0.388 | 4.446 ± 0.783 | 1.601 ± 0.538 | 4.802 ± 0.967 | 1.778 ± 0.584 | 1.423 ± 0.457 | 2.312 ± 0.631 | 1.601 ± 0.491 | 3.023 ± 0.778 | 2.134 ± 0.782 | 3.735 ± 0.995 | 6.402 ± 1.302 | 1.245 ± 0.496 | 3.201 ± 0.814 | 0.0 ± 0.0
Leu | 3.735 ± 0.831 | 1.067 ± 0.508 | 4.09 ± 0.826 | 3.735 ± 0.942 | 2.845 ± 0.794 | 5.691 ± 1.217 | 0.711 ± 0.383 | 4.802 ± 1.189 | 3.557 ± 0.83 | 7.291 ± 1.295 | 3.379 ± 0.78 | 1.601 ± 0.639 | 2.845 ± 0.679 | 0.534 ± 0.289 | 5.157 ± 0.962 | 5.513 ± 1.081 | 3.735 ± 0.762 | 6.402 ± 1.073 | 1.956 ± 0.671 | 4.09 ± 0.688 | 0.0 ± 0.0
Met | 2.134 ± 0.591 | 1.245 ± 0.442 | 1.423 ± 0.531 | 1.601 ± 0.581 | 1.067 ± 0.466 | 1.423 ± 0.401 | 0.711 ± 0.396 | 1.956 ± 0.642 | 1.956 ± 0.624 | 1.423 ± 0.531 | 1.067 ± 0.401 | 0.534 ± 0.328 | 1.601 ± 0.634 | 0.534 ± 0.28 | 1.601 ± 0.536 | 3.023 ± 0.597 | 1.423 ± 0.374 | 2.312 ± 0.521 | 0.711 ± 0.336 | 2.49 ± 0.607 | 0.0 ± 0.0
Asn | 1.778 ± 0.567 | 0.356 ± 0.234 | 1.778 ± 0.505 | 1.601 ± 0.449 | 0.711 ± 0.274 | 3.913 ± 0.98 | 0.356 ± 0.234 | 1.601 ± 0.506 | 2.49 ± 0.669 | 2.134 ± 0.837 | 0.711 ± 0.367 | 1.067 ± 0.38 | 1.778 ± 0.518 | 0.534 ± 0.312 | 1.245 ± 0.5 | 1.956 ± 0.625 | 1.778 ± 0.678 | 5.869 ± 1.058 | 0.534 ± 0.281 | 2.312 ± 0.983 | 0.0 ± 0.0
Pro | 2.312 ± 0.598 | 0.534 ± 0.304 | 1.423 ± 0.439 | 1.423 ± 0.619 | 0.889 ± 0.347 | 2.845 ± 0.859 | 0.356 ± 0.271 | 1.956 ± 0.538 | 2.134 ± 0.554 | 3.201 ± 0.768 | 1.245 ± 0.436 | 1.245 ± 0.462 | 1.067 ± 0.364 | 0.711 ± 0.321 | 1.423 ± 0.628 | 1.245 ± 0.57 | 1.778 ± 0.601 | 3.023 ± 0.736 | 0.534 ± 0.271 | 1.423 ± 0.53 | 0.0 ± 0.0
Gln | 1.245 ± 0.552 | 0.356 ± 0.261 | 0.356 ± 0.22 | 1.067 ± 0.482 | 1.067 ± 0.418 | 0.534 ± 0.311 | 0.534 ± 0.232 | 1.601 ± 0.531 | 1.067 ± 0.415 | 2.134 ± 0.64 | 0.534 ± 0.284 | 0.178 ± 0.167 | 0.889 ± 0.499 | 0.178 ± 0.177 | 0.534 ± 0.323 | 1.245 ± 0.491 | 1.956 ± 0.645 | 1.601 ± 0.469 | 0.178 ± 0.177 | 1.778 ± 0.683 | 0.0 ± 0.0
Arg | 3.735 ± 1.008 | 2.134 ± 0.445 | 3.201 ± 0.82 | 4.268 ± 0.974 | 1.956 ± 0.559 | 5.335 ± 1.074 | 0.711 ± 0.359 | 3.557 ± 0.785 | 3.379 ± 0.849 | 3.735 ± 0.699 | 1.956 ± 0.471 | 1.423 ± 0.485 | 1.067 ± 0.44 | 1.067 ± 0.382 | 5.513 ± 1.108 | 2.49 ± 0.564 | 1.956 ± 0.594 | 5.691 ± 0.928 | 1.245 ± 0.408 | 4.09 ± 0.807 | 0.0 ± 0.0
Ser | 4.268 ± 0.99 | 0.534 ± 0.298 | 4.09 ± 0.855 | 2.49 ± 0.7 | 2.845 ± 0.782 | 7.291 ± 1.499 | 0.711 ± 0.382 | 2.668 ± 0.711 | 3.201 ± 0.694 | 3.913 ± 0.784 | 1.601 ± 0.489 | 2.668 ± 0.584 | 1.245 ± 0.352 | 1.601 ± 0.581 | 2.49 ± 0.647 | 3.023 ± 0.578 | 3.023 ± 0.986 | 8.181 ± 1.153 | 1.245 ± 0.424 | 2.49 ± 0.833 | 0.0 ± 0.0
Thr | 1.778 ± 0.505 | 0.711 ± 0.399 | 3.735 ± 0.86 | 2.49 ± 0.569 | 1.601 ± 0.518 | 3.913 ± 1.103 | 0.711 ± 0.431 | 3.379 ± 0.804 | 2.49 ± 0.712 | 4.624 ± 0.801 | 1.067 ± 0.402 | 1.245 ± 0.365 | 1.601 ± 0.524 | 1.956 ± 0.787 | 2.312 ± 0.763 | 3.201 ± 0.82 | 3.023 ± 1.008 | 6.402 ± 1.249 | 2.312 ± 0.946 | 2.845 ± 1.069 | 0.0 ± 0.0
Val | 8.892 ± 1.451 | 1.067 ± 0.479 | 9.426 ± 1.085 | 5.691 ± 1.118 | 5.157 ± 1.331 | 8.536 ± 1.185 | 0.889 ± 0.359 | 5.869 ± 0.977 | 6.047 ± 1.155 | 8.536 ± 1.17 | 3.023 ± 0.782 | 4.268 ± 0.792 | 2.49 ± 0.56 | 2.668 ± 0.805 | 6.047 ± 1.148 | 6.402 ± 1.078 | 5.157 ± 1.15 | 14.227 ± 1.726 | 1.956 ± 0.538 | 7.469 ± 0.97 | 0.0 ± 0.0
Trp | 0.889 ± 0.4 | 0.356 ± 0.223 | 1.423 ± 0.585 | 1.423 ± 0.521 | 1.245 ± 0.436 | 1.778 ± 0.631 | 0.711 ± 0.32 | 1.956 ± 0.629 | 0.356 ± 0.238 | 1.423 ± 0.414 | 0.534 ± 0.291 | 1.067 ± 0.439 | 0.0 ± 0.0 | 0.356 ± 0.238 | 0.711 ± 0.419 | 0.534 ± 0.392 | 1.067 ± 0.452 | 1.956 ± 0.568 | 0.711 ± 0.403 | 2.49 ± 1.123 | 0.0 ± 0.0
Tyr | 3.557 ± 0.85 | 0.534 ± 0.306 | 3.023 ± 0.637 | 2.668 ± 0.686 | 1.423 ± 0.468 | 4.98 ± 1.164 | 0.889 ± 0.329 | 5.157 ± 0.855 | 3.557 ± 1.001 | 2.312 ± 0.504 | 2.134 ± 0.64 | 3.379 ± 0.745 | 0.889 ± 0.388 | 1.067 ± 0.497 | 4.09 ± 0.931 | 3.201 ± 0.57 | 3.557 ± 1.288 | 7.291 ± 0.81 | 1.423 ± 0.425 | 3.557 ± 0.895 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 39 proteins (5624 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
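Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is a minimal sketch under stated assumptions, not the database's implementation: the helper names, the fixed seed, the toy sequences, and the use of the sample standard deviation as the "±" value are all choices made for this example.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences):
    """Overlapping dipeptide frequencies in per mille (hypothetical helper)."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Spread of one dipeptide's frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(resample).get(pair, 0.0))
    # Assumption: report the sample standard deviation of the replicates.
    return statistics.stdev(estimates)


# Invented toy sequences (one-letter codes), not taken from the proteome.
toy = ["MAVVA", "AVG", "GGSAV"]
err_av = bootstrap_error(toy, "AV")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the estimate reflects variation between proteins.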

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski