Amino acid dipepetide frequency for Simian foamy virus (isolate chimpanzee) (SFVcpz)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.553AlaAla: 3.553 ± 0.611
0.508AlaCys: 0.508 ± 0.225
1.523AlaAsp: 1.523 ± 0.248
5.33AlaGlu: 5.33 ± 1.283
1.269AlaPhe: 1.269 ± 0.4
3.553AlaGly: 3.553 ± 0.551
2.538AlaHis: 2.538 ± 0.639
1.777AlaIle: 1.777 ± 0.55
1.523AlaLys: 1.523 ± 0.792
6.345AlaLeu: 6.345 ± 0.929
1.777AlaMet: 1.777 ± 0.766
3.299AlaAsn: 3.299 ± 0.39
3.807AlaPro: 3.807 ± 1.62
1.015AlaGln: 1.015 ± 0.362
1.777AlaArg: 1.777 ± 0.506
4.569AlaSer: 4.569 ± 0.631
5.33AlaThr: 5.33 ± 0.698
3.807AlaVal: 3.807 ± 1.169
1.015AlaTrp: 1.015 ± 0.848
2.03AlaTyr: 2.03 ± 1.082
0.0AlaXaa: 0.0 ± 0.0
Cys
0.508CysAla: 0.508 ± 0.322
0.761CysCys: 0.761 ± 0.41
1.269CysAsp: 1.269 ± 0.692
0.254CysGlu: 0.254 ± 0.212
1.015CysPhe: 1.015 ± 0.482
0.508CysGly: 0.508 ± 0.322
0.0CysHis: 0.0 ± 0.0
1.015CysIle: 1.015 ± 0.279
1.523CysLys: 1.523 ± 0.373
1.523CysLeu: 1.523 ± 0.491
0.0CysMet: 0.0 ± 0.0
0.761CysAsn: 0.761 ± 0.396
1.015CysPro: 1.015 ± 0.426
1.269CysGln: 1.269 ± 0.826
1.523CysArg: 1.523 ± 0.589
0.761CysSer: 0.761 ± 0.454
1.269CysThr: 1.269 ± 0.389
0.761CysVal: 0.761 ± 0.407
0.254CysTrp: 0.254 ± 0.186
0.761CysTyr: 0.761 ± 0.636
0.0CysXaa: 0.0 ± 0.0
Asp
1.269AspAla: 1.269 ± 0.506
1.523AspCys: 1.523 ± 0.781
1.777AspAsp: 1.777 ± 0.344
2.03AspGlu: 2.03 ± 0.543
0.508AspPhe: 0.508 ± 0.424
2.284AspGly: 2.284 ± 0.805
1.523AspHis: 1.523 ± 0.373
3.046AspIle: 3.046 ± 0.539
2.284AspLys: 2.284 ± 0.459
4.822AspLeu: 4.822 ± 1.492
0.254AspMet: 0.254 ± 0.302
2.03AspAsn: 2.03 ± 0.651
3.807AspPro: 3.807 ± 1.542
2.792AspGln: 2.792 ± 0.935
1.777AspArg: 1.777 ± 0.55
4.315AspSer: 4.315 ± 1.403
2.03AspThr: 2.03 ± 0.559
3.553AspVal: 3.553 ± 0.812
1.015AspTrp: 1.015 ± 0.478
2.538AspTyr: 2.538 ± 0.583
0.0AspXaa: 0.0 ± 0.0
Glu
2.538GluAla: 2.538 ± 0.308
1.015GluCys: 1.015 ± 0.594
2.538GluAsp: 2.538 ± 0.729
7.868GluGlu: 7.868 ± 3.48
1.015GluPhe: 1.015 ± 0.519
5.33GluGly: 5.33 ± 0.626
0.761GluHis: 0.761 ± 0.316
5.838GluIle: 5.838 ± 1.227
2.792GluLys: 2.792 ± 1.062
4.315GluLeu: 4.315 ± 0.709
2.284GluMet: 2.284 ± 1.063
3.046GluAsn: 3.046 ± 0.863
3.553GluPro: 3.553 ± 0.902
3.046GluGln: 3.046 ± 0.415
4.569GluArg: 4.569 ± 0.663
2.284GluSer: 2.284 ± 0.851
2.03GluThr: 2.03 ± 0.862
3.807GluVal: 3.807 ± 1.202
0.761GluTrp: 0.761 ± 0.354
1.523GluTyr: 1.523 ± 0.493
0.0GluXaa: 0.0 ± 0.0
Phe
3.299PheAla: 3.299 ± 0.549
0.761PheCys: 0.761 ± 0.316
0.508PheAsp: 0.508 ± 0.275
0.761PheGlu: 0.761 ± 0.454
0.254PhePhe: 0.254 ± 0.186
1.523PheGly: 1.523 ± 0.31
1.523PheHis: 1.523 ± 0.812
2.03PheIle: 2.03 ± 0.703
2.284PheLys: 2.284 ± 0.64
2.03PheLeu: 2.03 ± 1.002
0.0PheMet: 0.0 ± 0.0
0.254PheAsn: 0.254 ± 0.186
0.761PhePro: 0.761 ± 0.422
1.269PheGln: 1.269 ± 0.515
0.0PheArg: 0.0 ± 0.0
1.777PheSer: 1.777 ± 0.608
2.284PheThr: 2.284 ± 0.718
1.015PheVal: 1.015 ± 0.367
1.523PheTrp: 1.523 ± 0.623
1.269PheTyr: 1.269 ± 0.361
0.0PheXaa: 0.0 ± 0.0
Gly
2.792GlyAla: 2.792 ± 0.671
1.015GlyCys: 1.015 ± 0.884
4.569GlyAsp: 4.569 ± 1.494
3.046GlyGlu: 3.046 ± 1.158
2.03GlyPhe: 2.03 ± 0.914
2.538GlyGly: 2.538 ± 1.495
2.03GlyHis: 2.03 ± 0.379
3.807GlyIle: 3.807 ± 1.038
3.046GlyLys: 3.046 ± 1.068
4.822GlyLeu: 4.822 ± 1.039
1.015GlyMet: 1.015 ± 0.325
3.807GlyAsn: 3.807 ± 1.234
5.584GlyPro: 5.584 ± 2.115
5.33GlyGln: 5.33 ± 1.991
4.569GlyArg: 4.569 ± 1.962
3.553GlySer: 3.553 ± 0.648
2.03GlyThr: 2.03 ± 0.891
2.284GlyVal: 2.284 ± 0.682
0.761GlyTrp: 0.761 ± 0.477
4.822GlyTyr: 4.822 ± 1.615
0.0GlyXaa: 0.0 ± 0.0
His
1.523HisAla: 1.523 ± 0.599
0.254HisCys: 0.254 ± 0.212
1.015HisAsp: 1.015 ± 0.575
1.523HisGlu: 1.523 ± 0.38
0.0HisPhe: 0.0 ± 0.0
2.03HisGly: 2.03 ± 0.669
1.015HisHis: 1.015 ± 0.666
1.523HisIle: 1.523 ± 0.533
1.777HisLys: 1.777 ± 0.832
2.538HisLeu: 2.538 ± 1.121
0.508HisMet: 0.508 ± 0.322
0.508HisAsn: 0.508 ± 0.371
2.538HisPro: 2.538 ± 0.529
1.269HisGln: 1.269 ± 0.577
1.777HisArg: 1.777 ± 0.713
1.015HisSer: 1.015 ± 0.436
1.777HisThr: 1.777 ± 0.53
1.523HisVal: 1.523 ± 0.792
0.508HisTrp: 0.508 ± 0.371
1.777HisTyr: 1.777 ± 0.713
0.0HisXaa: 0.0 ± 0.0
Ile
4.061IleAla: 4.061 ± 1.092
0.761IleCys: 0.761 ± 0.454
4.315IleAsp: 4.315 ± 0.649
2.284IleGlu: 2.284 ± 1.13
1.269IlePhe: 1.269 ± 0.513
3.299IleGly: 3.299 ± 0.67
2.03IleHis: 2.03 ± 0.619
2.538IleIle: 2.538 ± 1.089
3.299IleLys: 3.299 ± 1.051
5.584IleLeu: 5.584 ± 1.358
0.761IleMet: 0.761 ± 0.41
2.03IleAsn: 2.03 ± 0.734
5.584IlePro: 5.584 ± 1.263
4.822IleGln: 4.822 ± 0.85
3.553IleArg: 3.553 ± 0.656
1.777IleSer: 1.777 ± 0.769
2.792IleThr: 2.792 ± 0.995
3.807IleVal: 3.807 ± 0.901
1.015IleTrp: 1.015 ± 0.426
1.269IleTyr: 1.269 ± 0.306
0.0IleXaa: 0.0 ± 0.0
Lys
3.807LysAla: 3.807 ± 1.068
1.269LysCys: 1.269 ± 0.618
2.284LysAsp: 2.284 ± 0.809
3.807LysGlu: 3.807 ± 1.213
1.523LysPhe: 1.523 ± 0.315
3.046LysGly: 3.046 ± 0.585
2.03LysHis: 2.03 ± 0.919
3.046LysIle: 3.046 ± 0.845
4.315LysLys: 4.315 ± 0.825
3.299LysLeu: 3.299 ± 0.584
0.254LysMet: 0.254 ± 0.212
3.046LysAsn: 3.046 ± 1.068
5.584LysPro: 5.584 ± 1.676
4.061LysGln: 4.061 ± 1.682
3.807LysArg: 3.807 ± 1.696
3.807LysSer: 3.807 ± 0.748
3.046LysThr: 3.046 ± 1.165
5.584LysVal: 5.584 ± 1.552
1.015LysTrp: 1.015 ± 0.451
2.03LysTyr: 2.03 ± 0.754
0.0LysXaa: 0.0 ± 0.0
Leu
5.584LeuAla: 5.584 ± 0.724
1.523LeuCys: 1.523 ± 1.081
4.061LeuAsp: 4.061 ± 0.543
5.076LeuGlu: 5.076 ± 0.957
2.284LeuPhe: 2.284 ± 0.323
6.345LeuGly: 6.345 ± 0.813
1.269LeuHis: 1.269 ± 0.667
2.792LeuIle: 2.792 ± 0.903
6.599LeuLys: 6.599 ± 1.717
8.376LeuLeu: 8.376 ± 2.516
1.269LeuMet: 1.269 ± 0.512
5.584LeuAsn: 5.584 ± 1.44
5.838LeuPro: 5.838 ± 0.937
7.868LeuGln: 7.868 ± 0.849
3.553LeuArg: 3.553 ± 1.106
4.822LeuSer: 4.822 ± 0.682
5.838LeuThr: 5.838 ± 1.226
6.599LeuVal: 6.599 ± 1.617
0.508LeuTrp: 0.508 ± 0.225
3.553LeuTyr: 3.553 ± 0.627
0.0LeuXaa: 0.0 ± 0.0
Met
1.523MetAla: 1.523 ± 0.403
0.0MetCys: 0.0 ± 0.0
1.015MetAsp: 1.015 ± 0.462
1.269MetGlu: 1.269 ± 0.513
0.761MetPhe: 0.761 ± 0.319
1.777MetGly: 1.777 ± 0.672
0.761MetHis: 0.761 ± 0.396
1.269MetIle: 1.269 ± 0.667
0.761MetLys: 0.761 ± 0.41
1.015MetLeu: 1.015 ± 0.367
0.254MetMet: 0.254 ± 0.212
0.761MetAsn: 0.761 ± 0.354
0.761MetPro: 0.761 ± 0.319
0.761MetGln: 0.761 ± 0.422
0.254MetArg: 0.254 ± 0.186
1.777MetSer: 1.777 ± 1.002
2.03MetThr: 2.03 ± 0.246
0.761MetVal: 0.761 ± 0.311
0.0MetTrp: 0.0 ± 0.0
0.761MetTyr: 0.761 ± 0.39
0.0MetXaa: 0.0 ± 0.0
Asn
3.553AsnAla: 3.553 ± 0.491
0.508AsnCys: 0.508 ± 0.341
1.777AsnAsp: 1.777 ± 0.806
2.792AsnGlu: 2.792 ± 1.03
2.538AsnPhe: 2.538 ± 0.779
1.777AsnGly: 1.777 ± 0.327
0.508AsnHis: 0.508 ± 0.424
3.553AsnIle: 3.553 ± 0.986
2.538AsnLys: 2.538 ± 1.128
3.807AsnLeu: 3.807 ± 1.06
1.269AsnMet: 1.269 ± 0.448
3.046AsnAsn: 3.046 ± 1.193
4.822AsnPro: 4.822 ± 1.285
2.792AsnGln: 2.792 ± 0.921
1.015AsnArg: 1.015 ± 0.367
3.046AsnSer: 3.046 ± 1.101
3.807AsnThr: 3.807 ± 1.115
2.792AsnVal: 2.792 ± 0.892
0.508AsnTrp: 0.508 ± 0.424
1.523AsnTyr: 1.523 ± 0.31
0.0AsnXaa: 0.0 ± 0.0
Pro
4.061ProAla: 4.061 ± 0.558
0.254ProCys: 0.254 ± 0.212
3.046ProAsp: 3.046 ± 1.194
4.569ProGlu: 4.569 ± 1.128
3.046ProPhe: 3.046 ± 1.18
3.046ProGly: 3.046 ± 1.299
3.046ProHis: 3.046 ± 0.71
3.553ProIle: 3.553 ± 1.274
4.315ProLys: 4.315 ± 0.972
6.853ProLeu: 6.853 ± 0.877
1.269ProMet: 1.269 ± 0.467
2.284ProAsn: 2.284 ± 0.53
5.076ProPro: 5.076 ± 1.272
3.807ProGln: 3.807 ± 0.675
5.584ProArg: 5.584 ± 1.909
6.853ProSer: 6.853 ± 1.642
3.299ProThr: 3.299 ± 0.538
6.599ProVal: 6.599 ± 1.098
0.761ProTrp: 0.761 ± 0.39
3.046ProTyr: 3.046 ± 0.539
0.0ProXaa: 0.0 ± 0.0
Gln
2.03GlnAla: 2.03 ± 0.279
0.761GlnCys: 0.761 ± 0.227
3.299GlnAsp: 3.299 ± 1.051
5.076GlnGlu: 5.076 ± 0.815
1.777GlnPhe: 1.777 ± 1.248
5.838GlnGly: 5.838 ± 0.641
1.777GlnHis: 1.777 ± 0.53
2.03GlnIle: 2.03 ± 0.616
3.807GlnLys: 3.807 ± 0.758
5.584GlnLeu: 5.584 ± 1.785
1.269GlnMet: 1.269 ± 0.583
3.553GlnAsn: 3.553 ± 0.786
2.538GlnPro: 2.538 ± 0.842
3.046GlnGln: 3.046 ± 1.323
2.792GlnArg: 2.792 ± 0.9
3.807GlnSer: 3.807 ± 1.239
3.553GlnThr: 3.553 ± 0.765
2.284GlnVal: 2.284 ± 0.64
0.761GlnTrp: 0.761 ± 0.396
2.284GlnTyr: 2.284 ± 0.487
0.0GlnXaa: 0.0 ± 0.0
Arg
3.553ArgAla: 3.553 ± 1.335
0.761ArgCys: 0.761 ± 0.342
2.284ArgAsp: 2.284 ± 0.819
2.284ArgGlu: 2.284 ± 0.365
0.761ArgPhe: 0.761 ± 0.422
5.076ArgGly: 5.076 ± 2.41
1.015ArgHis: 1.015 ± 0.549
1.777ArgIle: 1.777 ± 0.734
2.792ArgLys: 2.792 ± 0.616
3.807ArgLeu: 3.807 ± 0.473
1.015ArgMet: 1.015 ± 0.31
2.792ArgAsn: 2.792 ± 1.401
5.584ArgPro: 5.584 ± 1.253
1.523ArgGln: 1.523 ± 0.455
4.569ArgArg: 4.569 ± 1.633
3.299ArgSer: 3.299 ± 1.319
2.284ArgThr: 2.284 ± 0.794
2.03ArgVal: 2.03 ± 0.669
2.03ArgTrp: 2.03 ± 0.764
2.284ArgTyr: 2.284 ± 0.971
0.0ArgXaa: 0.0 ± 0.0
Ser
3.807SerAla: 3.807 ± 1.104
1.269SerCys: 1.269 ± 0.758
3.553SerAsp: 3.553 ± 0.991
2.538SerGlu: 2.538 ± 0.978
2.03SerPhe: 2.03 ± 0.498
7.36SerGly: 7.36 ± 2.422
1.523SerHis: 1.523 ± 0.248
5.33SerIle: 5.33 ± 1.134
2.792SerLys: 2.792 ± 0.716
5.33SerLeu: 5.33 ± 1.086
1.015SerMet: 1.015 ± 0.5
2.03SerAsn: 2.03 ± 0.912
4.061SerPro: 4.061 ± 1.086
3.807SerGln: 3.807 ± 1.084
3.299SerArg: 3.299 ± 0.992
6.599SerSer: 6.599 ± 1.34
4.315SerThr: 4.315 ± 0.768
1.015SerVal: 1.015 ± 0.436
1.523SerTrp: 1.523 ± 0.391
3.046SerTyr: 3.046 ± 0.869
0.0SerXaa: 0.0 ± 0.0
Thr
3.553ThrAla: 3.553 ± 1.163
2.03ThrCys: 2.03 ± 0.997
2.284ThrAsp: 2.284 ± 0.826
2.792ThrGlu: 2.792 ± 0.563
1.777ThrPhe: 1.777 ± 0.537
1.777ThrGly: 1.777 ± 0.608
0.508ThrHis: 0.508 ± 0.424
2.538ThrIle: 2.538 ± 0.9
4.569ThrLys: 4.569 ± 1.707
5.33ThrLeu: 5.33 ± 0.697
1.523ThrMet: 1.523 ± 0.527
1.777ThrAsn: 1.777 ± 0.678
6.345ThrPro: 6.345 ± 1.329
2.538ThrGln: 2.538 ± 0.582
3.807ThrArg: 3.807 ± 0.92
5.838ThrSer: 5.838 ± 0.709
3.046ThrThr: 3.046 ± 0.587
3.299ThrVal: 3.299 ± 0.572
2.03ThrTrp: 2.03 ± 0.764
2.284ThrTyr: 2.284 ± 0.682
0.0ThrXaa: 0.0 ± 0.0
Val
2.284ValAla: 2.284 ± 0.68
0.508ValCys: 0.508 ± 0.424
1.523ValAsp: 1.523 ± 0.399
2.792ValGlu: 2.792 ± 0.773
1.269ValPhe: 1.269 ± 0.577
2.792ValGly: 2.792 ± 0.554
0.761ValHis: 0.761 ± 0.316
6.091ValIle: 6.091 ± 0.335
4.822ValLys: 4.822 ± 1.002
7.614ValLeu: 7.614 ± 0.991
0.508ValMet: 0.508 ± 0.424
3.807ValAsn: 3.807 ± 0.797
3.299ValPro: 3.299 ± 0.538
2.538ValGln: 2.538 ± 0.895
1.269ValArg: 1.269 ± 0.419
3.299ValSer: 3.299 ± 0.813
4.569ValThr: 4.569 ± 1.369
4.061ValVal: 4.061 ± 0.617
2.284ValTrp: 2.284 ± 1.244
3.299ValTyr: 3.299 ± 1.117
0.0ValXaa: 0.0 ± 0.0
Trp
0.761TrpAla: 0.761 ± 0.354
0.508TrpCys: 0.508 ± 0.442
1.015TrpAsp: 1.015 ± 0.279
2.03TrpGlu: 2.03 ± 0.723
0.0TrpPhe: 0.0 ± 0.0
0.254TrpGly: 0.254 ± 0.212
0.254TrpHis: 0.254 ± 0.186
1.523TrpIle: 1.523 ± 0.522
1.777TrpLys: 1.777 ± 0.832
1.777TrpLeu: 1.777 ± 0.327
1.015TrpMet: 1.015 ± 0.356
1.777TrpAsn: 1.777 ± 0.753
1.269TrpPro: 1.269 ± 0.361
0.508TrpGln: 0.508 ± 0.371
0.508TrpArg: 0.508 ± 0.225
1.015TrpSer: 1.015 ± 0.451
1.523TrpThr: 1.523 ± 0.373
0.761TrpVal: 0.761 ± 0.54
1.015TrpTrp: 1.015 ± 0.404
0.508TrpTyr: 0.508 ± 0.225
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.284TyrAla: 2.284 ± 1.23
0.761TyrCys: 0.761 ± 0.39
1.523TyrAsp: 1.523 ± 0.399
2.792TyrGlu: 2.792 ± 0.586
0.0TyrPhe: 0.0 ± 0.0
3.299TyrGly: 3.299 ± 1.648
1.523TyrHis: 1.523 ± 0.599
1.777TyrIle: 1.777 ± 0.608
3.046TyrLys: 3.046 ± 0.901
4.569TyrLeu: 4.569 ± 0.903
0.761TyrMet: 0.761 ± 0.41
1.777TyrAsn: 1.777 ± 0.608
2.538TyrPro: 2.538 ± 0.549
3.807TyrGln: 3.807 ± 0.778
1.523TyrArg: 1.523 ± 0.812
2.03TyrSer: 2.03 ± 0.676
2.792TyrThr: 2.792 ± 0.531
3.046TyrVal: 3.046 ± 1.189
0.761TyrTrp: 0.761 ± 0.557
3.299TyrTyr: 3.299 ± 0.93
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3941 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski