Amino acid dipepetide frequency for Tupaia virus (isolate Tupaia/Thailand/-/1986) (TUPV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.042AlaAla: 2.042 ± 0.511
0.51AlaCys: 0.51 ± 0.25
2.552AlaAsp: 2.552 ± 0.471
3.573AlaGlu: 3.573 ± 0.452
2.297AlaPhe: 2.297 ± 1.368
2.297AlaGly: 2.297 ± 0.864
1.021AlaHis: 1.021 ± 0.267
3.063AlaIle: 3.063 ± 0.776
2.297AlaLys: 2.297 ± 1.594
6.126AlaLeu: 6.126 ± 0.818
0.766AlaMet: 0.766 ± 0.236
0.766AlaAsn: 0.766 ± 0.337
3.063AlaPro: 3.063 ± 1.013
1.276AlaGln: 1.276 ± 0.429
2.808AlaArg: 2.808 ± 0.537
1.787AlaSer: 1.787 ± 0.651
3.828AlaThr: 3.828 ± 1.336
1.531AlaVal: 1.531 ± 0.91
1.021AlaTrp: 1.021 ± 0.636
1.787AlaTyr: 1.787 ± 0.598
0.0AlaXaa: 0.0 ± 0.0
Cys
0.766CysAla: 0.766 ± 0.744
1.021CysCys: 1.021 ± 0.846
0.255CysAsp: 0.255 ± 0.33
0.766CysGlu: 0.766 ± 0.337
0.51CysPhe: 0.51 ± 0.293
1.531CysGly: 1.531 ± 0.239
0.766CysHis: 0.766 ± 0.507
0.51CysIle: 0.51 ± 0.293
1.531CysLys: 1.531 ± 0.838
2.042CysLeu: 2.042 ± 0.4
0.766CysMet: 0.766 ± 0.71
0.51CysAsn: 0.51 ± 0.293
0.766CysPro: 0.766 ± 0.374
0.255CysGln: 0.255 ± 0.33
0.51CysArg: 0.51 ± 0.236
1.787CysSer: 1.787 ± 0.696
0.0CysThr: 0.0 ± 0.0
1.276CysVal: 1.276 ± 0.309
0.51CysTrp: 0.51 ± 0.293
0.766CysTyr: 0.766 ± 0.236
0.0CysXaa: 0.0 ± 0.0
Asp
2.808AspAla: 2.808 ± 1.204
0.51AspCys: 0.51 ± 0.468
5.615AspAsp: 5.615 ± 1.628
3.063AspGlu: 3.063 ± 1.007
2.808AspPhe: 2.808 ± 0.613
2.042AspGly: 2.042 ± 0.677
0.51AspHis: 0.51 ± 0.342
1.276AspIle: 1.276 ± 0.348
2.297AspLys: 2.297 ± 0.742
7.402AspLeu: 7.402 ± 1.09
1.787AspMet: 1.787 ± 0.99
2.042AspAsn: 2.042 ± 1.045
4.339AspPro: 4.339 ± 0.837
2.042AspGln: 2.042 ± 1.09
2.042AspArg: 2.042 ± 0.508
4.084AspSer: 4.084 ± 1.697
1.787AspThr: 1.787 ± 0.706
2.042AspVal: 2.042 ± 0.596
1.531AspTrp: 1.531 ± 0.586
2.808AspTyr: 2.808 ± 1.347
0.0AspXaa: 0.0 ± 0.0
Glu
3.318GluAla: 3.318 ± 0.869
2.042GluCys: 2.042 ± 0.442
4.849GluAsp: 4.849 ± 1.253
5.105GluGlu: 5.105 ± 1.691
3.063GluPhe: 3.063 ± 0.785
3.573GluGly: 3.573 ± 0.894
1.021GluHis: 1.021 ± 0.267
4.594GluIle: 4.594 ± 0.993
2.552GluLys: 2.552 ± 0.837
7.147GluLeu: 7.147 ± 1.174
1.531GluMet: 1.531 ± 0.414
1.021GluAsn: 1.021 ± 0.319
2.042GluPro: 2.042 ± 0.441
1.531GluGln: 1.531 ± 0.735
3.318GluArg: 3.318 ± 0.877
4.594GluSer: 4.594 ± 1.062
4.339GluThr: 4.339 ± 2.328
2.042GluVal: 2.042 ± 0.912
0.766GluTrp: 0.766 ± 0.367
2.552GluTyr: 2.552 ± 0.813
0.0GluXaa: 0.0 ± 0.0
Phe
1.276PheAla: 1.276 ± 0.531
1.021PheCys: 1.021 ± 0.47
3.063PheAsp: 3.063 ± 0.47
3.318PheGlu: 3.318 ± 0.718
3.063PhePhe: 3.063 ± 0.744
2.808PheGly: 2.808 ± 0.904
1.021PheHis: 1.021 ± 0.664
2.552PheIle: 2.552 ± 1.18
3.573PheLys: 3.573 ± 0.768
6.381PheLeu: 6.381 ± 2.561
1.021PheMet: 1.021 ± 0.502
2.297PheAsn: 2.297 ± 0.613
2.552PhePro: 2.552 ± 0.437
1.787PheGln: 1.787 ± 0.645
2.042PheArg: 2.042 ± 0.708
5.105PheSer: 5.105 ± 1.057
2.042PheThr: 2.042 ± 0.533
4.084PheVal: 4.084 ± 1.386
1.276PheTrp: 1.276 ± 0.348
0.51PheTyr: 0.51 ± 0.293
0.0PheXaa: 0.0 ± 0.0
Gly
2.808GlyAla: 2.808 ± 0.627
1.021GlyCys: 1.021 ± 0.314
3.573GlyAsp: 3.573 ± 0.947
2.042GlyGlu: 2.042 ± 0.834
2.297GlyPhe: 2.297 ± 0.864
4.849GlyGly: 4.849 ± 1.738
1.021GlyHis: 1.021 ± 0.592
4.849GlyIle: 4.849 ± 0.613
2.808GlyLys: 2.808 ± 0.61
7.402GlyLeu: 7.402 ± 2.019
0.766GlyMet: 0.766 ± 0.374
3.828GlyAsn: 3.828 ± 1.044
1.531GlyPro: 1.531 ± 1.036
3.318GlyGln: 3.318 ± 0.867
3.063GlyArg: 3.063 ± 0.985
7.402GlySer: 7.402 ± 1.362
1.021GlyThr: 1.021 ± 0.823
2.808GlyVal: 2.808 ± 0.666
1.276GlyTrp: 1.276 ± 0.902
2.808GlyTyr: 2.808 ± 0.73
0.0GlyXaa: 0.0 ± 0.0
His
0.255HisAla: 0.255 ± 0.146
0.0HisCys: 0.0 ± 0.0
0.766HisAsp: 0.766 ± 0.454
1.531HisGlu: 1.531 ± 0.868
1.787HisPhe: 1.787 ± 0.551
0.766HisGly: 0.766 ± 0.331
1.021HisHis: 1.021 ± 0.314
1.276HisIle: 1.276 ± 0.545
0.255HisLys: 0.255 ± 0.33
3.318HisLeu: 3.318 ± 0.351
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.787HisPro: 1.787 ± 0.462
0.766HisGln: 0.766 ± 0.457
2.297HisArg: 2.297 ± 0.832
1.531HisSer: 1.531 ± 0.682
0.766HisThr: 0.766 ± 0.367
1.787HisVal: 1.787 ± 0.449
1.021HisTrp: 1.021 ± 0.314
1.021HisTyr: 1.021 ± 0.39
0.0HisXaa: 0.0 ± 0.0
Ile
1.531IleAla: 1.531 ± 0.601
1.021IleCys: 1.021 ± 0.586
2.808IleAsp: 2.808 ± 0.573
3.063IleGlu: 3.063 ± 0.709
3.063IlePhe: 3.063 ± 0.689
3.828IleGly: 3.828 ± 0.499
1.787IleHis: 1.787 ± 1.481
4.594IleIle: 4.594 ± 1.875
5.105IleLys: 5.105 ± 0.755
5.105IleLeu: 5.105 ± 0.856
0.766IleMet: 0.766 ± 0.439
2.808IleAsn: 2.808 ± 1.003
3.573IlePro: 3.573 ± 0.914
2.297IleGln: 2.297 ± 0.492
3.828IleArg: 3.828 ± 1.299
5.105IleSer: 5.105 ± 0.49
4.084IleThr: 4.084 ± 0.784
4.084IleVal: 4.084 ± 1.371
0.255IleTrp: 0.255 ± 0.37
2.042IleTyr: 2.042 ± 0.764
0.0IleXaa: 0.0 ± 0.0
Lys
2.297LysAla: 2.297 ± 0.719
0.255LysCys: 0.255 ± 0.33
3.318LysAsp: 3.318 ± 1.14
2.552LysGlu: 2.552 ± 0.566
3.063LysPhe: 3.063 ± 1.34
3.573LysGly: 3.573 ± 0.522
1.021LysHis: 1.021 ± 0.342
2.808LysIle: 2.808 ± 0.794
5.615LysLys: 5.615 ± 0.749
6.636LysLeu: 6.636 ± 1.354
0.766LysMet: 0.766 ± 0.402
3.573LysAsn: 3.573 ± 1.298
2.808LysPro: 2.808 ± 0.528
0.255LysGln: 0.255 ± 0.37
3.573LysArg: 3.573 ± 0.818
5.36LysSer: 5.36 ± 0.468
5.105LysThr: 5.105 ± 1.127
3.573LysVal: 3.573 ± 0.506
2.552LysTrp: 2.552 ± 0.618
1.531LysTyr: 1.531 ± 0.644
0.0LysXaa: 0.0 ± 0.0
Leu
5.615LeuAla: 5.615 ± 0.648
2.808LeuCys: 2.808 ± 0.518
4.339LeuAsp: 4.339 ± 1.498
6.891LeuGlu: 6.891 ± 0.88
3.828LeuPhe: 3.828 ± 0.494
8.678LeuGly: 8.678 ± 1.086
1.021LeuHis: 1.021 ± 0.586
10.72LeuIle: 10.72 ± 1.613
5.87LeuLys: 5.87 ± 0.819
9.188LeuLeu: 9.188 ± 1.59
2.297LeuMet: 2.297 ± 0.818
4.594LeuAsn: 4.594 ± 0.483
4.594LeuPro: 4.594 ± 1.172
2.808LeuGln: 2.808 ± 0.665
8.423LeuArg: 8.423 ± 2.608
9.699LeuSer: 9.699 ± 1.653
7.402LeuThr: 7.402 ± 1.719
5.36LeuVal: 5.36 ± 0.907
1.021LeuTrp: 1.021 ± 0.306
2.808LeuTyr: 2.808 ± 0.704
0.0LeuXaa: 0.0 ± 0.0
Met
0.766MetAla: 0.766 ± 0.236
0.255MetCys: 0.255 ± 0.146
0.255MetAsp: 0.255 ± 0.146
2.808MetGlu: 2.808 ± 0.765
1.276MetPhe: 1.276 ± 0.65
0.766MetGly: 0.766 ± 0.236
0.0MetHis: 0.0 ± 0.0
1.787MetIle: 1.787 ± 1.044
1.021MetLys: 1.021 ± 0.861
1.276MetLeu: 1.276 ± 0.453
0.255MetMet: 0.255 ± 0.287
1.021MetAsn: 1.021 ± 0.586
0.255MetPro: 0.255 ± 0.146
0.766MetGln: 0.766 ± 0.439
1.787MetArg: 1.787 ± 1.375
1.787MetSer: 1.787 ± 0.546
2.042MetThr: 2.042 ± 1.037
2.042MetVal: 2.042 ± 1.585
0.0MetTrp: 0.0 ± 0.0
0.51MetTyr: 0.51 ± 0.236
0.0MetXaa: 0.0 ± 0.0
Asn
2.042AsnAla: 2.042 ± 1.121
0.51AsnCys: 0.51 ± 0.293
1.021AsnAsp: 1.021 ± 0.306
1.787AsnGlu: 1.787 ± 0.687
1.787AsnPhe: 1.787 ± 0.508
2.297AsnGly: 2.297 ± 1.026
1.787AsnHis: 1.787 ± 0.551
2.297AsnIle: 2.297 ± 0.622
3.318AsnLys: 3.318 ± 0.755
5.87AsnLeu: 5.87 ± 1.197
1.787AsnMet: 1.787 ± 1.42
1.276AsnAsn: 1.276 ± 0.511
2.808AsnPro: 2.808 ± 0.724
1.531AsnGln: 1.531 ± 0.407
3.063AsnArg: 3.063 ± 0.614
3.318AsnSer: 3.318 ± 0.517
2.042AsnThr: 2.042 ± 0.515
2.042AsnVal: 2.042 ± 0.633
1.531AsnTrp: 1.531 ± 0.471
1.787AsnTyr: 1.787 ± 0.66
0.0AsnXaa: 0.0 ± 0.0
Pro
2.552ProAla: 2.552 ± 0.44
0.255ProCys: 0.255 ± 0.37
2.808ProAsp: 2.808 ± 0.681
4.084ProGlu: 4.084 ± 0.562
2.042ProPhe: 2.042 ± 0.834
2.808ProGly: 2.808 ± 0.414
1.276ProHis: 1.276 ± 0.469
1.531ProIle: 1.531 ± 0.367
3.828ProLys: 3.828 ± 0.541
5.105ProLeu: 5.105 ± 2.095
0.51ProMet: 0.51 ± 0.25
4.084ProAsn: 4.084 ± 1.557
1.531ProPro: 1.531 ± 0.61
1.787ProGln: 1.787 ± 1.383
1.787ProArg: 1.787 ± 0.81
4.084ProSer: 4.084 ± 0.842
2.552ProThr: 2.552 ± 1.034
3.063ProVal: 3.063 ± 0.703
0.766ProTrp: 0.766 ± 0.331
1.021ProTyr: 1.021 ± 0.5
0.0ProXaa: 0.0 ± 0.0
Gln
2.042GlnAla: 2.042 ± 0.487
1.021GlnCys: 1.021 ± 0.319
0.766GlnAsp: 0.766 ± 0.374
1.531GlnGlu: 1.531 ± 0.853
2.042GlnPhe: 2.042 ± 0.535
2.297GlnGly: 2.297 ± 0.782
0.51GlnHis: 0.51 ± 0.293
4.084GlnIle: 4.084 ± 0.866
1.531GlnLys: 1.531 ± 0.471
4.594GlnLeu: 4.594 ± 1.575
1.276GlnMet: 1.276 ± 1.05
1.021GlnAsn: 1.021 ± 0.466
1.531GlnPro: 1.531 ± 1.071
1.531GlnGln: 1.531 ± 1.218
1.531GlnArg: 1.531 ± 0.554
2.808GlnSer: 2.808 ± 0.552
2.042GlnThr: 2.042 ± 0.728
2.297GlnVal: 2.297 ± 0.492
0.255GlnTrp: 0.255 ± 0.314
0.51GlnTyr: 0.51 ± 0.293
0.0GlnXaa: 0.0 ± 0.0
Arg
3.573ArgAla: 3.573 ± 0.769
0.51ArgCys: 0.51 ± 0.25
2.552ArgAsp: 2.552 ± 1.11
3.573ArgGlu: 3.573 ± 0.597
5.36ArgPhe: 5.36 ± 1.983
2.552ArgGly: 2.552 ± 1.204
1.787ArgHis: 1.787 ± 0.779
0.766ArgIle: 0.766 ± 0.518
2.808ArgLys: 2.808 ± 0.822
5.87ArgLeu: 5.87 ± 0.815
1.531ArgMet: 1.531 ± 1.027
4.084ArgAsn: 4.084 ± 0.897
2.042ArgPro: 2.042 ± 0.554
2.297ArgGln: 2.297 ± 0.946
2.808ArgArg: 2.808 ± 0.822
4.849ArgSer: 4.849 ± 1.192
3.573ArgThr: 3.573 ± 0.683
4.084ArgVal: 4.084 ± 0.659
1.021ArgTrp: 1.021 ± 0.319
2.297ArgTyr: 2.297 ± 0.456
0.0ArgXaa: 0.0 ± 0.0
Ser
3.318SerAla: 3.318 ± 1.295
1.276SerCys: 1.276 ± 0.267
4.849SerAsp: 4.849 ± 0.88
4.339SerGlu: 4.339 ± 1.026
4.339SerPhe: 4.339 ± 0.733
5.615SerGly: 5.615 ± 1.62
2.552SerHis: 2.552 ± 0.497
4.339SerIle: 4.339 ± 1.093
5.36SerLys: 5.36 ± 0.838
8.167SerLeu: 8.167 ± 2.094
0.255SerMet: 0.255 ± 0.314
4.084SerAsn: 4.084 ± 0.982
2.808SerPro: 2.808 ± 0.475
3.573SerGln: 3.573 ± 2.517
7.657SerArg: 7.657 ± 1.643
8.933SerSer: 8.933 ± 2.263
4.339SerThr: 4.339 ± 0.997
7.402SerVal: 7.402 ± 1.477
2.042SerTrp: 2.042 ± 0.442
3.063SerTyr: 3.063 ± 1.227
0.0SerXaa: 0.0 ± 0.0
Thr
2.297ThrAla: 2.297 ± 1.102
1.276ThrCys: 1.276 ± 1.159
3.063ThrAsp: 3.063 ± 0.594
4.849ThrGlu: 4.849 ± 1.002
2.042ThrPhe: 2.042 ± 0.595
2.808ThrGly: 2.808 ± 0.61
1.276ThrHis: 1.276 ± 0.429
4.849ThrIle: 4.849 ± 1.501
4.084ThrLys: 4.084 ± 0.949
4.084ThrLeu: 4.084 ± 0.58
1.531ThrMet: 1.531 ± 0.471
1.787ThrAsn: 1.787 ± 1.175
2.808ThrPro: 2.808 ± 0.588
2.808ThrGln: 2.808 ± 0.83
2.042ThrArg: 2.042 ± 1.273
3.828ThrSer: 3.828 ± 0.705
3.828ThrThr: 3.828 ± 0.459
4.339ThrVal: 4.339 ± 2.124
1.276ThrTrp: 1.276 ± 0.435
1.531ThrTyr: 1.531 ± 0.374
0.0ThrXaa: 0.0 ± 0.0
Val
4.084ValAla: 4.084 ± 1.782
1.021ValCys: 1.021 ± 0.314
3.828ValAsp: 3.828 ± 1.473
2.808ValGlu: 2.808 ± 0.855
2.808ValPhe: 2.808 ± 0.555
4.849ValGly: 4.849 ± 0.67
0.766ValHis: 0.766 ± 0.337
2.297ValIle: 2.297 ± 0.491
3.573ValLys: 3.573 ± 1.294
4.339ValLeu: 4.339 ± 0.591
1.787ValMet: 1.787 ± 0.503
2.297ValAsn: 2.297 ± 0.483
3.828ValPro: 3.828 ± 1.0
3.318ValGln: 3.318 ± 0.561
4.084ValArg: 4.084 ± 0.84
6.126ValSer: 6.126 ± 1.2
2.552ValThr: 2.552 ± 0.852
3.063ValVal: 3.063 ± 1.142
1.276ValTrp: 1.276 ± 0.901
2.042ValTyr: 2.042 ± 0.771
0.0ValXaa: 0.0 ± 0.0
Trp
0.51TrpAla: 0.51 ± 0.468
0.0TrpCys: 0.0 ± 0.0
1.276TrpAsp: 1.276 ± 0.732
1.787TrpGlu: 1.787 ± 0.448
1.276TrpPhe: 1.276 ± 0.395
1.021TrpGly: 1.021 ± 0.592
0.766TrpHis: 0.766 ± 0.293
1.531TrpIle: 1.531 ± 0.471
1.276TrpLys: 1.276 ± 0.89
1.021TrpLeu: 1.021 ± 1.005
0.766TrpMet: 0.766 ± 0.518
1.276TrpAsn: 1.276 ± 0.732
1.021TrpPro: 1.021 ± 0.342
0.255TrpGln: 0.255 ± 0.146
0.255TrpArg: 0.255 ± 0.146
2.297TrpSer: 2.297 ± 0.483
1.021TrpThr: 1.021 ± 0.636
2.042TrpVal: 2.042 ± 0.807
0.255TrpTrp: 0.255 ± 0.146
0.255TrpTyr: 0.255 ± 0.489
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.51TyrAla: 0.51 ± 0.295
0.51TyrCys: 0.51 ± 0.25
1.531TyrAsp: 1.531 ± 0.355
1.531TyrGlu: 1.531 ± 0.338
2.042TyrPhe: 2.042 ± 1.389
1.531TyrGly: 1.531 ± 0.661
1.021TyrHis: 1.021 ± 0.267
1.021TyrIle: 1.021 ± 0.314
1.531TyrLys: 1.531 ± 0.338
6.891TyrLeu: 6.891 ± 1.274
0.255TyrMet: 0.255 ± 0.287
1.531TyrAsn: 1.531 ± 0.602
1.787TyrPro: 1.787 ± 0.758
1.021TyrGln: 1.021 ± 0.744
1.021TyrArg: 1.021 ± 0.5
3.828TyrSer: 3.828 ± 0.896
2.042TyrThr: 2.042 ± 0.7
1.787TyrVal: 1.787 ± 0.673
0.255TyrTrp: 0.255 ± 0.146
0.51TyrTyr: 0.51 ± 0.614
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (3919 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski