Amino acid dipepetide frequency for Itacaiunas virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.278AlaAla: 2.278 ± 1.606
1.013AlaCys: 1.013 ± 0.34
2.025AlaAsp: 2.025 ± 0.709
2.532AlaGlu: 2.532 ± 0.41
1.519AlaPhe: 1.519 ± 1.46
1.772AlaGly: 1.772 ± 0.678
1.519AlaHis: 1.519 ± 0.482
2.278AlaIle: 2.278 ± 1.311
2.025AlaLys: 2.025 ± 1.143
5.063AlaLeu: 5.063 ± 0.811
0.759AlaMet: 0.759 ± 0.292
1.266AlaAsn: 1.266 ± 0.791
1.013AlaPro: 1.013 ± 0.304
1.013AlaGln: 1.013 ± 0.59
1.519AlaArg: 1.519 ± 0.564
2.025AlaSer: 2.025 ± 0.755
1.266AlaThr: 1.266 ± 1.033
3.544AlaVal: 3.544 ± 1.608
0.759AlaTrp: 0.759 ± 0.566
2.025AlaTyr: 2.025 ± 0.521
0.0AlaXaa: 0.0 ± 0.0
Cys
0.253CysAla: 0.253 ± 0.311
0.253CysCys: 0.253 ± 0.311
0.253CysAsp: 0.253 ± 0.147
0.506CysGlu: 0.506 ± 0.541
1.519CysPhe: 1.519 ± 0.601
1.266CysGly: 1.266 ± 0.248
0.253CysHis: 0.253 ± 0.147
1.013CysIle: 1.013 ± 0.403
1.772CysLys: 1.772 ± 0.43
1.772CysLeu: 1.772 ± 0.549
0.253CysMet: 0.253 ± 0.147
1.013CysAsn: 1.013 ± 0.727
0.506CysPro: 0.506 ± 0.21
0.506CysGln: 0.506 ± 0.295
0.759CysArg: 0.759 ± 0.278
2.025CysSer: 2.025 ± 1.18
1.266CysThr: 1.266 ± 0.381
1.013CysVal: 1.013 ± 0.59
0.0CysTrp: 0.0 ± 0.0
0.759CysTyr: 0.759 ± 0.418
0.0CysXaa: 0.0 ± 0.0
Asp
2.532AspAla: 2.532 ± 0.649
0.506AspCys: 0.506 ± 0.273
3.291AspAsp: 3.291 ± 1.364
5.063AspGlu: 5.063 ± 0.904
3.544AspPhe: 3.544 ± 0.804
2.532AspGly: 2.532 ± 1.014
1.013AspHis: 1.013 ± 0.467
2.532AspIle: 2.532 ± 0.518
4.051AspLys: 4.051 ± 1.538
7.595AspLeu: 7.595 ± 1.003
1.013AspMet: 1.013 ± 0.555
2.785AspAsn: 2.785 ± 1.355
4.81AspPro: 4.81 ± 0.98
2.532AspGln: 2.532 ± 0.536
3.038AspArg: 3.038 ± 1.203
2.785AspSer: 2.785 ± 0.445
1.266AspThr: 1.266 ± 0.682
3.544AspVal: 3.544 ± 0.468
1.519AspTrp: 1.519 ± 0.444
2.278AspTyr: 2.278 ± 0.558
0.0AspXaa: 0.0 ± 0.0
Glu
2.278GluAla: 2.278 ± 0.63
1.013GluCys: 1.013 ± 0.34
5.316GluAsp: 5.316 ± 0.666
5.57GluGlu: 5.57 ± 1.138
2.532GluPhe: 2.532 ± 0.985
4.81GluGly: 4.81 ± 0.646
1.266GluHis: 1.266 ± 0.352
5.316GluIle: 5.316 ± 1.67
4.81GluLys: 4.81 ± 1.134
5.57GluLeu: 5.57 ± 0.659
1.013GluMet: 1.013 ± 0.625
4.304GluAsn: 4.304 ± 1.559
2.278GluPro: 2.278 ± 0.477
0.759GluGln: 0.759 ± 0.59
4.557GluArg: 4.557 ± 0.646
6.076GluSer: 6.076 ± 1.818
3.038GluThr: 3.038 ± 0.894
4.81GluVal: 4.81 ± 0.928
0.759GluTrp: 0.759 ± 0.461
1.013GluTyr: 1.013 ± 0.52
0.0GluXaa: 0.0 ± 0.0
Phe
1.013PheAla: 1.013 ± 0.449
1.013PheCys: 1.013 ± 0.551
1.772PheAsp: 1.772 ± 0.714
2.532PheGlu: 2.532 ± 0.495
2.532PhePhe: 2.532 ± 1.216
4.304PheGly: 4.304 ± 1.005
1.519PheHis: 1.519 ± 0.527
2.532PheIle: 2.532 ± 0.748
3.544PheLys: 3.544 ± 1.543
4.81PheLeu: 4.81 ± 1.665
1.266PheMet: 1.266 ± 0.504
1.013PheAsn: 1.013 ± 0.403
3.797PhePro: 3.797 ± 1.41
1.013PheGln: 1.013 ± 0.59
2.785PheArg: 2.785 ± 0.689
7.089PheSer: 7.089 ± 0.897
0.506PheThr: 0.506 ± 0.43
2.532PheVal: 2.532 ± 1.138
1.266PheTrp: 1.266 ± 0.52
0.506PheTyr: 0.506 ± 0.273
0.0PheXaa: 0.0 ± 0.0
Gly
2.785GlyAla: 2.785 ± 1.813
1.772GlyCys: 1.772 ± 0.52
3.544GlyAsp: 3.544 ± 0.938
4.557GlyGlu: 4.557 ± 1.259
2.532GlyPhe: 2.532 ± 0.681
4.557GlyGly: 4.557 ± 1.207
1.519GlyHis: 1.519 ± 0.659
4.051GlyIle: 4.051 ± 0.9
3.291GlyLys: 3.291 ± 0.752
7.595GlyLeu: 7.595 ± 1.623
0.506GlyMet: 0.506 ± 0.273
2.785GlyAsn: 2.785 ± 0.519
2.532GlyPro: 2.532 ± 0.67
2.532GlyGln: 2.532 ± 0.703
2.785GlyArg: 2.785 ± 0.474
6.076GlySer: 6.076 ± 1.187
3.291GlyThr: 3.291 ± 0.943
5.316GlyVal: 5.316 ± 1.029
0.253GlyTrp: 0.253 ± 0.147
2.278GlyTyr: 2.278 ± 0.619
0.0GlyXaa: 0.0 ± 0.0
His
0.759HisAla: 0.759 ± 0.278
0.0HisCys: 0.0 ± 0.0
0.253HisAsp: 0.253 ± 0.147
1.013HisGlu: 1.013 ± 0.367
1.266HisPhe: 1.266 ± 0.515
0.506HisGly: 0.506 ± 0.21
0.253HisHis: 0.253 ± 0.147
1.013HisIle: 1.013 ± 0.59
0.759HisLys: 0.759 ± 0.418
2.025HisLeu: 2.025 ± 0.461
0.253HisMet: 0.253 ± 0.311
0.759HisAsn: 0.759 ± 0.278
1.266HisPro: 1.266 ± 0.466
1.013HisGln: 1.013 ± 0.516
2.025HisArg: 2.025 ± 0.441
2.278HisSer: 2.278 ± 0.818
1.266HisThr: 1.266 ± 0.749
1.013HisVal: 1.013 ± 0.34
0.759HisTrp: 0.759 ± 0.497
1.266HisTyr: 1.266 ± 0.379
0.0HisXaa: 0.0 ± 0.0
Ile
2.278IleAla: 2.278 ± 0.579
1.519IleCys: 1.519 ± 0.601
1.772IleAsp: 1.772 ± 0.741
3.038IleGlu: 3.038 ± 0.88
2.278IlePhe: 2.278 ± 1.173
2.785IleGly: 2.785 ± 0.654
1.266IleHis: 1.266 ± 0.33
3.797IleIle: 3.797 ± 0.504
6.076IleLys: 6.076 ± 0.538
7.342IleLeu: 7.342 ± 1.594
1.013IleMet: 1.013 ± 0.362
2.785IleAsn: 2.785 ± 1.386
3.038IlePro: 3.038 ± 0.618
2.025IleGln: 2.025 ± 0.68
3.291IleArg: 3.291 ± 0.837
6.329IleSer: 6.329 ± 1.928
2.278IleThr: 2.278 ± 0.578
3.038IleVal: 3.038 ± 0.646
2.025IleTrp: 2.025 ± 0.884
2.785IleTyr: 2.785 ± 0.759
0.0IleXaa: 0.0 ± 0.0
Lys
2.532LysAla: 2.532 ± 0.917
0.253LysCys: 0.253 ± 0.147
5.316LysAsp: 5.316 ± 0.778
4.81LysGlu: 4.81 ± 1.313
3.544LysPhe: 3.544 ± 0.576
5.57LysGly: 5.57 ± 1.234
0.0LysHis: 0.0 ± 0.0
5.823LysIle: 5.823 ± 2.108
6.076LysLys: 6.076 ± 1.82
5.063LysLeu: 5.063 ± 0.845
1.266LysMet: 1.266 ± 0.736
4.304LysAsn: 4.304 ± 0.415
4.304LysPro: 4.304 ± 1.359
1.266LysGln: 1.266 ± 0.52
5.57LysArg: 5.57 ± 1.23
5.316LysSer: 5.316 ± 0.626
3.038LysThr: 3.038 ± 0.538
2.278LysVal: 2.278 ± 0.477
2.025LysTrp: 2.025 ± 0.828
1.013LysTyr: 1.013 ± 0.34
0.0LysXaa: 0.0 ± 0.0
Leu
3.544LeuAla: 3.544 ± 0.557
1.519LeuCys: 1.519 ± 0.922
7.342LeuAsp: 7.342 ± 0.867
7.342LeuGlu: 7.342 ± 2.421
4.051LeuPhe: 4.051 ± 1.144
7.595LeuGly: 7.595 ± 1.358
1.772LeuHis: 1.772 ± 0.628
8.101LeuIle: 8.101 ± 2.859
8.608LeuLys: 8.608 ± 1.851
9.367LeuLeu: 9.367 ± 1.458
3.038LeuMet: 3.038 ± 0.474
6.076LeuAsn: 6.076 ± 0.735
3.544LeuPro: 3.544 ± 1.209
1.266LeuGln: 1.266 ± 0.992
5.316LeuArg: 5.316 ± 1.041
8.354LeuSer: 8.354 ± 1.05
5.823LeuThr: 5.823 ± 1.733
7.089LeuVal: 7.089 ± 1.67
0.506LeuTrp: 0.506 ± 0.51
2.785LeuTyr: 2.785 ± 0.62
0.0LeuXaa: 0.0 ± 0.0
Met
1.266MetAla: 1.266 ± 0.445
0.253MetCys: 0.253 ± 0.373
1.772MetAsp: 1.772 ± 0.672
1.266MetGlu: 1.266 ± 0.504
1.519MetPhe: 1.519 ± 0.867
1.013MetGly: 1.013 ± 0.402
0.759MetHis: 0.759 ± 0.566
1.519MetIle: 1.519 ± 0.601
1.013MetLys: 1.013 ± 0.218
1.519MetLeu: 1.519 ± 0.592
1.519MetMet: 1.519 ± 1.133
1.266MetAsn: 1.266 ± 0.381
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
1.266MetArg: 1.266 ± 0.737
1.519MetSer: 1.519 ± 0.854
2.278MetThr: 2.278 ± 1.021
1.266MetVal: 1.266 ± 0.632
0.253MetTrp: 0.253 ± 0.271
0.506MetTyr: 0.506 ± 0.273
0.0MetXaa: 0.0 ± 0.0
Asn
1.266AsnAla: 1.266 ± 0.364
1.013AsnCys: 1.013 ± 0.543
1.772AsnAsp: 1.772 ± 0.741
3.038AsnGlu: 3.038 ± 0.921
1.772AsnPhe: 1.772 ± 0.605
3.038AsnGly: 3.038 ± 1.902
1.266AsnHis: 1.266 ± 0.52
1.772AsnIle: 1.772 ± 0.461
2.785AsnLys: 2.785 ± 0.888
7.595AsnLeu: 7.595 ± 1.121
1.519AsnMet: 1.519 ± 0.944
2.278AsnAsn: 2.278 ± 0.763
2.278AsnPro: 2.278 ± 1.008
2.278AsnGln: 2.278 ± 0.639
2.278AsnArg: 2.278 ± 1.164
5.316AsnSer: 5.316 ± 1.159
2.785AsnThr: 2.785 ± 0.572
1.772AsnVal: 1.772 ± 0.489
2.025AsnTrp: 2.025 ± 0.545
1.013AsnTyr: 1.013 ± 0.766
0.0AsnXaa: 0.0 ± 0.0
Pro
1.772ProAla: 1.772 ± 0.463
0.759ProCys: 0.759 ± 0.241
3.544ProAsp: 3.544 ± 0.801
3.797ProGlu: 3.797 ± 0.717
3.038ProPhe: 3.038 ± 0.928
3.038ProGly: 3.038 ± 1.081
0.506ProHis: 0.506 ± 0.21
2.785ProIle: 2.785 ± 1.063
1.519ProLys: 1.519 ± 0.679
3.797ProLeu: 3.797 ± 1.066
1.266ProMet: 1.266 ± 0.736
2.532ProAsn: 2.532 ± 0.767
3.038ProPro: 3.038 ± 1.538
2.532ProGln: 2.532 ± 1.747
2.278ProArg: 2.278 ± 1.076
5.316ProSer: 5.316 ± 0.79
2.278ProThr: 2.278 ± 0.361
2.278ProVal: 2.278 ± 1.431
1.013ProTrp: 1.013 ± 0.218
1.772ProTyr: 1.772 ± 0.469
0.0ProXaa: 0.0 ± 0.0
Gln
0.759GlnAla: 0.759 ± 0.442
0.253GlnCys: 0.253 ± 0.271
1.013GlnAsp: 1.013 ± 0.34
2.025GlnGlu: 2.025 ± 1.101
2.025GlnPhe: 2.025 ± 1.178
2.532GlnGly: 2.532 ± 0.374
1.266GlnHis: 1.266 ± 0.544
1.772GlnIle: 1.772 ± 0.469
2.532GlnLys: 2.532 ± 0.845
1.772GlnLeu: 1.772 ± 0.671
0.0GlnMet: 0.0 ± 0.0
1.772GlnAsn: 1.772 ± 0.995
0.759GlnPro: 0.759 ± 0.241
0.506GlnGln: 0.506 ± 0.21
2.532GlnArg: 2.532 ± 0.685
1.013GlnSer: 1.013 ± 0.41
1.266GlnThr: 1.266 ± 0.427
2.025GlnVal: 2.025 ± 0.829
0.0GlnTrp: 0.0 ± 0.0
0.506GlnTyr: 0.506 ± 0.273
0.0GlnXaa: 0.0 ± 0.0
Arg
1.013ArgAla: 1.013 ± 0.458
1.772ArgCys: 1.772 ± 0.869
3.797ArgAsp: 3.797 ± 0.569
3.797ArgGlu: 3.797 ± 0.779
3.038ArgPhe: 3.038 ± 0.928
3.544ArgGly: 3.544 ± 0.793
1.772ArgHis: 1.772 ± 0.534
3.038ArgIle: 3.038 ± 0.721
4.557ArgLys: 4.557 ± 1.242
6.582ArgLeu: 6.582 ± 1.757
1.772ArgMet: 1.772 ± 0.463
3.038ArgAsn: 3.038 ± 0.811
2.532ArgPro: 2.532 ± 0.411
1.519ArgGln: 1.519 ± 0.79
3.038ArgArg: 3.038 ± 0.587
4.81ArgSer: 4.81 ± 0.708
2.278ArgThr: 2.278 ± 0.686
3.038ArgVal: 3.038 ± 1.258
1.772ArgTrp: 1.772 ± 0.8
2.025ArgTyr: 2.025 ± 0.392
0.0ArgXaa: 0.0 ± 0.0
Ser
2.532SerAla: 2.532 ± 0.757
1.772SerCys: 1.772 ± 0.741
5.823SerAsp: 5.823 ± 1.904
7.848SerGlu: 7.848 ± 1.273
2.025SerPhe: 2.025 ± 0.709
5.57SerGly: 5.57 ± 1.153
2.278SerHis: 2.278 ± 0.606
4.304SerIle: 4.304 ± 1.354
5.57SerLys: 5.57 ± 1.103
8.861SerLeu: 8.861 ± 0.976
2.025SerMet: 2.025 ± 0.604
3.038SerAsn: 3.038 ± 1.134
6.076SerPro: 6.076 ± 0.826
2.278SerGln: 2.278 ± 0.438
5.57SerArg: 5.57 ± 0.773
8.861SerSer: 8.861 ± 1.082
3.291SerThr: 3.291 ± 0.536
6.835SerVal: 6.835 ± 0.942
3.291SerTrp: 3.291 ± 0.808
3.038SerTyr: 3.038 ± 1.611
0.0SerXaa: 0.0 ± 0.0
Thr
3.291ThrAla: 3.291 ± 0.999
0.506ThrCys: 0.506 ± 0.21
2.025ThrAsp: 2.025 ± 1.436
2.532ThrGlu: 2.532 ± 0.517
1.519ThrPhe: 1.519 ± 0.482
4.304ThrGly: 4.304 ± 0.523
0.253ThrHis: 0.253 ± 0.147
2.532ThrIle: 2.532 ± 0.448
1.772ThrLys: 1.772 ± 0.43
4.81ThrLeu: 4.81 ± 1.376
1.013ThrMet: 1.013 ± 0.402
1.772ThrAsn: 1.772 ± 0.321
2.785ThrPro: 2.785 ± 0.711
0.759ThrGln: 0.759 ± 0.41
3.544ThrArg: 3.544 ± 0.848
6.835ThrSer: 6.835 ± 2.481
2.278ThrThr: 2.278 ± 0.361
4.304ThrVal: 4.304 ± 1.179
0.506ThrTrp: 0.506 ± 0.21
0.253ThrTyr: 0.253 ± 0.271
0.0ThrXaa: 0.0 ± 0.0
Val
2.785ValAla: 2.785 ± 1.202
1.266ValCys: 1.266 ± 0.566
4.557ValAsp: 4.557 ± 0.415
2.532ValGlu: 2.532 ± 0.612
3.797ValPhe: 3.797 ± 0.695
3.291ValGly: 3.291 ± 0.501
0.506ValHis: 0.506 ± 0.21
3.291ValIle: 3.291 ± 1.08
4.304ValLys: 4.304 ± 0.969
7.848ValLeu: 7.848 ± 0.971
1.013ValMet: 1.013 ± 0.53
3.797ValAsn: 3.797 ± 0.697
2.025ValPro: 2.025 ± 0.464
1.013ValGln: 1.013 ± 0.94
3.797ValArg: 3.797 ± 0.794
4.051ValSer: 4.051 ± 1.295
5.823ValThr: 5.823 ± 1.061
2.785ValVal: 2.785 ± 0.73
1.266ValTrp: 1.266 ± 0.248
2.278ValTyr: 2.278 ± 0.739
0.0ValXaa: 0.0 ± 0.0
Trp
0.506TrpAla: 0.506 ± 0.295
0.0TrpCys: 0.0 ± 0.0
1.013TrpAsp: 1.013 ± 0.59
2.025TrpGlu: 2.025 ± 0.673
1.013TrpPhe: 1.013 ± 0.568
1.266TrpGly: 1.266 ± 0.515
0.0TrpHis: 0.0 ± 0.0
1.519TrpIle: 1.519 ± 0.448
1.013TrpLys: 1.013 ± 0.304
1.519TrpLeu: 1.519 ± 0.278
0.506TrpMet: 0.506 ± 0.273
1.519TrpAsn: 1.519 ± 0.657
0.759TrpPro: 0.759 ± 0.241
0.0TrpGln: 0.0 ± 0.0
1.266TrpArg: 1.266 ± 0.711
1.519TrpSer: 1.519 ± 0.515
1.519TrpThr: 1.519 ± 0.629
2.532TrpVal: 2.532 ± 0.718
0.253TrpTrp: 0.253 ± 0.271
1.013TrpTyr: 1.013 ± 0.218
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.772TyrAla: 1.772 ± 0.73
0.506TyrCys: 0.506 ± 0.387
2.025TyrAsp: 2.025 ± 1.014
1.266TyrGlu: 1.266 ± 0.433
2.278TyrPhe: 2.278 ± 0.277
1.266TyrGly: 1.266 ± 0.567
0.759TyrHis: 0.759 ± 0.377
1.772TyrIle: 1.772 ± 1.187
3.291TyrLys: 3.291 ± 0.555
2.532TyrLeu: 2.532 ± 0.6
0.506TyrMet: 0.506 ± 0.622
1.013TyrAsn: 1.013 ± 0.34
1.519TyrPro: 1.519 ± 0.564
1.519TyrGln: 1.519 ± 0.515
1.519TyrArg: 1.519 ± 0.555
3.291TyrSer: 3.291 ± 0.654
0.759TyrThr: 0.759 ± 0.278
1.013TyrVal: 1.013 ± 0.59
0.506TyrTrp: 0.506 ± 0.704
0.759TyrTyr: 0.759 ± 0.458
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (3951 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski