Amino acid dipeptide frequency for Tacheng Tick Virus 7

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
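The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own implementation: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions made for the example.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the same order as the table columns
CODES = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in sequences:
        # Any symbol outside the 20 standard residues is mapped to Xaa
        residues = [CODES.get(aa, "Xaa") for aa in seq.upper()]
        # Overlapping pairs within one protein (assumption: pairs do not
        # span two different proteins)
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    names = list(CODES.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }

# Toy sequences (hypothetical, not taken from this proteome)
freqs = dipeptide_per_mille(["MALLS", "LLXA"])
print(len(freqs))        # number of dipeptide classes, including Xaa
print(freqs["LeuLeu"])   # LeuLeu frequency in per mille
print(1000.0 / 400)      # uniform expectation over the 400 standard pairs
```

In the toy input, LeuLeu occurs twice among seven dipeptides, so its frequency is far above the uniform expectation of 2.5‰; real proteomes deviate less dramatically but in the same way.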

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
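As a small worked example of the unit conversion, the sketch below turns a few per mille values from the table into fractions and compares them with the uniform expectation of 1/400. The helper names are hypothetical, and the plain greater-than or less-than comparison is an assumption: the page does not state the exact rule used for the red and blue marking.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille
EXPECTED_PER_MILLE = 1000.0 / 400

def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation.

    Assumption: a simple comparison; the page's own colouring rule
    may use a different threshold.
    """
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"

# Values copied from the table (per mille)
table = {"LeuLeu": 12.704, "AlaAla": 4.537, "CysPhe": 0.227}
for name, value in table.items():
    print(name, per_mille_to_fraction(value), classify(value))
```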

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.537 ± 2.019
AlaCys: 2.042 ± 0.597
AlaAsp: 2.495 ± 0.799
AlaGlu: 3.403 ± 0.572
AlaPhe: 1.588 ± 0.493
AlaGly: 3.857 ± 0.5
AlaHis: 0.907 ± 0.328
AlaIle: 3.63 ± 0.923
AlaLys: 3.403 ± 0.763
AlaLeu: 8.167 ± 1.46
AlaMet: 0.681 ± 0.306
AlaAsn: 1.588 ± 0.694
AlaPro: 3.63 ± 0.556
AlaGln: 3.176 ± 1.682
AlaArg: 3.176 ± 0.737
AlaSer: 7.033 ± 0.603
AlaThr: 3.403 ± 0.867
AlaVal: 5.672 ± 1.225
AlaTrp: 0.907 ± 0.308
AlaTyr: 2.269 ± 0.777
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.588 ± 0.514
CysCys: 1.361 ± 0.777
CysAsp: 0.681 ± 0.399
CysGlu: 0.454 ± 0.253
CysPhe: 0.227 ± 0.343
CysGly: 0.681 ± 0.484
CysHis: 0.454 ± 0.189
CysIle: 0.454 ± 0.259
CysLys: 1.134 ± 0.654
CysLeu: 1.588 ± 1.02
CysMet: 0.907 ± 0.357
CysAsn: 1.361 ± 0.405
CysPro: 1.588 ± 0.472
CysGln: 1.134 ± 0.851
CysArg: 0.907 ± 0.259
CysSer: 1.361 ± 0.491
CysThr: 0.454 ± 0.46
CysVal: 1.134 ± 0.532
CysTrp: 0.227 ± 0.133
CysTyr: 0.907 ± 0.419
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.815 ± 0.46
AspCys: 0.454 ± 0.346
AspAsp: 2.722 ± 0.913
AspGlu: 1.361 ± 0.509
AspPhe: 1.361 ± 0.381
AspGly: 2.495 ± 0.698
AspHis: 1.588 ± 0.694
AspIle: 3.63 ± 1.645
AspLys: 2.269 ± 0.652
AspLeu: 6.125 ± 0.656
AspMet: 1.588 ± 0.445
AspAsn: 2.722 ± 0.465
AspPro: 4.537 ± 0.95
AspGln: 1.588 ± 0.45
AspArg: 1.588 ± 0.847
AspSer: 4.31 ± 0.799
AspThr: 3.176 ± 0.614
AspVal: 2.042 ± 0.792
AspTrp: 1.588 ± 0.511
AspTyr: 1.588 ± 0.568
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.722 ± 1.131
GluCys: 0.907 ± 0.289
GluAsp: 2.269 ± 0.537
GluGlu: 3.403 ± 1.926
GluPhe: 1.815 ± 0.587
GluGly: 3.403 ± 0.798
GluHis: 0.907 ± 0.498
GluIle: 2.722 ± 0.535
GluLys: 2.722 ± 0.641
GluLeu: 5.898 ± 0.628
GluMet: 1.361 ± 0.489
GluAsn: 1.134 ± 0.498
GluPro: 2.269 ± 0.702
GluGln: 2.495 ± 0.661
GluArg: 3.403 ± 0.992
GluSer: 2.495 ± 0.742
GluThr: 2.042 ± 0.706
GluVal: 3.63 ± 1.072
GluTrp: 1.588 ± 0.711
GluTyr: 2.042 ± 0.569
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.588 ± 0.518
PheCys: 0.907 ± 0.642
PheAsp: 1.361 ± 0.627
PheGlu: 2.722 ± 0.823
PhePhe: 1.815 ± 0.518
PheGly: 2.269 ± 0.673
PheHis: 0.907 ± 0.518
PheIle: 2.269 ± 0.477
PheLys: 1.361 ± 0.292
PheLeu: 3.63 ± 0.928
PheMet: 1.134 ± 0.408
PheAsn: 1.361 ± 0.567
PhePro: 2.042 ± 0.447
PheGln: 1.815 ± 0.49
PheArg: 0.681 ± 0.363
PheSer: 3.403 ± 0.854
PheThr: 2.269 ± 0.445
PheVal: 1.134 ± 0.314
PheTrp: 0.0 ± 0.0
PheTyr: 0.907 ± 0.308
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.176 ± 0.839
GlyCys: 0.907 ± 0.552
GlyAsp: 3.176 ± 1.143
GlyGlu: 1.815 ± 1.226
GlyPhe: 3.403 ± 0.841
GlyGly: 4.537 ± 0.772
GlyHis: 1.588 ± 0.553
GlyIle: 2.722 ± 0.917
GlyLys: 2.042 ± 0.771
GlyLeu: 3.857 ± 0.99
GlyMet: 1.361 ± 0.414
GlyAsn: 2.722 ± 0.958
GlyPro: 3.63 ± 1.04
GlyGln: 2.269 ± 0.686
GlyArg: 3.176 ± 1.609
GlySer: 4.764 ± 1.453
GlyThr: 3.176 ± 0.962
GlyVal: 4.083 ± 0.788
GlyTrp: 2.269 ± 0.681
GlyTyr: 1.815 ± 0.532
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.495 ± 0.88
HisCys: 0.907 ± 0.532
HisAsp: 1.588 ± 0.386
HisGlu: 0.907 ± 0.389
HisPhe: 1.134 ± 0.498
HisGly: 1.361 ± 0.45
HisHis: 2.269 ± 0.449
HisIle: 2.042 ± 0.912
HisLys: 2.042 ± 0.501
HisLeu: 3.857 ± 0.629
HisMet: 1.815 ± 0.575
HisAsn: 1.134 ± 0.32
HisPro: 2.495 ± 0.856
HisGln: 1.588 ± 0.567
HisArg: 1.588 ± 0.584
HisSer: 2.495 ± 1.277
HisThr: 1.588 ± 0.557
HisVal: 1.588 ± 0.836
HisTrp: 0.227 ± 0.133
HisTyr: 0.907 ± 0.328
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.949 ± 0.761
IleCys: 0.227 ± 0.23
IleAsp: 3.403 ± 0.79
IleGlu: 1.361 ± 0.34
IlePhe: 2.269 ± 0.732
IleGly: 2.042 ± 0.575
IleHis: 2.269 ± 0.482
IleIle: 4.083 ± 1.115
IleLys: 2.269 ± 0.349
IleLeu: 4.991 ± 1.179
IleMet: 2.495 ± 0.77
IleAsn: 2.042 ± 0.464
IlePro: 4.083 ± 1.015
IleGln: 3.176 ± 0.633
IleArg: 2.269 ± 0.558
IleSer: 5.672 ± 0.931
IleThr: 3.403 ± 0.552
IleVal: 2.722 ± 0.419
IleTrp: 0.681 ± 0.413
IleTyr: 2.269 ± 0.602
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.176 ± 0.867
LysCys: 1.361 ± 0.561
LysAsp: 1.588 ± 0.507
LysGlu: 2.949 ± 1.025
LysPhe: 1.588 ± 0.625
LysGly: 4.537 ± 0.825
LysHis: 1.588 ± 0.821
LysIle: 2.722 ± 0.962
LysLys: 2.495 ± 0.81
LysLeu: 5.445 ± 1.086
LysMet: 1.588 ± 0.378
LysAsn: 0.907 ± 0.8
LysPro: 0.681 ± 0.363
LysGln: 2.042 ± 0.58
LysArg: 2.722 ± 0.554
LysSer: 4.31 ± 0.981
LysThr: 3.176 ± 0.319
LysVal: 2.949 ± 1.026
LysTrp: 1.588 ± 0.545
LysTyr: 2.269 ± 0.686
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.301 ± 1.074
LeuCys: 1.588 ± 0.813
LeuAsp: 4.991 ± 0.597
LeuGlu: 6.125 ± 1.539
LeuPhe: 4.31 ± 0.807
LeuGly: 4.991 ± 1.01
LeuHis: 3.63 ± 0.838
LeuIle: 7.713 ± 1.161
LeuLys: 5.218 ± 0.713
LeuLeu: 12.704 ± 1.709
LeuMet: 2.949 ± 0.494
LeuAsn: 3.63 ± 0.338
LeuPro: 5.218 ± 1.147
LeuGln: 3.176 ± 0.744
LeuArg: 4.991 ± 0.991
LeuSer: 9.074 ± 1.093
LeuThr: 9.301 ± 1.912
LeuVal: 7.713 ± 0.659
LeuTrp: 2.042 ± 0.745
LeuTyr: 3.857 ± 0.812
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.269 ± 0.677
MetCys: 0.454 ± 0.339
MetAsp: 1.134 ± 0.491
MetGlu: 2.042 ± 0.464
MetPhe: 1.134 ± 0.411
MetGly: 2.042 ± 0.612
MetHis: 0.454 ± 0.266
MetIle: 1.134 ± 0.549
MetLys: 1.815 ± 0.342
MetLeu: 3.176 ± 0.652
MetMet: 0.907 ± 0.359
MetAsn: 0.907 ± 0.378
MetPro: 1.588 ± 0.766
MetGln: 0.227 ± 0.133
MetArg: 2.269 ± 0.516
MetSer: 3.176 ± 1.09
MetThr: 1.815 ± 0.965
MetVal: 1.361 ± 0.865
MetTrp: 0.681 ± 0.519
MetTyr: 0.907 ± 0.389
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.722 ± 0.996
AsnCys: 0.454 ± 0.339
AsnAsp: 0.454 ± 0.46
AsnGlu: 0.907 ± 0.312
AsnPhe: 0.907 ± 0.532
AsnGly: 0.907 ± 0.532
AsnHis: 0.907 ± 0.449
AsnIle: 1.361 ± 0.34
AsnLys: 2.042 ± 0.411
AsnLeu: 3.403 ± 0.59
AsnMet: 0.907 ± 0.498
AsnAsn: 1.588 ± 0.445
AsnPro: 3.63 ± 0.884
AsnGln: 2.042 ± 0.75
AsnArg: 1.361 ± 0.653
AsnSer: 3.176 ± 0.726
AsnThr: 2.495 ± 0.695
AsnVal: 1.588 ± 0.445
AsnTrp: 0.907 ± 0.328
AsnTyr: 0.907 ± 0.419
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.949 ± 0.73
ProCys: 0.681 ± 0.346
ProAsp: 3.403 ± 0.742
ProGlu: 2.949 ± 1.452
ProPhe: 1.588 ± 0.801
ProGly: 1.815 ± 0.563
ProHis: 3.403 ± 1.074
ProIle: 2.949 ± 0.907
ProLys: 3.403 ± 0.509
ProLeu: 9.301 ± 2.097
ProMet: 1.134 ± 0.545
ProAsn: 0.681 ± 0.506
ProPro: 4.083 ± 1.927
ProGln: 1.815 ± 0.345
ProArg: 3.176 ± 1.099
ProSer: 5.445 ± 0.887
ProThr: 4.991 ± 1.192
ProVal: 4.31 ± 1.483
ProTrp: 0.227 ± 0.343
ProTyr: 2.269 ± 0.573
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.537 ± 1.073
GlnCys: 0.454 ± 0.46
GlnAsp: 2.495 ± 0.911
GlnGlu: 3.176 ± 1.383
GlnPhe: 0.907 ± 0.397
GlnGly: 2.269 ± 0.711
GlnHis: 1.134 ± 0.307
GlnIle: 1.815 ± 0.636
GlnLys: 2.269 ± 0.725
GlnLeu: 5.218 ± 1.312
GlnMet: 0.907 ± 0.858
GlnAsn: 1.134 ± 0.498
GlnPro: 1.588 ± 0.499
GlnGln: 0.681 ± 0.275
GlnArg: 1.815 ± 0.364
GlnSer: 3.63 ± 0.861
GlnThr: 0.907 ± 0.253
GlnVal: 1.815 ± 0.527
GlnTrp: 1.134 ± 0.284
GlnTyr: 1.588 ± 0.741
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.218 ± 0.802
ArgCys: 0.681 ± 0.233
ArgAsp: 2.949 ± 0.683
ArgGlu: 2.949 ± 0.621
ArgPhe: 1.815 ± 1.09
ArgGly: 2.949 ± 0.647
ArgHis: 1.361 ± 0.405
ArgIle: 2.722 ± 0.366
ArgLys: 2.269 ± 0.705
ArgLeu: 5.445 ± 1.443
ArgMet: 1.588 ± 0.741
ArgAsn: 0.907 ± 0.9
ArgPro: 2.722 ± 0.881
ArgGln: 2.042 ± 0.525
ArgArg: 3.63 ± 1.222
ArgSer: 3.857 ± 0.758
ArgThr: 3.403 ± 0.611
ArgVal: 4.31 ± 0.704
ArgTrp: 0.227 ± 0.343
ArgTyr: 1.588 ± 0.553
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.083 ± 0.563
SerCys: 2.042 ± 0.532
SerAsp: 3.63 ± 0.704
SerGlu: 4.31 ± 0.952
SerPhe: 3.403 ± 0.613
SerGly: 4.537 ± 1.421
SerHis: 4.537 ± 0.873
SerIle: 2.949 ± 0.517
SerLys: 3.63 ± 0.793
SerLeu: 11.116 ± 0.729
SerMet: 3.176 ± 0.794
SerAsn: 2.949 ± 1.127
SerPro: 7.033 ± 1.126
SerGln: 4.31 ± 0.85
SerArg: 5.218 ± 0.98
SerSer: 7.94 ± 1.241
SerThr: 4.764 ± 0.695
SerVal: 6.579 ± 0.745
SerTrp: 0.227 ± 0.271
SerTyr: 2.269 ± 0.558
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.949 ± 1.164
ThrCys: 1.134 ± 0.411
ThrAsp: 4.991 ± 1.096
ThrGlu: 2.495 ± 1.131
ThrPhe: 1.588 ± 1.512
ThrGly: 5.218 ± 2.007
ThrHis: 2.495 ± 0.957
ThrIle: 2.722 ± 0.973
ThrLys: 4.083 ± 0.78
ThrLeu: 6.579 ± 1.218
ThrMet: 1.134 ± 0.409
ThrAsn: 2.042 ± 0.46
ThrPro: 3.63 ± 0.839
ThrGln: 2.722 ± 1.017
ThrArg: 3.176 ± 1.025
ThrSer: 6.352 ± 1.161
ThrThr: 4.991 ± 1.062
ThrVal: 3.176 ± 0.66
ThrTrp: 0.907 ± 0.518
ThrTyr: 0.681 ± 0.399
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.083 ± 1.169
ValCys: 1.134 ± 0.532
ValAsp: 2.269 ± 0.358
ValGlu: 3.403 ± 0.25
ValPhe: 1.361 ± 0.414
ValGly: 3.403 ± 0.594
ValHis: 2.042 ± 1.62
ValIle: 3.63 ± 1.26
ValLys: 2.949 ± 0.801
ValLeu: 5.672 ± 1.151
ValMet: 2.269 ± 0.732
ValAsn: 1.134 ± 0.554
ValPro: 2.949 ± 0.654
ValGln: 2.495 ± 0.51
ValArg: 3.403 ± 1.176
ValSer: 6.352 ± 1.671
ValThr: 5.218 ± 1.313
ValVal: 2.949 ± 0.751
ValTrp: 1.815 ± 0.384
ValTyr: 1.815 ± 0.575
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.588 ± 0.807
TrpCys: 0.227 ± 0.133
TrpAsp: 1.361 ± 0.377
TrpGlu: 1.134 ± 0.689
TrpPhe: 0.454 ± 0.266
TrpGly: 1.588 ± 0.741
TrpHis: 0.227 ± 0.133
TrpIle: 1.815 ± 0.879
TrpLys: 0.454 ± 0.259
TrpLeu: 2.042 ± 0.464
TrpMet: 0.681 ± 0.233
TrpAsn: 1.134 ± 0.478
TrpPro: 1.134 ± 0.307
TrpGln: 0.227 ± 0.42
TrpArg: 0.907 ± 0.253
TrpSer: 0.907 ± 0.403
TrpThr: 1.588 ± 0.572
TrpVal: 0.454 ± 0.266
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.227 ± 0.343
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.815 ± 0.304
TyrCys: 0.681 ± 0.399
TyrAsp: 1.588 ± 0.454
TyrGlu: 1.361 ± 0.482
TyrPhe: 0.907 ± 0.328
TyrGly: 1.588 ± 0.499
TyrHis: 1.361 ± 0.567
TyrIle: 1.588 ± 0.546
TyrLys: 1.815 ± 0.438
TyrLeu: 4.537 ± 0.45
TyrMet: 0.681 ± 0.891
TyrAsn: 1.588 ± 0.716
TyrPro: 2.042 ± 0.626
TyrGln: 0.454 ± 0.266
TyrArg: 3.176 ± 0.353
TyrSer: 2.949 ± 0.493
TyrThr: 0.907 ± 0.529
TyrVal: 1.134 ± 0.411
TyrTrp: 0.907 ± 0.546
TyrTyr: 0.907 ± 0.308
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4409 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
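A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions rather than the database's actual procedure: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 resamples, and that the reported error is the standard deviation of those estimates. The function names and the toy sequences are hypothetical.

```python
import random
import statistics

def dipeptide_freq(sequences, dipeptide):
    """Frequency (per mille) of a one-letter dipeptide such as 'LL'."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the proteome, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)

# Toy proteome (hypothetical sequences, not from Tacheng Tick Virus 7)
proteins = ["MALLSLLA", "MKTAYIAK", "LLLLGG", "MSSPT"]
print(dipeptide_freq(proteins, "LL"), bootstrap_error(proteins, "LL"))
print(dipeptide_freq(proteins, "WW"), bootstrap_error(proteins, "WW"))
```

A dipeptide that is absent from every protein has the same zero frequency in every resample, which is why such entries appear as 0.0 ± 0.0 in the table.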

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski