Amino acid dipepetide frequency for Human T-cell leukemia virus 3 (strain 2026ND) (HTLV-3)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.19AlaAla: 5.19 ± 0.806
2.307AlaCys: 2.307 ± 0.902
1.442AlaAsp: 1.442 ± 0.566
2.307AlaGlu: 2.307 ± 0.66
3.172AlaPhe: 3.172 ± 0.888
3.749AlaGly: 3.749 ± 0.7
1.442AlaHis: 1.442 ± 0.368
7.497AlaIle: 7.497 ± 1.371
1.153AlaLys: 1.153 ± 0.386
9.227AlaLeu: 9.227 ± 1.619
0.577AlaMet: 0.577 ± 0.592
2.307AlaAsn: 2.307 ± 0.66
9.804AlaPro: 9.804 ± 1.455
3.749AlaGln: 3.749 ± 0.647
2.595AlaArg: 2.595 ± 0.743
5.767AlaSer: 5.767 ± 0.934
2.884AlaThr: 2.884 ± 1.035
3.172AlaVal: 3.172 ± 0.625
0.0AlaTrp: 0.0 ± 0.0
2.018AlaTyr: 2.018 ± 0.724
0.0AlaXaa: 0.0 ± 0.0
Cys
0.577CysAla: 0.577 ± 0.596
0.288CysCys: 0.288 ± 0.351
0.0CysAsp: 0.0 ± 0.0
1.153CysGlu: 1.153 ± 0.386
1.442CysPhe: 1.442 ± 0.405
1.73CysGly: 1.73 ± 0.724
0.577CysHis: 0.577 ± 0.313
0.288CysIle: 0.288 ± 0.192
1.73CysLys: 1.73 ± 0.32
3.749CysLeu: 3.749 ± 0.472
0.288CysMet: 0.288 ± 0.351
0.865CysAsn: 0.865 ± 0.379
5.19CysPro: 5.19 ± 0.88
4.325CysGln: 4.325 ± 0.844
0.577CysArg: 0.577 ± 0.384
1.442CysSer: 1.442 ± 0.368
0.577CysThr: 0.577 ± 0.702
1.153CysVal: 1.153 ± 0.684
0.0CysTrp: 0.0 ± 0.0
0.288CysTyr: 0.288 ± 0.351
0.0CysXaa: 0.0 ± 0.0
Asp
1.73AspAla: 1.73 ± 0.453
2.595AspCys: 2.595 ± 0.673
0.865AspAsp: 0.865 ± 0.627
0.288AspGlu: 0.288 ± 0.342
0.865AspPhe: 0.865 ± 0.362
1.153AspGly: 1.153 ± 0.96
0.865AspHis: 0.865 ± 0.635
2.018AspIle: 2.018 ± 0.628
1.442AspLys: 1.442 ± 0.491
7.497AspLeu: 7.497 ± 1.485
0.0AspMet: 0.0 ± 0.295
1.73AspAsn: 1.73 ± 0.32
7.785AspPro: 7.785 ± 0.991
1.442AspGln: 1.442 ± 0.529
0.865AspArg: 0.865 ± 0.525
2.307AspSer: 2.307 ± 0.396
2.884AspThr: 2.884 ± 1.097
0.577AspVal: 0.577 ± 0.384
0.288AspTrp: 0.288 ± 0.342
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
4.325GluAla: 4.325 ± 0.826
0.865GluCys: 0.865 ± 0.362
2.307GluAsp: 2.307 ± 0.933
1.442GluGlu: 1.442 ± 0.671
1.153GluPhe: 1.153 ± 0.386
1.442GluGly: 1.442 ± 0.368
1.153GluHis: 1.153 ± 0.359
1.153GluIle: 1.153 ± 0.417
0.865GluLys: 0.865 ± 0.4
2.307GluLeu: 2.307 ± 0.92
0.865GluMet: 0.865 ± 0.362
0.288GluAsn: 0.288 ± 0.342
1.73GluPro: 1.73 ± 0.249
2.307GluGln: 2.307 ± 0.678
2.595GluArg: 2.595 ± 0.568
0.865GluSer: 0.865 ± 0.527
4.037GluThr: 4.037 ± 0.773
2.018GluVal: 2.018 ± 0.56
0.0GluTrp: 0.0 ± 0.0
1.153GluTyr: 1.153 ± 0.417
0.0GluXaa: 0.0 ± 0.0
Phe
0.288PheAla: 0.288 ± 0.192
1.153PheCys: 1.153 ± 0.684
1.73PheAsp: 1.73 ± 0.724
0.577PheGlu: 0.577 ± 0.313
0.577PhePhe: 0.577 ± 0.313
1.73PheGly: 1.73 ± 0.597
2.018PheHis: 2.018 ± 0.579
1.153PheIle: 1.153 ± 0.388
0.865PheLys: 0.865 ± 0.402
5.19PheLeu: 5.19 ± 1.297
0.865PheMet: 0.865 ± 0.362
0.288PheAsn: 0.288 ± 0.342
3.172PhePro: 3.172 ± 1.13
2.307PheGln: 2.307 ± 1.063
1.73PheArg: 1.73 ± 0.548
4.325PheSer: 4.325 ± 1.884
1.442PheThr: 1.442 ± 0.574
1.153PheVal: 1.153 ± 0.442
0.288PheTrp: 0.288 ± 0.351
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
5.19GlyAla: 5.19 ± 0.894
0.288GlyCys: 0.288 ± 0.351
0.577GlyAsp: 0.577 ± 0.313
2.595GlyGlu: 2.595 ± 0.569
0.865GlyPhe: 0.865 ± 0.527
3.46GlyGly: 3.46 ± 0.466
1.73GlyHis: 1.73 ± 0.637
2.018GlyIle: 2.018 ± 0.458
2.018GlyLys: 2.018 ± 0.452
8.939GlyLeu: 8.939 ± 0.791
0.288GlyMet: 0.288 ± 0.181
1.442GlyAsn: 1.442 ± 0.272
6.92GlyPro: 6.92 ± 1.338
3.46GlyGln: 3.46 ± 0.919
2.018GlyArg: 2.018 ± 0.543
4.614GlySer: 4.614 ± 0.837
2.595GlyThr: 2.595 ± 0.691
0.288GlyVal: 0.288 ± 0.351
0.288GlyTrp: 0.288 ± 0.351
2.307GlyTyr: 2.307 ± 0.637
0.0GlyXaa: 0.0 ± 0.0
His
1.442HisAla: 1.442 ± 0.68
0.865HisCys: 0.865 ± 0.635
1.442HisAsp: 1.442 ± 0.272
0.865HisGlu: 0.865 ± 0.362
0.865HisPhe: 0.865 ± 0.4
1.442HisGly: 1.442 ± 0.491
2.595HisHis: 2.595 ± 0.862
2.307HisIle: 2.307 ± 0.763
0.288HisLys: 0.288 ± 0.342
3.46HisLeu: 3.46 ± 0.812
0.0HisMet: 0.0 ± 0.0
0.577HisAsn: 0.577 ± 0.384
2.307HisPro: 2.307 ± 0.676
3.46HisGln: 3.46 ± 0.972
2.307HisArg: 2.307 ± 0.38
1.153HisSer: 1.153 ± 0.388
2.307HisThr: 2.307 ± 1.058
1.153HisVal: 1.153 ± 0.684
3.172HisTrp: 3.172 ± 0.928
0.577HisTyr: 0.577 ± 0.313
0.0HisXaa: 0.0 ± 0.0
Ile
2.307IleAla: 2.307 ± 0.98
0.288IleCys: 0.288 ± 0.192
1.73IleAsp: 1.73 ± 0.321
0.288IleGlu: 0.288 ± 0.192
2.018IlePhe: 2.018 ± 0.984
1.442IleGly: 1.442 ± 0.423
2.595IleHis: 2.595 ± 0.711
1.442IleIle: 1.442 ± 0.68
2.307IleLys: 2.307 ± 0.773
10.092IleLeu: 10.092 ± 1.515
0.288IleMet: 0.288 ± 0.192
3.749IleAsn: 3.749 ± 1.322
5.479IlePro: 5.479 ± 0.814
4.037IleGln: 4.037 ± 0.723
1.73IleArg: 1.73 ± 0.371
3.749IleSer: 3.749 ± 1.308
2.884IleThr: 2.884 ± 0.665
1.73IleVal: 1.73 ± 0.495
1.153IleTrp: 1.153 ± 0.388
0.577IleTyr: 0.577 ± 0.492
0.0IleXaa: 0.0 ± 0.0
Lys
2.595LysAla: 2.595 ± 0.707
0.288LysCys: 0.288 ± 0.351
3.46LysAsp: 3.46 ± 1.449
3.172LysGlu: 3.172 ± 0.759
2.018LysPhe: 2.018 ± 0.437
1.442LysGly: 1.442 ± 0.368
0.865LysHis: 0.865 ± 0.39
2.595LysIle: 2.595 ± 0.489
1.442LysLys: 1.442 ± 0.368
2.884LysLeu: 2.884 ± 0.843
0.0LysMet: 0.0 ± 0.0
3.749LysAsn: 3.749 ± 0.917
1.73LysPro: 1.73 ± 0.32
2.884LysGln: 2.884 ± 0.824
2.307LysArg: 2.307 ± 0.38
1.73LysSer: 1.73 ± 0.758
5.767LysThr: 5.767 ± 1.459
1.442LysVal: 1.442 ± 0.478
0.865LysTrp: 0.865 ± 0.576
1.153LysTyr: 1.153 ± 0.514
0.0LysXaa: 0.0 ± 0.0
Leu
12.111LeuAla: 12.111 ± 2.452
2.307LeuCys: 2.307 ± 0.489
4.614LeuAsp: 4.614 ± 0.619
3.172LeuGlu: 3.172 ± 0.42
3.46LeuPhe: 3.46 ± 1.677
5.767LeuGly: 5.767 ± 1.012
6.632LeuHis: 6.632 ± 0.682
8.074LeuIle: 8.074 ± 0.832
3.749LeuLys: 3.749 ± 0.26
13.264LeuLeu: 13.264 ± 1.477
1.73LeuMet: 1.73 ± 0.425
6.055LeuAsn: 6.055 ± 1.035
14.129LeuPro: 14.129 ± 2.573
11.246LeuGln: 11.246 ± 1.713
9.227LeuArg: 9.227 ± 1.458
6.055LeuSer: 6.055 ± 2.018
5.479LeuThr: 5.479 ± 1.591
3.46LeuVal: 3.46 ± 0.718
1.442LeuTrp: 1.442 ± 0.491
3.172LeuTyr: 3.172 ± 1.569
0.0LeuXaa: 0.0 ± 0.0
Met
0.577MetAla: 0.577 ± 0.313
0.0MetCys: 0.0 ± 0.0
0.577MetAsp: 0.577 ± 0.403
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
1.153MetGly: 1.153 ± 0.345
0.0MetHis: 0.0 ± 0.0
0.288MetIle: 0.288 ± 0.342
0.865MetLys: 0.865 ± 0.362
1.73MetLeu: 1.73 ± 0.637
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
0.865MetPro: 0.865 ± 0.804
0.865MetGln: 0.865 ± 0.362
0.0MetArg: 0.0 ± 0.0
0.288MetSer: 0.288 ± 0.342
0.288MetThr: 0.288 ± 0.342
1.153MetVal: 1.153 ± 0.345
0.288MetTrp: 0.288 ± 0.342
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.442AsnAla: 1.442 ± 0.491
0.865AsnCys: 0.865 ± 0.627
0.288AsnAsp: 0.288 ± 0.192
0.577AsnGlu: 0.577 ± 0.313
1.153AsnPhe: 1.153 ± 0.345
1.73AsnGly: 1.73 ± 0.637
1.442AsnHis: 1.442 ± 0.677
2.307AsnIle: 2.307 ± 0.773
2.307AsnLys: 2.307 ± 0.921
2.884AsnLeu: 2.884 ± 0.643
0.0AsnMet: 0.0 ± 0.0
2.595AsnAsn: 2.595 ± 0.321
5.479AsnPro: 5.479 ± 1.192
3.46AsnGln: 3.46 ± 1.059
0.577AsnArg: 0.577 ± 0.702
3.172AsnSer: 3.172 ± 0.666
2.595AsnThr: 2.595 ± 0.694
3.172AsnVal: 3.172 ± 0.394
0.288AsnTrp: 0.288 ± 0.351
2.595AsnTyr: 2.595 ± 0.591
0.0AsnXaa: 0.0 ± 0.0
Pro
4.902ProAla: 4.902 ± 1.011
5.767ProCys: 5.767 ± 1.167
1.73ProAsp: 1.73 ± 0.514
5.19ProGlu: 5.19 ± 1.258
3.172ProPhe: 3.172 ± 0.646
7.785ProGly: 7.785 ± 1.279
2.595ProHis: 2.595 ± 0.356
6.92ProIle: 6.92 ± 1.194
7.209ProLys: 7.209 ± 1.507
10.381ProLeu: 10.381 ± 1.187
0.865ProMet: 0.865 ± 0.845
4.902ProAsn: 4.902 ± 0.675
14.994ProPro: 14.994 ± 2.511
5.19ProGln: 5.19 ± 1.266
4.902ProArg: 4.902 ± 1.086
8.651ProSer: 8.651 ± 2.558
6.632ProThr: 6.632 ± 1.225
7.209ProVal: 7.209 ± 1.484
3.46ProTrp: 3.46 ± 0.75
3.46ProTyr: 3.46 ± 1.061
0.0ProXaa: 0.0 ± 0.0
Gln
9.804GlnAla: 9.804 ± 1.26
2.307GlnCys: 2.307 ± 0.488
2.595GlnAsp: 2.595 ± 0.486
3.749GlnGlu: 3.749 ± 0.928
2.884GlnPhe: 2.884 ± 0.843
4.902GlnGly: 4.902 ± 1.365
1.153GlnHis: 1.153 ± 0.622
1.153GlnIle: 1.153 ± 0.625
3.749GlnLys: 3.749 ± 1.04
6.344GlnLeu: 6.344 ± 0.979
1.153GlnMet: 1.153 ± 0.386
2.018GlnAsn: 2.018 ± 0.185
8.362GlnPro: 8.362 ± 1.395
4.902GlnGln: 4.902 ± 1.472
0.865GlnArg: 0.865 ± 0.971
3.172GlnSer: 3.172 ± 0.75
3.46GlnThr: 3.46 ± 0.809
3.172GlnVal: 3.172 ± 1.015
1.153GlnTrp: 1.153 ± 0.312
2.307GlnTyr: 2.307 ± 0.38
0.0GlnXaa: 0.0 ± 0.0
Arg
2.595ArgAla: 2.595 ± 0.722
1.442ArgCys: 1.442 ± 0.594
5.767ArgAsp: 5.767 ± 1.34
2.595ArgGlu: 2.595 ± 0.729
1.153ArgPhe: 1.153 ± 0.807
3.172ArgGly: 3.172 ± 0.323
0.288ArgHis: 0.288 ± 0.342
0.865ArgIle: 0.865 ± 0.362
3.749ArgLys: 3.749 ± 0.766
6.055ArgLeu: 6.055 ± 0.558
0.288ArgMet: 0.288 ± 0.192
0.577ArgAsn: 0.577 ± 0.313
5.767ArgPro: 5.767 ± 1.884
1.153ArgGln: 1.153 ± 0.312
2.595ArgArg: 2.595 ± 0.878
2.884ArgSer: 2.884 ± 0.563
2.018ArgThr: 2.018 ± 0.701
2.307ArgVal: 2.307 ± 0.678
0.577ArgTrp: 0.577 ± 0.384
0.577ArgTyr: 0.577 ± 0.384
0.0ArgXaa: 0.0 ± 0.0
Ser
6.632SerAla: 6.632 ± 0.933
1.153SerCys: 1.153 ± 0.519
2.884SerAsp: 2.884 ± 1.841
2.595SerGlu: 2.595 ± 0.512
3.46SerPhe: 3.46 ± 1.131
2.307SerGly: 2.307 ± 0.619
1.442SerHis: 1.442 ± 0.68
2.018SerIle: 2.018 ± 1.024
2.884SerLys: 2.884 ± 0.983
11.246SerLeu: 11.246 ± 2.304
0.577SerMet: 0.577 ± 0.403
3.172SerAsn: 3.172 ± 0.764
8.651SerPro: 8.651 ± 1.977
4.614SerGln: 4.614 ± 0.76
3.749SerArg: 3.749 ± 0.875
11.246SerSer: 11.246 ± 3.713
3.749SerThr: 3.749 ± 2.033
3.172SerVal: 3.172 ± 0.323
0.865SerTrp: 0.865 ± 0.635
0.865SerTyr: 0.865 ± 0.635
0.0SerXaa: 0.0 ± 0.0
Thr
2.884ThrAla: 2.884 ± 0.605
1.153ThrCys: 1.153 ± 0.388
2.595ThrAsp: 2.595 ± 0.848
0.0ThrGlu: 0.0 ± 0.0
1.153ThrPhe: 1.153 ± 0.791
4.325ThrGly: 4.325 ± 1.361
2.307ThrHis: 2.307 ± 0.498
4.325ThrIle: 4.325 ± 1.218
2.307ThrLys: 2.307 ± 0.448
6.92ThrLeu: 6.92 ± 1.131
0.288ThrMet: 0.288 ± 0.342
2.884ThrAsn: 2.884 ± 0.772
9.227ThrPro: 9.227 ± 1.875
2.884ThrGln: 2.884 ± 0.648
3.46ThrArg: 3.46 ± 0.779
2.307ThrSer: 2.307 ± 1.184
2.307ThrThr: 2.307 ± 1.185
1.73ThrVal: 1.73 ± 0.632
2.884ThrTrp: 2.884 ± 0.613
2.018ThrTyr: 2.018 ± 0.452
0.0ThrXaa: 0.0 ± 0.0
Val
4.037ValAla: 4.037 ± 1.004
1.73ValCys: 1.73 ± 0.249
1.442ValAsp: 1.442 ± 0.629
1.153ValGlu: 1.153 ± 0.386
0.577ValPhe: 0.577 ± 0.492
1.442ValGly: 1.442 ± 0.272
1.73ValHis: 1.73 ± 0.625
2.307ValIle: 2.307 ± 0.787
1.153ValLys: 1.153 ± 0.388
6.055ValLeu: 6.055 ± 1.238
0.288ValMet: 0.288 ± 0.327
0.288ValAsn: 0.288 ± 0.192
1.153ValPro: 1.153 ± 0.626
4.037ValGln: 4.037 ± 1.012
1.442ValArg: 1.442 ± 0.491
6.055ValSer: 6.055 ± 0.97
2.307ValThr: 2.307 ± 0.831
1.73ValVal: 1.73 ± 1.141
1.73ValTrp: 1.73 ± 0.32
0.865ValTyr: 0.865 ± 0.527
0.0ValXaa: 0.0 ± 0.0
Trp
1.153TrpAla: 1.153 ± 0.345
0.288TrpCys: 0.288 ± 0.342
0.865TrpAsp: 0.865 ± 0.4
0.577TrpGlu: 0.577 ± 0.596
0.288TrpPhe: 0.288 ± 0.351
0.865TrpGly: 0.865 ± 0.4
0.577TrpHis: 0.577 ± 0.492
0.577TrpIle: 0.577 ± 0.311
1.153TrpLys: 1.153 ± 0.345
3.172TrpLeu: 3.172 ± 0.903
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
1.153TrpPro: 1.153 ± 0.903
1.442TrpGln: 1.442 ± 0.491
1.442TrpArg: 1.442 ± 0.677
1.73TrpSer: 1.73 ± 0.32
2.595TrpThr: 2.595 ± 0.383
0.865TrpVal: 0.865 ± 0.576
0.0TrpTrp: 0.0 ± 0.0
0.288TrpTyr: 0.288 ± 0.192
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.577TyrAla: 0.577 ± 0.311
0.288TyrCys: 0.288 ± 0.192
0.577TyrAsp: 0.577 ± 0.702
0.577TyrGlu: 0.577 ± 0.311
0.577TyrPhe: 0.577 ± 0.311
0.865TyrGly: 0.865 ± 0.39
0.577TyrHis: 0.577 ± 0.702
0.577TyrIle: 0.577 ± 0.311
1.153TyrLys: 1.153 ± 0.529
3.749TyrLeu: 3.749 ± 0.691
0.288TyrMet: 0.288 ± 0.192
1.442TyrAsn: 1.442 ± 0.4
2.018TyrPro: 2.018 ± 0.702
1.442TyrGln: 1.442 ± 0.368
1.442TyrArg: 1.442 ± 0.486
5.767TyrSer: 5.767 ± 1.717
1.153TyrThr: 1.153 ± 0.967
0.865TyrVal: 0.865 ± 0.4
0.288TyrTrp: 0.288 ± 0.423
1.153TyrTyr: 1.153 ± 0.514
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3469 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski