0%| | 0/4460 [00:00> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:51:27,299 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:51:29,208 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 10.524, 'learning_rate': 1.0000000000000001e-07, 'epoch': 0.0}
[WARNING|modeling_utils.py:388] 2022-03-02 21:51:31,100 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%| | 1/4460 [00:08<10:12:57, 8.25s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:51:33,217 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:51:35,093 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:51:37,555 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 10.5614, 'learning_rate': 2.0000000000000002e-07, 'epoch': 0.0}
[WARNING|modeling_utils.py:388] 2022-03-02 21:51:39,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%| | 2/4460 [00:16<10:16:06, 8.29s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:51:41,482 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:51:43,361 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:51:45,173 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:51:46,984 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%| | 3/4460 [00:23<9:46:14, 7.89s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:51:48,858 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:51:50,649 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:51:52,456 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:51:54,242 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%| | 4/4460 [00:31<9:27:30, 7.64s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:51:56,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:51:57,931 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:51:59,703 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 10.5023, 'learning_rate': 5.000000000000001e-07, 'epoch': 0.01}
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:01,455 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%| | 5/4460 [00:38<9:15:53, 7.49s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:03,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:05,184 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 10.4477, 'learning_rate': 6.000000000000001e-07, 'epoch': 0.01}
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:06,968 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:08,739 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%| | 6/4460 [00:45<9:10:42, 7.42s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:10,644 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:12,357 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:14,138 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 10.4006, 'learning_rate': 7.000000000000001e-07, 'epoch': 0.01}
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:15,861 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%|▏ | 7/4460 [00:52<9:03:19, 7.32s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:17,744 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:19,495 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:21,274 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 10.3794, 'learning_rate': 8.000000000000001e-07, 'epoch': 0.01}
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:23,034 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%|▏ | 8/4460 [01:00<8:59:47, 7.27s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:24,933 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:26,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:28,453 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 10.3246, 'learning_rate': 9e-07, 'epoch': 0.01}
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:30,203 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%|▏ | 9/4460 [01:07<8:57:07, 7.24s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:32,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:33,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:35,562 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 10.2437, 'learning_rate': 1.0000000000000002e-06, 'epoch': 0.01}
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:37,305 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%|▏ | 10/4460 [01:14<8:53:52, 7.20s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:39,175 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:40,864 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:42,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 10.2119, 'learning_rate': 1.1e-06, 'epoch': 0.01}
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:44,273 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%|▏ | 11/4460 [01:21<8:48:34, 7.13s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:46,111 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:47,798 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:49,530 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 10.2233, 'learning_rate': 1.2000000000000002e-06, 'epoch': 0.01}
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:51,241 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%|▏ | 12/4460 [01:28<8:44:46, 7.08s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:53,069 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:54,792 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:56,483 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 10.079, 'learning_rate': 1.3e-06, 'epoch': 0.01}
[WARNING|modeling_utils.py:388] 2022-03-02 21:52:58,199 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%|▏ | 13/4460 [01:35<8:41:56, 7.04s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:00,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:01,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:03,483 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 10.0996, 'learning_rate': 1.4000000000000001e-06, 'epoch': 0.02}
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:05,160 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%|▏ | 14/4460 [01:42<8:40:06, 7.02s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:07,000 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:08,700 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:10,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:12,024 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%|▎ | 15/4460 [01:49<8:36:26, 6.97s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:13,848 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:15,528 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:17,149 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 9.9863, 'learning_rate': 1.6000000000000001e-06, 'epoch': 0.02}
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:18,802 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%|▎ | 16/4460 [01:55<8:32:00, 6.91s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:20,552 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:22,175 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:23,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 9.9272, 'learning_rate': 1.7000000000000002e-06, 'epoch': 0.02}
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:25,458 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%|▎ | 17/4460 [02:02<8:26:13, 6.84s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:27,179 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:28,750 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:30,383 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 9.9023, 'learning_rate': 1.8e-06, 'epoch': 0.02}
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:32,062 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%|▎ | 18/4460 [02:09<8:20:56, 6.77s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:33,850 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:35,466 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:37,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:38,757 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%|▎ | 19/4460 [02:15<8:19:13, 6.74s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:40,511 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:42,180 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:43,796 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 9.8505, 'learning_rate': 2.0000000000000003e-06, 'epoch': 0.02}
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:45,434 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%|▎ | 20/4460 [02:22<8:17:33, 6.72s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:47,128 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:48,757 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:50,332 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 9.8002, 'learning_rate': 2.1000000000000002e-06, 'epoch': 0.02}
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:51,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%|▎ | 21/4460 [02:28<8:12:05, 6.65s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:53,631 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:55,213 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:56,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 9.7338, 'learning_rate': 2.2e-06, 'epoch': 0.02}
[WARNING|modeling_utils.py:388] 2022-03-02 21:53:58,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
0%|▍ | 22/4460 [02:35<8:08:45, 6.61s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:00,104 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:01,657 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:03,263 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 9.7399, 'learning_rate': 2.3e-06, 'epoch': 0.03}
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:04,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
1%|▍ | 23/4460 [02:41<8:05:08, 6.56s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:06,592 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:08,196 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:09,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:11,326 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
1%|▍ | 24/4460 [02:48<8:02:42, 6.53s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:12,995 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:14,562 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:16,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 9.6684, 'learning_rate': 2.5e-06, 'epoch': 0.03}
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:18,307 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:20,034 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:21,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:23,159 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 9.7359, 'learning_rate': 2.6e-06, 'epoch': 0.03}
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:24,701 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
1%|▍ | 26/4460 [03:01<8:06:29, 6.58s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:26,325 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:29,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
1%|▍ | 27/4460 [03:07<7:59:46, 6.49s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:32,612 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:35,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
1%|▍ | 28/4460 [03:14<7:54:31, 6.42s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:38,907 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:41,937 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
1%|▌ | 29/4460 [03:20<7:50:23, 6.37s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:45,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:48,142 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
1%|▌ | 30/4460 [03:26<7:45:14, 6.30s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:51,248 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:54:54,252 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
1%|▌ | 31/4460 [03:32<7:41:09, 6.25s/it]g-point operations will not be computed-02 21:54:51,248 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▌ | 31/4460 [03:32<7:41:09, 6.25s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:54:57,410 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:55:00,343 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:54:57,410 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:55:00,343 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:54:57,410 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▌ | 32/4460 [03:38<7:36:24, 6.18s/it]g-point operations will not be computed-02 21:54:57,410 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▌ | 32/4460 [03:38<7:36:24, 6.18s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:55:03,393 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:55:06,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:55:03,393 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:55:06,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:55:03,393 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▌ | 33/4460 [03:44<7:32:18, 6.13s/it]g-point operations will not be 
computed-02 21:55:03,393 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▌ | 33/4460 [03:44<7:32:18, 6.13s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:55:09,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:55:12,143 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:55:09,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:55:12,143 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:55:09,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▌ | 34/4460 [03:50<7:23:58, 6.02s/it]g-point operations will not be computed-02 21:55:09,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▌ | 34/4460 [03:50<7:23:58, 6.02s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:55:15,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:55:17,893 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:55:15,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:55:17,893 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:55:15,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▌ | 35/4460 [03:56<7:17:47, 5.94s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:55:20,807 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed 1%|▌ | 35/4460 [03:56<7:17:47, 5.94s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:55:20,807 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 36/4460 [04:01<7:11:10, 5.85s/it]g-point operations will not be computed-02 21:55:20,807 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 36/4460 [04:01<7:11:10, 5.85s/it]g-point operations will not be computed-02 21:55:20,807 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 36/4460 [04:01<7:11:10, 5.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:55:26,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:55:29,053 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:55:26,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 37/4460 [04:07<7:02:09, 5.73s/it]g-point operations will not be computed-02 21:55:26,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 37/4460 [04:07<7:02:09, 5.73s/it]g-point operations will not be computed-02 21:55:26,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 37/4460 [04:07<7:02:09, 5.73s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:55:31,817 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:55:34,466 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:55:31,817 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:55:34,466 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:55:31,817 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 38/4460 [04:12<6:53:58, 5.62s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:55:37,169 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 38/4460 [04:12<6:53:58, 5.62s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:55:37,169 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 39/4460 [04:18<6:46:45, 5.52s/it]g-point operations will not be computed-02 21:55:37,169 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 39/4460 [04:18<6:46:45, 5.52s/it]g-point operations will not be computed-02 21:55:37,169 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 39/4460 [04:18<6:46:45, 5.52s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:55:42,405 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:55:44,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:55:42,405 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:55:44,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:55:42,405 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 40/4460 [04:23<6:36:50, 5.39s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:55:47,455 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed 1%|▋ | 40/4460 [04:23<6:36:50, 5.39s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:55:47,455 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 41/4460 [04:28<6:28:03, 5.27s/it]g-point operations will not be computed-02 21:55:47,455 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 41/4460 [04:28<6:28:03, 5.27s/it]g-point operations will not be computed-02 21:55:47,455 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 41/4460 [04:28<6:28:03, 5.27s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:55:52,405 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 42/4460 [04:32<6:16:30, 5.11s/it]g-point operations will not be computed-02 21:55:52,405 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 42/4460 [04:32<6:16:30, 5.11s/it]g-point operations will not be computed-02 21:55:52,405 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 42/4460 [04:32<6:16:30, 5.11s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:55:57,079 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:55:59,195 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:55:57,079 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:55:59,195 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:55:57,079 >> Could not estimate the number of tokens of the input, floating-point operations 
will not be computed 1%|▊ | 43/4460 [04:37<6:00:20, 4.89s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:56:01,370 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:56:03,359 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▊ | 44/4460 [04:41<5:41:28, 4.64s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:56:05,324 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▊ | 45/4460 [04:44<5:19:52, 4.35s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:56:08,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▊ | 46/4460 [04:48<4:56:06, 4.03s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:56:12,072 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:56:13,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:56:14,953 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:56:16,189 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▊ | 48/4460 [04:53<4:08:11, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:56:17,475 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▊ | 49/4460 [04:56<3:46:16, 3.08s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:56:19,829 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▉ | 50/4460 [04:59<3:38:59, 2.98s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:56:24,007 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:56:27,678 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▉ | 51/4460 [05:06<5:20:25, 4.36s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:56:31,442 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:56:35,031 >> Could not
estimate the number of tokens of the input, floating-point operations will not be computed 1%|▉ | 52/4460 [05:13<6:25:17, 5.24s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:56:38,732 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:56:42,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▉ | 53/4460 [05:21<7:12:20, 5.89s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:56:46,070 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:56:49,604 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▉ | 54/4460 [05:28<7:40:54, 6.28s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:56:53,243 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:56:56,799 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▉ | 55/4460 [05:35<8:00:10, 6.54s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:57:00,400 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:57:03,946 >> Could not estimate the number of tokens of the
input, floating-point operations will not be computed 1%|▉ | 56/4460 [05:42<8:13:11, 6.72s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:57:07,559 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:57:11,070 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|█ | 57/4460 [05:49<8:22:30, 6.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:57:14,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:57:18,153 >> Could not estimate the number of tokens of the input, floating-point operations will
not be computed 1%|█ | 58/4460 [05:56<8:26:30, 6.90s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:57:21,701 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:57:25,107 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|█ | 59/4460 [06:03<8:28:11, 6.93s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:57:28,658 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:57:32,104 >> Could not estimate the number of tokens of the input, floating-point operations will
not be computed 1%|█ | 60/4460 [06:10<8:29:01, 6.94s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:57:35,639 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:57:39,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|█ | 61/4460 [06:17<8:27:46, 6.93s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:57:42,493 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:57:45,865 >> Could not estimate the number of tokens of
the input, floating-point operations will not be computed 1%|█ | 62/4460 [06:24<8:28:34, 6.94s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:57:49,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:57:52,760 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|█ | 63/4460 [06:31<8:23:53, 6.88s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:57:56,149 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:57:59,533 >> Could not estimate the number of tokens of
the input, floating-point operations will not be computed 1%|█▏ | 64/4460 [06:38<8:22:29, 6.86s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:58:03,016 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:58:06,305 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|█▏ | 65/4460 [06:44<8:19:51, 6.82s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:58:09,707 >> Could not estimate the number of tokens of the input, floating-point
operations will not be computed 1%|█▏ | 66/4460 [06:51<8:17:07, 6.79s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:58:16,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:58:19,640 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▏ | 67/4460 [06:58<8:13:09, 6.74s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:58:23,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:58:26,251 >> Could not estimate the
number of tokens of the input, floating-point operations will not be computed 2%|█▏ | 68/4460 [07:04<8:09:53, 6.69s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:58:29,579 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:58:32,850 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▏ | 69/4460 [07:11<8:07:36, 6.66s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:58:36,226 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▏ | 70/4460 [07:18<8:05:14, 6.63s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:58:42,814 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:58:46,037 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▎ | 71/4460 [07:24<8:04:32, 6.62s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:58:49,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:58:52,552 >> Could not estimate the number of tokens of the
input, floating-point operations will not be computed 2%|█▎ | 72/4460 [07:31<8:02:04, 6.59s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:58:55,851 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:58:58,976 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▎ | 73/4460 [07:37<7:57:19, 6.53s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:59:02,240 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:59:05,397 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▎ | 74/4460 [07:44<7:56:01, 6.51s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:59:08,669 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 21:59:11,795 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▎ | 75/4460 [07:51<8:05:25, 6.64s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:59:15,687 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:59:18,782 >> Could not estimate the number of tokens of the input, floating-point
operations will not be computed 2%|█▎ | 76/4460 [07:57<7:58:57, 6.56s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:59:21,927 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:59:24,988 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▎ | 77/4460 [08:03<7:51:21, 6.45s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:59:28,119 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:59:31,157 >> Could not estimate the
number of tokens of the input, floating-point operations will not be computed 2%|█▍ | 78/4460 [08:09<7:44:42, 6.36s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:59:34,301 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:59:37,279 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▍ | 79/4460 [08:15<7:38:58, 6.29s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:59:40,390 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:59:43,326 >> Could not estimate the number of tokens of the input, floating-point operations will not be
computed 2%|█▍ | 80/4460 [08:21<7:33:58, 6.22s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:59:46,451 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▍ | 80/4460 [08:21<7:33:58, 6.22s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:59:46,451 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:59:49,384 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:59:46,451 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:59:49,384 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:59:46,451 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▍ | 81/4460 [08:27<7:28:48, 6.15s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:59:52,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▍ | 81/4460 [08:27<7:28:48, 6.15s/it][WARNING|modeling_utils.py:388] 2022-03-02 21:59:52,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:59:56,736 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:59:52,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 21:59:56,736 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:59:52,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 8.6813, 'learning_rate': 8.200000000000001e-06, 'epoch': 
0.09} [WARNING|modeling_utils.py:388] 2022-03-02 21:59:56,736 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:59:52,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:02,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:59:52,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:02,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:59:52,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 8.7635, 'learning_rate': 8.3e-06, 'epoch': 0.09} [WARNING|modeling_utils.py:388] 2022-03-02 22:00:06,867 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:59:52,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▍ | 84/4460 [08:45<7:10:51, 5.91s/it]g-point operations will not be computed-02 21:59:52,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▍ | 84/4460 [08:45<7:10:51, 5.91s/it]g-point operations will not be computed-02 21:59:52,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 8.6311, 'learning_rate': 8.400000000000001e-06, 'epoch': 0.09} [WARNING|modeling_utils.py:388] 2022-03-02 22:00:12,558 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:59:52,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 85/4460 [08:50<7:05:28, 5.84s/it]g-point operations 
will not be computed-02 21:59:52,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 85/4460 [08:50<7:05:28, 5.84s/it]g-point operations will not be computed-02 21:59:52,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:16,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:59:52,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:16,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 21:59:52,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 86/4460 [08:56<6:58:45, 5.74s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:00:20,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 86/4460 [08:56<6:58:45, 5.74s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:00:20,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:23,624 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:20,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:23,624 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:20,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 87/4460 [09:01<6:53:16, 5.67s/it]g-point operations will not be computed-02 22:00:20,936 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:27,714 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:20,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:27,714 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:20,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:27,714 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:20,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 88/4460 [09:07<6:46:24, 5.58s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:00:31,732 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 88/4460 [09:07<6:46:24, 5.58s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:00:31,732 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 88/4460 [09:07<6:46:24, 5.58s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:00:31,732 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:35,526 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:31,732 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:35,526 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:31,732 >> Could not estimate the number 
of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:39,359 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:31,732 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:39,359 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:31,732 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 90/4460 [09:17<6:28:21, 5.33s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 90/4460 [09:17<6:28:21, 5.33s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 90/4460 [09:17<6:28:21, 5.33s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:45,323 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:47,671 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:47,671 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:49,892 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:52,104 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:54,173 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:54,173 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:56,266 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:58,137 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:00:58,137 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:00,103 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:00,103 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:01,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:03,715 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:03,715 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:05,295 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:08,257 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:08,257 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:09,716 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:09,716 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:12,276 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:12,276 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:13,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:13,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-02 22:01:16,167 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:16,167 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:20,186 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:20,186 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:20,186 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:23,823 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:23,823 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 
2022-03-02 22:01:27,537 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:27,537 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:27,537 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 102/4460 [10:08<6:23:56, 5.29s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:34,761 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:34,761 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:34,761 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 103/4460 [10:15<7:04:14, 5.84s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed 2%|█▊ | 103/4460 [10:15<7:04:14, 5.84s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 103/4460 [10:15<7:04:14, 5.84s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 103/4460 [10:15<7:04:14, 5.84s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 103/4460 [10:15<7:04:14, 5.84s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 104/4460 [10:22<7:32:44, 6.24s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:49,087 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:01:49,087 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 105/4460 [10:29<7:52:25, 6.51s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 105/4460 [10:29<7:52:25, 6.51s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed {'loss': 8.4726, 'learning_rate': 1.05e-05, 'epoch': 0.12} 2%|█▊ | 105/4460 [10:29<7:52:25, 6.51s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 105/4460 [10:29<7:52:25, 6.51s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 105/4460 [10:29<7:52:25, 6.51s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 106/4460 [10:36<8:06:06, 6.70s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:02:03,275 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:02:03,275 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:02:03,275 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 107/4460 [10:43<8:10:47, 6.76s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 107/4460 [10:43<8:10:47, 6.76s/it]g-point 
operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 107/4460 [10:43<8:10:47, 6.76s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:02:13,645 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:02:13,645 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 8.5058, 'learning_rate': 1.08e-05, 'epoch': 0.12} [WARNING|modeling_utils.py:388] 2022-03-02 22:02:13,645 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:02:13,645 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:02:13,645 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▉ | 109/4460 [10:57<8:15:29, 6.83s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:02:24,005 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:02:24,005 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▉ | 110/4460 [11:04<8:17:09, 6.86s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▉ | 110/4460 [11:04<8:17:09, 6.86s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 8.4654, 'learning_rate': 1.1000000000000001e-05, 'epoch': 0.12} 2%|█▉ | 110/4460 [11:04<8:17:09, 6.86s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:02:34,183 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:02:34,183 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 8.4735, 'learning_rate': 1.11e-05, 'epoch': 0.12} [WARNING|modeling_utils.py:388] 2022-03-02 22:02:34,183 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:02:34,183 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:02:34,183 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|█▉ | 112/4460 [11:17<8:14:41, 6.83s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|█▉ | 112/4460 [11:17<8:14:41, 6.83s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:02:46,061 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|█▉ | 113/4460 [11:24<8:11:45, 6.79s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|█▉ | 113/4460 [11:24<8:11:45, 6.79s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 8.3146, 'learning_rate': 1.13e-05, 'epoch': 0.13} 3%|█▉ | 113/4460 [11:24<8:11:45, 6.79s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:02:54,396 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:02:54,396 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 8.3611, 'learning_rate': 1.1400000000000001e-05, 'epoch': 0.13} [WARNING|modeling_utils.py:388] 2022-03-02 22:02:54,396 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:02:54,396 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:02:54,396 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██ | 115/4460 [11:38<8:09:00, 6.75s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:03:04,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:03:04,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██ | 116/4460 [11:44<8:06:40, 6.72s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██ | 116/4460 [11:44<8:06:40, 6.72s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:03:11,108 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:03:11,108 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██ | 117/4460 [11:51<8:02:56, 6.67s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██ | 117/4460 [11:51<8:02:56, 6.67s/it]g-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 8.3655, 'learning_rate': 1.1700000000000001e-05, 'epoch': 0.13} [WARNING|modeling_utils.py:388] 2022-03-02 22:03:19,291 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:00:41,830 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed 3%|██ | 118/4460 [11:57<8:00:44, 6.64s/it] {'loss': 8.345, 'learning_rate': 1.18e-05, 'epoch': 0.13} 3%|██ | 118/4460 [11:57<8:00:44, 6.64s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:03:27,385 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 8.3265, 'learning_rate': 1.19e-05, 'epoch': 0.13} [WARNING|modeling_utils.py:388] 2022-03-02 22:03:27,385 >> Could not estimate the number of tokens of
the input, floating-point operations will not be computed 3%|██ | 120/4460 [12:10<7:56:11, 6.58s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:03:37,287 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██ | 121/4460 [12:17<7:55:04, 6.57s/it] {'loss': 8.2575, 'learning_rate': 1.2100000000000001e-05, 'epoch': 0.14} [WARNING|modeling_utils.py:388] 2022-03-02 22:03:45,365 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▏ | 122/4460 [12:23<7:52:57, 6.54s/it]
{'loss': 8.3287, 'learning_rate': 1.22e-05, 'epoch': 0.14} [WARNING|modeling_utils.py:388] 2022-03-02 22:03:51,756 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▏ | 123/4460 [12:30<7:49:12, 6.49s/it] {'loss': 8.1648, 'learning_rate': 1.23e-05, 'epoch': 0.14} 3%|██▏ | 123/4460 [12:30<7:49:12, 6.49s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:03:59,730 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 8.2166, 'learning_rate': 1.24e-05, 'epoch': 0.14} [WARNING|modeling_utils.py:388] 2022-03-02 22:03:59,730 >> Could not estimate the number of tokens of the input, floating-point operations will not
be computed 3%|██▏ | 125/4460 [12:43<7:54:40, 6.57s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:04:08,263 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 8.3121, 'learning_rate': 1.25e-05, 'epoch': 0.14} 3%|██▏ | 125/4460 [12:43<7:54:40, 6.57s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:04:08,263 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▏ | 126/4460 [12:50<7:51:58, 6.53s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:04:08,263 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:04:16,222 >> Could not
estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▏ | 127/4460 [12:56<7:46:50, 6.46s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:04:22,420 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▏ | 128/4460 [13:02<7:40:00, 6.37s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:04:28,565 >> Could not estimate the number of tokens of the input, floating-point operations
will not be computed 3%|██▎ | 129/4460 [13:08<7:33:27, 6.28s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:04:34,620 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▎ | 130/4460 [13:14<7:28:23, 6.21s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:04:40,640 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▎ | 131/4460 [13:20<7:23:31, 6.15s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:04:45,163 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▎ | 132/4460 [13:26<7:20:11, 6.10s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:04:51,116 >> Could not estimate the number of tokens of the input, floating-point operations
will not be computed 3%|██▎ | 133/4460 [13:32<7:14:42, 6.03s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:04:56,914 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:05:01,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:05:05,360 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▎ | 135/4460 [13:43<7:00:41, 5.84s/it] [WARNING|modeling_utils.py:388] 2022-03-02
22:05:09,583 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▍ | 136/4460 [13:49<6:55:12, 5.76s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:05:13,770 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:05:17,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 8.1425, 'learning_rate': 1.3700000000000001e-05, 'epoch': 0.15} [WARNING|modeling_utils.py:388] 2022-03-02 22:05:21,790 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▍ | 138/4460 [14:00<6:40:10, 5.56s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:05:25,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:05:28,144 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 8.0094, 'learning_rate': 1.3900000000000002e-05, 'epoch': 0.16} [WARNING|modeling_utils.py:388] 2022-03-02 22:05:31,807 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▍ | 140/4460 [14:10<6:17:40, 5.25s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:05:34,248 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:05:36,458 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▍ | 141/4460 [14:14<6:01:57, 5.03s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:05:38,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 8.1178, 'learning_rate': 1.4099999999999999e-05, 'epoch': 0.16} [WARNING|modeling_utils.py:388] 2022-03-02 22:05:41,760 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:05:43,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:05:45,738 >> Could not estimate
the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:05:47,665 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:05:49,387 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:05:52,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:05:54,479 >> Could not estimate the number of tokens of the input,
floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:05:55,982 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:05:58,873 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:06:00,373 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:06:02,889 >> Could not estimate the number of tokens of the input, floating-point operations will not be
computed [WARNING|modeling_utils.py:388] 2022-03-02 22:06:04,000 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:06:06,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.2391, 'learning_rate': 1.5e-05, 'epoch': 0.17} [WARNING|modeling_utils.py:388] 2022-03-02 22:06:10,595 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:06:14,276 >> Could not estimate the number of tokens of the input, floating-point
operations will not be computed {'loss': 8.4578, 'learning_rate': 1.51e-05, 'epoch': 0.17} [WARNING|modeling_utils.py:388] 2022-03-02 22:06:17,989 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:06:21,561 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:06:25,203 >> Could not estimate the number of tokens of the
input, floating-point operations will not be computed 3%|██▋ | 153/4460 [15:05<6:56:23, 5.80s/it] {'loss': 8.1074, 'learning_rate': 1.53e-05, 'epoch': 0.17} 3%|██▋ | 153/4460 [15:05<6:56:23, 5.80s/it] 3%|██▋ | 154/4460 [15:12<7:25:32, 6.21s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:06:39,488 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▋ | 155/4460 [15:20<7:45:21, 6.49s/it]
{'loss': 7.8258, 'learning_rate': 1.55e-05, 'epoch': 0.17} 3%|██▋ | 155/4460 [15:20<7:45:21, 6.49s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:06:50,088 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:06:53,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▋ | 157/4460 [15:34<8:04:09,
6.75s/it]g-point operations will not be computed-02 22:05:38,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▋ | 157/4460 [15:34<8:04:09, 6.75s/it]g-point operations will not be computed-02 22:05:38,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.977, 'learning_rate': 1.5700000000000002e-05, 'epoch': 0.18} 4%|██▋ | 157/4460 [15:34<8:04:09, 6.75s/it]g-point operations will not be computed-02 22:05:38,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▋ | 157/4460 [15:34<8:04:09, 6.75s/it]g-point operations will not be computed-02 22:05:38,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:07:04,037 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:05:38,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:07:04,037 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:05:38,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:07:04,037 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:05:38,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:07:04,037 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:05:38,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-02 22:07:04,037 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:05:38,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 159/4460 [15:47<8:10:06, 6.84s/it]g-point operations will not be computed-02 22:05:38,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 159/4460 [15:47<8:10:06, 6.84s/it]g-point operations will not be computed-02 22:05:38,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:07:16,090 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:05:38,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:07:16,090 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:05:38,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 160/4460 [15:54<8:10:33, 6.84s/it]g-point operations will not be computed-02 22:05:38,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 160/4460 [15:54<8:10:33, 6.84s/it]g-point operations will not be computed-02 22:05:38,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 160/4460 [15:54<8:10:33, 6.84s/it]g-point operations will not be computed-02 22:05:38,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 160/4460 [15:54<8:10:33, 6.84s/it]g-point operations will not be computed-02 22:05:38,675 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed 4%|██▊ | 160/4460 [15:54<8:10:33, 6.84s/it]g-point operations will not be computed-02 22:05:38,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 161/4460 [16:01<8:10:37, 6.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 161/4460 [16:01<8:10:37, 6.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 161/4460 [16:01<8:10:37, 6.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 161/4460 [16:01<8:10:37, 6.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 162/4460 [16:08<8:08:12, 6.82s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 162/4460 [16:08<8:08:12, 6.82s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:07:36,561 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:07:36,561 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 4%|██▊ | 163/4460 [16:15<8:08:15, 6.82s/it]g-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 163/4460 [16:15<8:08:15, 6.82s/it]g-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 163/4460 [16:15<8:08:15, 6.82s/it]g-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:07:44,903 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:07:44,903 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.735, 'learning_rate': 1.6400000000000002e-05, 'epoch': 0.18} [WARNING|modeling_utils.py:388] 2022-03-02 22:07:44,903 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:07:44,903 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:07:44,903 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 165/4460 [16:28<8:03:08, 6.75s/it]g-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:07:54,912 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:07:54,912 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:07:54,912 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 166/4460 [16:35<7:59:55, 6.71s/it]g-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 166/4460 [16:35<7:59:55, 6.71s/it]g-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:08:03,128 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:08:03,128 >> Could not estimate the number of tokens of the input, floating-point operations 
will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 167/4460 [16:41<7:56:54, 6.67s/it]g-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 167/4460 [16:41<7:56:54, 6.67s/it]g-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 167/4460 [16:41<7:56:54, 6.67s/it]g-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:08:11,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:08:11,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.7084, 'learning_rate': 1.6800000000000002e-05, 'epoch': 0.19} [WARNING|modeling_utils.py:388] 2022-03-02 22:08:11,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:08:17,911 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:08:17,911 
>> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.6877, 'learning_rate': 1.69e-05, 'epoch': 0.19} [WARNING|modeling_utils.py:388] 2022-03-02 22:08:21,246 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:08:21,246 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:08:21,246 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 170/4460 [17:01<7:50:54, 6.59s/it]g-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:08:27,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:08:27,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:08:27,704 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 171/4460 [17:07<7:48:24, 6.55s/it]g-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 171/4460 [17:07<7:48:24, 6.55s/it]g-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:08:35,759 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 172/4460 [17:14<7:45:59, 6.52s/it]g-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 172/4460 [17:14<7:45:59, 6.52s/it]g-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.7142, 'learning_rate': 1.7199999999999998e-05, 'epoch': 0.19} [WARNING|modeling_utils.py:388] 2022-03-02 22:08:42,144 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 173/4460 [17:20<7:42:30, 6.47s/it]g-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 173/4460 [17:20<7:42:30, 6.47s/it]g-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations 
will not be computed {'loss': 7.6729, 'learning_rate': 1.73e-05, 'epoch': 0.19} [WARNING|modeling_utils.py:388] 2022-03-02 22:08:48,411 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 174/4460 [17:26<7:37:52, 6.41s/it]g-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 174/4460 [17:26<7:37:52, 6.41s/it]g-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.6169, 'learning_rate': 1.74e-05, 'epoch': 0.2} [WARNING|modeling_utils.py:388] 2022-03-02 22:08:54,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:08:54,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:07:26,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 175/4460 [17:33<7:46:20, 6.53s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:08:58,530 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 175/4460 [17:33<7:46:20, 6.53s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:08:58,530 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 175/4460 [17:33<7:46:20, 6.53s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:08:58,530 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ 
| 175/4460 [17:33<7:46:20, 6.53s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:08:58,530 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 176/4460 [17:40<7:42:02, 6.47s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:09:04,737 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 176/4460 [17:40<7:42:02, 6.47s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:09:04,737 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 176/4460 [17:40<7:42:02, 6.47s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:09:04,737 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 176/4460 [17:40<7:42:02, 6.47s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:09:04,737 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 177/4460 [17:46<7:34:18, 6.36s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:09:10,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 177/4460 [17:46<7:34:18, 6.36s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:09:10,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 177/4460 [17:46<7:34:18, 6.36s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:09:10,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 177/4460 [17:46<7:34:18, 6.36s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:09:10,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 178/4460 [17:52<7:29:04, 6.29s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:09:10,865 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:09:18,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:09:10,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:09:18,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:09:10,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:09:18,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:09:10,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 179/4460 [17:58<7:23:10, 6.21s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:09:22,969 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 179/4460 [17:58<7:23:10, 6.21s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:09:22,969 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 179/4460 [17:58<7:23:10, 6.21s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:09:22,969 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 179/4460 [17:58<7:23:10, 6.21s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:09:22,969 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 180/4460 [18:04<7:20:24, 6.17s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:09:29,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 180/4460 [18:04<7:20:24, 6.17s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:09:29,025 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 180/4460 [18:04<7:20:24, 6.17s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:09:29,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 180/4460 [18:04<7:20:24, 6.17s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:09:29,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 181/4460 [18:10<7:16:13, 6.12s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:09:35,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 181/4460 [18:10<7:16:13, 6.12s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:09:35,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:09:39,399 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:09:35,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:09:39,399 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:09:35,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.4313, 'learning_rate': 1.8200000000000002e-05, 'epoch': 0.2} [WARNING|modeling_utils.py:388] 2022-03-02 22:09:39,399 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:09:35,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:09:45,209 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 
22:09:35,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:09:45,209 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:09:35,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.4639, 'learning_rate': 1.83e-05, 'epoch': 0.21} [WARNING|modeling_utils.py:388] 2022-03-02 22:09:49,502 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:09:35,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 184/4460 [18:27<7:00:29, 5.90s/it]g-point operations will not be computed-02 22:09:35,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 184/4460 [18:27<7:00:29, 5.90s/it]g-point operations will not be computed-02 22:09:35,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:09:53,737 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:09:35,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:09:53,737 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:09:35,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 185/4460 [18:33<6:53:25, 5.80s/it]g-point operations will not be computed-02 22:09:35,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 185/4460 [18:33<6:53:25, 5.80s/it]g-point operations will not be computed-02 
22:09:35,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:09:59,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:09:35,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:09:59,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:09:35,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:09:59,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:09:35,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▎ | 186/4460 [18:39<6:47:32, 5.72s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:10:03,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▎ | 186/4460 [18:39<6:47:32, 5.72s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:10:03,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:10:07,482 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:10:03,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:10:07,482 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:10:03,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.4953, 'learning_rate': 1.87e-05, 'epoch': 0.21} 
[WARNING|modeling_utils.py:388] 2022-03-02 22:10:11,580 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:10:03,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▎ | 188/4460 [18:49<6:36:12, 5.56s/it]g-point operations will not be computed-02 22:10:03,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▎ | 188/4460 [18:49<6:36:12, 5.56s/it]g-point operations will not be computed-02 22:10:03,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:10:15,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:10:03,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:10:18,073 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:10:03,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:10:18,073 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:10:03,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.3922, 'learning_rate': 1.8900000000000002e-05, 'epoch': 0.21} [WARNING|modeling_utils.py:388] 2022-03-02 22:10:21,924 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:10:03,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▎ | 190/4460 [19:00<6:20:29, 5.35s/it]g-point operations will not be computed-02 22:10:03,461 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▎ | 190/4460 [19:00<6:20:29, 5.35s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:10:25,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:10:28,037 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:10:30,395 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▎ | 192/4460 [19:09<5:58:09, 5.04s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:10:33,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:10:35,956 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▍ | 193/4460 [19:13<5:42:34, 4.82s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:10:38,097 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:10:40,005 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▍ | 194/4460 [19:17<5:23:48, 4.55s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:10:41,938 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▍ | 195/4460 [19:21<5:05:31, 4.30s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:10:45,601 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▍ | 196/4460 [19:24<4:45:32, 4.02s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:10:48,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:10:50,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:10:51,724 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:10:52,990 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▍ | 198/4460 [19:30<4:00:25, 3.38s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:10:54,270 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▍ | 199/4460 [19:32<3:38:13, 3.07s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:10:56,587 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▍ | 200/4460 [19:35<3:31:26, 2.98s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:11:00,729 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:11:04,433 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|███▌ | 201/4460 [19:43<5:08:54, 4.35s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:11:08,146 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:11:11,757 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|███▌ | 202/4460 [19:50<6:11:49, 5.24s/it]
{'loss': 7.4642, 'learning_rate': 2.0200000000000003e-05, 'epoch': 0.23}
[WARNING|modeling_utils.py:388] 2022-03-02 22:11:19,051 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|███▌ | 203/4460 [19:57<6:55:11, 5.85s/it]
5%|███▌ | 204/4460 [20:05<7:25:00, 6.27s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:11:33,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|███▌ | 205/4460 [20:12<7:43:50, 6.54s/it]
5%|███▌ | 206/4460 [20:19<7:55:02, 6.70s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:11:47,759 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|███▌ | 207/4460 [20:26<8:04:41, 6.84s/it]
{'loss': 7.3341, 'learning_rate': 2.07e-05, 'epoch': 0.23}
5%|███▋ | 208/4460 [20:33<8:09:01, 6.90s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:12:00,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|███▋ | 209/4460 [20:40<8:09:20, 6.91s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:12:10,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 7.2414, 'learning_rate': 2.1e-05, 'epoch': 0.24}
5%|███▋ | 211/4460 [20:54<8:08:31, 6.90s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:12:20,712 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|███▋ | 212/4460 [21:01<8:05:44, 6.86s/it]
{'loss': 7.1922, 'learning_rate': 2.12e-05, 'epoch': 0.24}
5%|███▋ | 213/4460 [21:07<8:03:19, 6.83s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:12:32,533 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|███▋ | 214/4460 [21:14<8:00:39, 6.79s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:12:40,934 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|███▊ | 215/4460 [21:21<7:57:55, 6.76s/it]
{'loss': 7.2171, 'learning_rate': 2.15e-05, 'epoch': 0.24}
[WARNING|modeling_utils.py:388] 2022-03-02 22:12:49,254 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|███▊ | 216/4460 [21:27<7:57:58, 6.76s/it]
{'loss': 7.2786, 'learning_rate': 2.16e-05, 'epoch': 0.24}
[WARNING|modeling_utils.py:388] 2022-03-02 22:12:57,547 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 7.1871, 'learning_rate': 2.1700000000000002e-05, 'epoch': 0.24}
5%|███▊ | 218/4460 [21:41<7:52:51, 6.69s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:13:05,888 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 7.1653, 'learning_rate': 2.18e-05, 'epoch': 0.24}
5%|███▊ | 219/4460 [21:47<7:48:54, 6.63s/it]
{'loss': 7.1119, 'learning_rate': 2.19e-05, 'epoch': 0.25}
[WARNING|modeling_utils.py:388] 2022-03-02 22:13:15,692 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|███▊ | 220/4460 [21:54<7:49:05, 6.64s/it]
{'loss': 7.3016, 'learning_rate': 2.2000000000000003e-05, 'epoch': 0.25}
[WARNING|modeling_utils.py:388] 2022-03-02 22:13:23,864 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 7.2058, 'learning_rate': 2.2100000000000002e-05, 'epoch': 0.25}
[WARNING|modeling_utils.py:388] 2022-03-02 22:13:30,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 7.1048, 'learning_rate': 2.22e-05, 'epoch': 0.25}
5%|███▉ | 223/4460 [22:13<7:43:10, 6.56s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:13:40,140 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|███▉ | 224/4460 [22:20<7:39:17, 6.51s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:13:46,466 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|███▉ | 225/4460 [22:27<7:46:10, 6.60s/it]
{'loss': 7.1367, 'learning_rate': 2.25e-05, 'epoch': 0.25}
[WARNING|modeling_utils.py:388] 2022-03-02 22:13:56,490 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 7.2791, 'learning_rate': 2.26e-05, 'epoch': 0.25}
[WARNING|modeling_utils.py:388] 2022-03-02 22:14:02,717 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 7.1357, 'learning_rate': 2.2700000000000003e-05, 'epoch': 0.25}
[WARNING|modeling_utils.py:388] 2022-03-02 22:14:08,994 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:14:15,071 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:14:21,178 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:14:25,695 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|████ | 231/4460 [23:04<7:15:05, 6.17s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:14:31,700 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|████ | 232/4460 [23:10<7:11:06, 6.12s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:14:37,655 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|████ | 233/4460 [23:16<7:08:53, 6.09s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:14:42,196 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|████ | 234/4460 [23:22<7:02:46, 6.00s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:14:47,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|████ | 235/4460 [23:27<6:55:13, 5.90s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:14:52,134 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|████ | 235/4460 [23:27<6:55:13, 5.90s/it][WARNING|modeling_utils.py:388] 2022-03-02
22:14:52,134 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:14:56,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0119, 'learning_rate': 2.36e-05, 'epoch': 0.26} [WARNING|modeling_utils.py:388] 2022-03-02 22:15:01,604 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:15:04,355 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed 5%|████▏ | 238/4460 [23:43<6:32:06, 5.57s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:15:08,347 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:15:12,187 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0406, 'learning_rate': 2.39e-05, 'epoch': 0.27} [WARNING|modeling_utils.py:388] 2022-03-02 22:15:16,069 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed 5%|████▏ | 240/4460 [23:54<6:17:09, 5.36s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:15:18,629 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:15:22,115 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:15:24,450 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:15:26,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:15:28,773 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:15:30,744 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:15:32,773 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:15:34,595 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:15:36,501 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:15:38,223 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-02 22:15:39,960 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:15:41,509 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:15:44,522 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 
2022-03-02 22:15:47,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:15:48,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:15:50,785 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:15:52,353 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6483, 'learning_rate': 2.5e-05, 'epoch': 0.28} [WARNING|modeling_utils.py:388] 2022-03-02 22:15:56,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-02 22:15:59,947 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:16:03,680 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:16:07,246 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.3796, 'learning_rate': 
2.5200000000000003e-05, 'epoch': 0.28} 6%|████▍ | 253/4460 [24:51<6:49:44, 5.84s/it] {'loss': 7.2914, 'learning_rate': 2.5300000000000002e-05, 'epoch': 0.28} {'loss': 7.3444, 'learning_rate': 2.54e-05, 'epoch': 0.28} [WARNING|modeling_utils.py:388] 2022-03-02 22:16:21,672 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed 6%|████▍ | 255/4460 [25:05<7:35:59, 6.51s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:16:35,838 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.2058, 'learning_rate': 2.5600000000000002e-05, 'epoch': 0.29} 
6%|████▍ | 257/4460 [25:19<7:55:10, 6.78s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:16:48,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 258/4460 [25:26<8:01:01, 6.87s/it] {'loss': 7.1093, 'learning_rate': 2.58e-05, 'epoch': 0.29} 
6%|████▌ | 259/4460 [25:33<8:03:40, 6.91s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:17:00,476 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 260/4460 [25:40<8:04:33, 6.92s/it] {'loss': 7.1595, 'learning_rate': 2.6000000000000002e-05, 'epoch': 0.29} 6%|████▌ | 261/4460 [25:47<8:02:36, 6.90s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:17:12,530 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 262/4460 [25:54<8:01:13, 6.88s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:17:21,114 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed 6%|████▌ | 263/4460 [26:01<8:01:06, 6.88s/it] {'loss': 7.0958, 'learning_rate': 2.6300000000000002e-05, 'epoch': 0.29} 6%|████▌ | 264/4460 [26:08<7:58:03, 6.84s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:17:32,958 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 
265/4460 [26:14<7:54:51, 6.79s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:17:41,303 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 266/4460 [26:21<7:53:40, 6.78s/it] {'loss': 7.1614, 'learning_rate': 2.6600000000000003e-05, 'epoch': 0.3} [WARNING|modeling_utils.py:388] 2022-03-02 22:17:51,319 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed {'loss': 7.1645, 'learning_rate': 2.6700000000000002e-05, 'epoch': 0.3} 6%|████▋ | 268/4460 [26:35<7:50:53, 6.74s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:18:01,379 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 269/4460 [26:41<7:47:12, 6.69s/it] {'loss': 7.0047, 'learning_rate': 2.6900000000000003e-05, 'epoch': 0.3} [WARNING|modeling_utils.py:388] 2022-03-02 22:18:09,580 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 270/4460 [26:48<7:45:37, 6.67s/it] {'loss': 7.0709, 'learning_rate': 2.7000000000000002e-05, 'epoch': 0.3} [WARNING|modeling_utils.py:388] 2022-03-02 22:18:16,137 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 271/4460 [26:54<7:41:58, 6.62s/it] {'loss': 6.9883, 'learning_rate': 2.7100000000000005e-05, 'epoch': 0.3} 6%|████▊ | 272/4460 [27:01<7:39:23, 6.58s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:18:25,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 273/4460 [27:07<7:36:21, 6.54s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:18:33,940 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 274/4460 [27:14<7:34:26, 6.51s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:18:40,363 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 275/4460 [27:21<7:43:04, 6.64s/it] {'loss': 7.1359, 'learning_rate': 2.7500000000000004e-05, 'epoch': 0.31} [WARNING|modeling_utils.py:388] 2022-03-02 22:18:50,412 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed {'loss': 6.9874, 'learning_rate': 2.7600000000000003e-05, 'epoch': 0.31} [WARNING|modeling_utils.py:388] 2022-03-02 22:18:50,412 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:18:25,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:18:56,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:18:25,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:18:56,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:18:25,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0387, 'learning_rate': 2.7700000000000002e-05, 'epoch': 0.31} [WARNING|modeling_utils.py:388] 2022-03-02 22:18:56,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:18:25,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:19:02,937 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:18:25,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:19:02,937 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:18:25,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9855, 'learning_rate': 2.7800000000000005e-05, 'epoch': 0.31} [WARNING|modeling_utils.py:388] 2022-03-02 
22:19:02,937 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:18:25,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:19:02,937 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:18:25,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:19:02,937 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:18:25,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 279/4460 [27:46<7:22:04, 6.34s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:19:10,719 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 279/4460 [27:46<7:22:04, 6.34s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:19:10,719 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 279/4460 [27:46<7:22:04, 6.34s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:19:10,719 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 279/4460 [27:46<7:22:04, 6.34s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:19:10,719 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 280/4460 [27:52<7:17:01, 6.27s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:19:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 280/4460 [27:52<7:17:01, 6.27s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:19:16,854 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:19:21,308 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:19:21,308 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8919, 'learning_rate': 2.8100000000000005e-05, 'epoch': 0.32} [WARNING|modeling_utils.py:388] 2022-03-02 22:19:21,308 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:19:27,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:19:27,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0904, 'learning_rate': 2.8199999999999998e-05, 'epoch': 0.32} [WARNING|modeling_utils.py:388] 2022-03-02 22:19:27,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:19:33,145 >> Could not estimate the number of tokens of the input, floating-point operations 
will not be computed-02 22:19:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:19:33,145 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8346, 'learning_rate': 2.83e-05, 'epoch': 0.32} [WARNING|modeling_utils.py:388] 2022-03-02 22:19:37,451 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 284/4460 [28:15<6:54:37, 5.96s/it]g-point operations will not be computed-02 22:19:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 284/4460 [28:15<6:54:37, 5.96s/it]g-point operations will not be computed-02 22:19:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:19:41,809 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:19:41,809 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 285/4460 [28:21<6:49:43, 5.89s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:19:46,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 285/4460 [28:21<6:49:43, 
5.89s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:19:46,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0456, 'learning_rate': 2.8499999999999998e-05, 'epoch': 0.32} [WARNING|modeling_utils.py:388] 2022-03-02 22:19:50,156 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:46,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:19:50,156 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:46,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8849, 'learning_rate': 2.86e-05, 'epoch': 0.32} [WARNING|modeling_utils.py:388] 2022-03-02 22:19:54,315 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:46,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████ | 287/4460 [28:32<6:36:21, 5.70s/it]g-point operations will not be computed-02 22:19:46,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████ | 287/4460 [28:32<6:36:21, 5.70s/it]g-point operations will not be computed-02 22:19:46,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0776, 'learning_rate': 2.87e-05, 'epoch': 0.32} [WARNING|modeling_utils.py:388] 2022-03-02 22:19:59,657 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:46,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:19:59,657 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed-02 22:19:46,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████ | 288/4460 [28:37<6:27:49, 5.58s/it]g-point operations will not be computed-02 22:19:46,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:03,572 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:46,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:03,572 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:46,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:06,051 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:46,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:06,051 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:46,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:09,810 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:46,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:09,810 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:19:46,075 >> Could not estimate the number of tokens 
of the input, floating-point operations will not be computed 7%|█████ | 290/4460 [28:47<6:07:08, 5.28s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:14,481 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:14,481 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████ | 291/4460 [28:52<5:53:43, 5.09s/it]g-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:17,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:17,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:20,053 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:22,224 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:22,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:24,221 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:26,239 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:26,239 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:28,066 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:29,986 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:29,986 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 
22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:31,697 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:34,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:34,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:36,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:36,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:37,818 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:37,818 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:40,453 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:42,788 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:42,788 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:43,980 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:43,980 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:45,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:45,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:49,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:53,231 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:53,231 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.1777, 'learning_rate': 3.01e-05, 'epoch': 0.34} [WARNING|modeling_utils.py:388] 2022-03-02 22:20:56,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:56,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:20:56,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:21:00,529 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:21:00,529 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:21:00,529 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:21:00,529 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:21:00,529 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 303/4460 [29:44<6:46:55, 5.87s/it]g-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:21:11,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:21:11,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 304/4460 [29:52<7:14:42, 
6.28s/it]g-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 304/4460 [29:52<7:14:42, 6.28s/it]g-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.1978, 'learning_rate': 3.04e-05, 'epoch': 0.34} 7%|█████▎ | 304/4460 [29:52<7:14:42, 6.28s/it]g-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 304/4460 [29:52<7:14:42, 6.28s/it]g-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 304/4460 [29:52<7:14:42, 6.28s/it]g-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 305/4460 [29:59<7:32:12, 6.53s/it]g-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:21:25,816 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:21:25,816 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 306/4460 [30:06<7:43:48, 6.70s/it]g-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 7%|█████▎ | 306/4460 [30:06<7:43:48, 6.70s/it]g-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0552, 'learning_rate': 3.06e-05, 'epoch': 0.34} 7%|█████▎ | 306/4460 [30:06<7:43:48, 6.70s/it]g-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 306/4460 [30:06<7:43:48, 6.70s/it]g-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 306/4460 [30:06<7:43:48, 6.70s/it]g-point operations will not be computed-02 22:20:12,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 307/4460 [30:13<7:49:58, 6.79s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:21:38,164 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 307/4460 [30:13<7:49:58, 6.79s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:21:38,164 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 307/4460 [30:13<7:49:58, 6.79s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:21:38,164 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 307/4460 [30:13<7:49:58, 6.79s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:21:38,164 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 308/4460 [30:20<7:56:00, 6.88s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:21:38,164 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 308/4460 [30:20<7:56:00, 
6.88s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:21:38,164 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 308/4460 [30:20<7:56:00, 6.88s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:21:38,164 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 308/4460 [30:20<7:56:00, 6.88s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:21:38,164 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 308/4460 [30:20<7:56:00, 6.88s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:21:38,164 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 309/4460 [30:27<7:59:05, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:21:52,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 309/4460 [30:27<7:59:05, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:21:52,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 309/4460 [30:27<7:59:05, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:21:52,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 310/4460 [30:34<8:00:46, 6.95s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:21:52,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 310/4460 [30:34<8:00:46, 6.95s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:21:52,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0675, 'learning_rate': 3.1e-05, 'epoch': 0.35} [WARNING|modeling_utils.py:388] 2022-03-02 22:22:02,621 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 7%|█████▍ | 311/4460 [30:41<7:58:38, 6.92s/it] {'loss': 7.1561, 'learning_rate': 3.1100000000000004e-05, 'epoch': 0.35} 7%|█████▍ | 312/4460 [30:48<7:58:21, 6.92s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:22:14,878 >> Could not estimate the number of tokens of the input,
floating-point operations will not be computed 7%|█████▍ | 313/4460 [30:55<8:01:39, 6.97s/it] {'loss': 7.0917, 'learning_rate': 3.13e-05, 'epoch': 0.35} [WARNING|modeling_utils.py:388] 2022-03-02 22:22:25,097 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0052, 'learning_rate': 3.1400000000000004e-05, 'epoch': 0.35} [WARNING|modeling_utils.py:388] 2022-03-02 22:22:25,097 >> Could not estimate the
number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 315/4460 [31:08<7:53:23, 6.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:22:33,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 316/4460 [31:15<7:50:25, 6.81s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:22:41,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 317/4460 [31:22<7:47:14, 6.77s/it]
{'loss': 6.9869, 'learning_rate': 3.1700000000000005e-05, 'epoch': 0.36} [WARNING|modeling_utils.py:388] 2022-03-02 22:22:51,798 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9017, 'learning_rate': 3.18e-05, 'epoch': 0.36} [WARNING|modeling_utils.py:388] 2022-03-02 22:22:58,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8957, 'learning_rate': 3.19e-05, 'epoch': 0.36}
7%|█████▌ | 320/4460 [31:42<7:39:31, 6.66s/it] {'loss': 7.0838, 'learning_rate': 3.2000000000000005e-05, 'epoch': 0.36} [WARNING|modeling_utils.py:388] 2022-03-02 22:23:09,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 321/4460 [31:48<7:37:29, 6.63s/it] {'loss': 7.006, 'learning_rate': 3.21e-05, 'epoch': 0.36}
[WARNING|modeling_utils.py:388] 2022-03-02 22:23:18,108 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9025, 'learning_rate': 3.2200000000000003e-05, 'epoch': 0.36} [WARNING|modeling_utils.py:388] 2022-03-02 22:23:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9927, 'learning_rate': 3.2300000000000006e-05, 'epoch': 0.36}
[WARNING|modeling_utils.py:388] 2022-03-02 22:23:31,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0682, 'learning_rate': 3.24e-05, 'epoch': 0.36} 7%|█████▋ | 325/4460 [32:15<7:38:19, 6.65s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:23:41,316 >> Could not estimate the number of
tokens of the input, floating-point operations will not be computed 7%|█████▋ | 326/4460 [32:21<7:33:12, 6.58s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:23:49,141 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 327/4460 [32:27<7:27:26, 6.50s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:23:55,407 >> Could not estimate the number of tokens of the input, floating-point operations will not
be computed 7%|█████▋ | 328/4460 [32:33<7:21:52, 6.42s/it] {'loss': 7.0354, 'learning_rate': 3.2800000000000004e-05, 'epoch': 0.37} [WARNING|modeling_utils.py:388] 2022-03-02 22:24:01,591 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▊ | 329/4460 [32:40<7:16:21, 6.34s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:24:07,707 >> Could not estimate the number of tokens of the input, floating-point
operations will not be computed 7%|█████▊ | 330/4460 [32:46<7:11:57, 6.28s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:24:13,783 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▊ | 331/4460 [32:52<7:07:17, 6.21s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:24:19,796 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▊ | 332/4460
[32:58<7:02:46, 6.14s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:24:25,738 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▊ | 333/4460 [33:04<6:57:46, 6.07s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:24:30,128 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
7%|█████▊ | 334/4460 [33:09<6:50:47, 5.97s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:24:34,428 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:24:38,593 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0789, 'learning_rate': 3.35e-05, 'epoch': 0.38} [WARNING|modeling_utils.py:388] 2022-03-02 22:24:44,164 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8872, 'learning_rate': 3.3600000000000004e-05, 'epoch': 0.38}
[WARNING|modeling_utils.py:388] 2022-03-02 22:24:48,917 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|█████▉ | 337/4460 [33:27<6:52:13, 6.00s/it] {'loss': 6.876, 'learning_rate': 3.3700000000000006e-05, 'epoch': 0.38} [WARNING|modeling_utils.py:388] 2022-03-02 22:24:54,871 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|█████▉ | 338/4460 [33:33<6:42:45, 5.86s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:24:59,033 >> Could not estimate the number of tokens of the input,
floating-point operations will not be computed 8%|█████▉ | 339/4460 [33:38<6:33:50, 5.73s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:25:05,164 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|█████▉ | 340/4460 [33:44<6:38:33, 5.80s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:25:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:25:11,229 >> Could not estimate the number of tokens of the input, floating-point operations
will not be computed 8%|█████▉ | 341/4460 [33:49<6:17:25, 5.50s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:25:14,815 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:25:17,037 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:25:19,295 >> Could not estimate the number of tokens of the input, floating-point
operations will not be computed 8%|█████▉ | 343/4460 [33:58<5:41:49, 4.98s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:25:22,522 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 344/4460 [34:02<5:21:39, 4.69s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:25:26,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 345/4460 [34:06<4:58:58, 4.36s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:25:29,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 346/4460 [34:09<4:37:31, 4.05s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:25:33,218 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:25:34,687 >> Could not estimate the number of tokens of the input,
floating-point operations will not be computed 8%|██████ | 347/4460 [34:12<4:17:11, 3.75s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:25:36,174 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 348/4460 [34:15<3:55:36, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:25:38,791 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 349/4460 [34:17<3:35:01, 3.14s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:25:41,184 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 350/4460 [34:20<3:27:20, 3.03s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:25:45,427 >> Could
not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 350/4460 [34:20<3:27:20, 3.03s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:25:45,427 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:25:49,135 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:25:45,427 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 351/4460 [34:28<5:03:21, 4.43s/it]g-point operations will not be computed-02 22:25:45,427 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 351/4460 [34:28<5:03:21, 4.43s/it]g-point operations will not be computed-02 22:25:45,427 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 351/4460 [34:28<5:03:21, 4.43s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 351/4460 [34:28<5:03:21, 4.43s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:25:56,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:25:56,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 352/4460 
[34:35<6:05:32, 5.34s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:26:02,183 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:26:02,183 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 353/4460 [34:42<6:46:19, 5.94s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 353/4460 [34:42<6:46:19, 5.94s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9757, 'learning_rate': 3.53e-05, 'epoch': 0.4} 8%|██████▏ | 353/4460 [34:42<6:46:19, 5.94s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 353/4460 [34:42<6:46:19, 5.94s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 353/4460 [34:42<6:46:19, 5.94s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 354/4460 [34:50<7:13:22, 6.33s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 8%|██████▏ | 354/4460 [34:50<7:13:22, 6.33s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 354/4460 [34:50<7:13:22, 6.33s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:26:20,198 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:26:20,198 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0725, 'learning_rate': 3.55e-05, 'epoch': 0.4} [WARNING|modeling_utils.py:388] 2022-03-02 22:26:20,198 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:26:20,198 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 356/4460 [35:04<7:42:55, 6.77s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 356/4460 [35:04<7:42:55, 6.77s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed {'loss': 6.9315, 'learning_rate': 3.56e-05, 'epoch': 0.4} 8%|██████▏ | 356/4460 [35:04<7:42:55, 6.77s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:26:34,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:26:34,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.1701, 'learning_rate': 3.57e-05, 'epoch': 0.4} [WARNING|modeling_utils.py:388] 2022-03-02 22:26:34,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:26:34,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 358/4460 [35:18<7:55:32, 6.96s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 358/4460 [35:18<7:55:32, 6.96s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9165, 'learning_rate': 3.58e-05, 'epoch': 0.4} 
[WARNING|modeling_utils.py:388] 2022-03-02 22:26:47,032 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 359/4460 [35:25<7:58:18, 7.00s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 359/4460 [35:25<7:58:18, 7.00s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0283, 'learning_rate': 3.59e-05, 'epoch': 0.4} 8%|██████▎ | 359/4460 [35:25<7:58:18, 7.00s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 359/4460 [35:25<7:58:18, 7.00s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 360/4460 [35:32<7:58:20, 7.00s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 360/4460 [35:32<7:58:20, 7.00s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:26:59,296 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:26:59,296 >> Could not estimate the number of tokens of the input, floating-point operations 
will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 361/4460 [35:39<7:56:21, 6.97s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 361/4460 [35:39<7:56:21, 6.97s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9093, 'learning_rate': 3.61e-05, 'epoch': 0.4} 8%|██████▎ | 361/4460 [35:39<7:56:21, 6.97s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 361/4460 [35:39<7:56:21, 6.97s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 361/4460 [35:39<7:56:21, 6.97s/it]g-point operations will not be computed-02 22:25:53,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 362/4460 [35:46<7:56:48, 6.98s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 362/4460 [35:46<7:56:48, 6.98s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 362/4460 [35:46<7:56:48, 6.98s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 363/4460 [35:53<7:53:37, 6.94s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:27:11,457 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed 8%|██████▎ | 363/4460 [35:53<7:53:37, 6.94s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9259, 'learning_rate': 3.63e-05, 'epoch': 0.41} 8%|██████▎ | 363/4460 [35:53<7:53:37, 6.94s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:27:23,432 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:27:23,432 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9824, 'learning_rate': 3.6400000000000004e-05, 'epoch': 0.41} [WARNING|modeling_utils.py:388] 2022-03-02 22:27:23,432 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:27:23,432 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:27:23,432 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed 8%|██████▍ | 365/4460 [36:07<7:49:59, 6.89s/it]g-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:27:33,660 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:27:33,660 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▍ | 366/4460 [36:14<7:47:51, 6.86s/it]g-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▍ | 366/4460 [36:14<7:47:51, 6.86s/it]g-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8664, 'learning_rate': 3.66e-05, 'epoch': 0.41} [WARNING|modeling_utils.py:388] 2022-03-02 22:27:42,053 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▍ | 367/4460 [36:20<7:44:32, 6.81s/it]g-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▍ | 367/4460 [36:20<7:44:32, 6.81s/it]g-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9216, 'learning_rate': 
3.6700000000000004e-05, 'epoch': 0.41} 8%|██████▍ | 367/4460 [36:20<7:44:32, 6.81s/it]g-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:27:50,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:27:50,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9015, 'learning_rate': 3.68e-05, 'epoch': 0.41} [WARNING|modeling_utils.py:388] 2022-03-02 22:27:50,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:27:50,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▍ | 369/4460 [36:34<7:40:50, 6.76s/it]g-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▍ | 369/4460 [36:34<7:40:50, 6.76s/it]g-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:28:00,482 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:28:00,482 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▍ | 370/4460 [36:40<7:38:40, 6.73s/it]g-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▍ | 370/4460 [36:40<7:38:40, 6.73s/it]g-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0073, 'learning_rate': 3.7e-05, 'epoch': 0.41} [WARNING|modeling_utils.py:388] 2022-03-02 22:28:08,761 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:28:08,761 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▍ | 371/4460 [36:47<7:36:17, 6.70s/it]g-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▍ | 371/4460 [36:47<7:36:17, 6.70s/it]g-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▍ | 371/4460 [36:47<7:36:17, 6.70s/it]g-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed 8%|██████▍ | 371/4460 [36:47<7:36:17, 6.70s/it]g-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:28:16,977 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:28:16,977 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:28:16,977 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:28:23,483 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:28:23,483 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9379, 'learning_rate': 3.73e-05, 'epoch': 0.42} [WARNING|modeling_utils.py:388] 2022-03-02 22:28:23,483 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:28:23,483 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:28:23,483 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:27:11,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 374/4460 [37:06<7:26:19, 6.55s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 374/4460 [37:06<7:26:19, 6.55s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 374/4460 [37:06<7:26:19, 6.55s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 374/4460 [37:06<7:26:19, 6.55s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 375/4460 [37:13<7:32:28, 6.65s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 375/4460 [37:13<7:32:28, 6.65s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:28:41,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:28:31,568 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:28:41,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 376/4460 [37:20<7:28:38, 6.59s/it]g-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 376/4460 [37:20<7:28:38, 6.59s/it]g-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:28:47,985 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:28:47,985 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 377/4460 [37:26<7:23:04, 6.51s/it]g-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 377/4460 [37:26<7:23:04, 6.51s/it]g-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:28:54,234 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:28:54,234 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 378/4460 [37:32<7:16:34, 6.42s/it]g-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 378/4460 [37:32<7:16:34, 6.42s/it]g-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:29:00,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:29:00,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▋ | 379/4460 [37:38<7:11:24, 6.34s/it]g-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▋ | 379/4460 [37:38<7:11:24, 6.34s/it]g-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:29:06,489 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed [WARNING|modeling_utils.py:388] 2022-03-02 22:29:06,489 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 380/4460 [37:45<7:06:07, 6.27s/it]g-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 380/4460 [37:45<7:06:07, 6.27s/it]g-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:29:12,584 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:29:12,584 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 381/4460 [37:51<7:02:06, 6.21s/it]g-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 381/4460 [37:51<7:02:06, 6.21s/it]g-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:29:18,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 
22:29:18,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 382/4460 [37:57<6:57:08, 6.14s/it]g-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 382/4460 [37:57<6:57:08, 6.14s/it]g-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:29:24,590 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:29:24,590 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 383/4460 [38:03<6:53:11, 6.08s/it]g-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 383/4460 [38:03<6:53:11, 6.08s/it]g-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:29:30,343 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:29:30,343 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed-02 22:28:31,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 384/4460 [38:08<6:46:25, 5.98s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:29:33,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 384/4460 [38:08<6:46:25, 5.98s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:29:33,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:29:37,449 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:29:33,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:29:37,449 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:29:33,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8673, 'learning_rate': 3.85e-05, 'epoch': 0.43} [WARNING|modeling_utils.py:388] 2022-03-02 22:29:41,701 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:29:33,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 386/4460 [38:20<6:34:38, 5.81s/it]g-point operations will not be computed-02 22:29:33,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 386/4460 [38:20<6:34:38, 5.81s/it]g-point operations will not be computed-02 22:29:33,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9269, 'learning_rate': 3.86e-05, 'epoch': 0.43} 
[WARNING|modeling_utils.py:388] 2022-03-02 22:29:47,227 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 387/4460 [38:25<6:29:27, 5.74s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:29:51,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 388/4460 [38:30<6:20:56, 5.61s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:29:55,339 >> Could not estimate the number of tokens of the input, floating-point operations
will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:29:59,165 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:30:01,773 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 390/4460 [38:41<6:04:01, 5.37s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:30:05,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:30:09,104 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:30:11,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:30:13,767 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:30:15,984 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:30:18,120 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8662, 'learning_rate': 3.9300000000000007e-05, 'epoch': 0.44}
[WARNING|modeling_utils.py:388] 2022-03-02 22:30:21,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 394/4460 [38:59<5:14:39, 4.64s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:30:23,369 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 395/4460 [39:03<4:57:19, 4.39s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:30:27,091 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 396/4460 [39:06<4:39:25, 4.13s/it][WARNING|modeling_utils.py:388] 2022-03-02
22:30:31,979 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 397/4460 [39:09<4:18:05, 3.81s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:30:33,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 398/4460 [39:12<3:54:18, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:30:36,028 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 399/4460 [39:14<3:32:53, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:30:38,394 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:30:39,471 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
9%|██████▉ | 400/4460 [39:17<3:25:38, 3.04s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:30:42,738 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:30:46,485 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 401/4460 [39:25<5:02:35, 4.47s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:30:50,285 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:30:53,875 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 402/4460 [39:32<6:00:22, 5.33s/it] {'loss': 7.2003, 'learning_rate': 4.02e-05, 'epoch': 0.45}
9%|███████ | 403/4460 [39:40<6:41:30, 5.94s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:31:04,899 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 404/4460 [39:47<7:07:00, 6.32s/it] {'loss': 7.0276, 'learning_rate': 4.0400000000000006e-05, 'epoch': 0.45} [WARNING|modeling_utils.py:388] 2022-03-02 22:31:17,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.1858, 'learning_rate': 4.05e-05, 'epoch': 0.45} 9%|███████ | 406/4460 [40:01<7:35:57, 6.75s/it] {'loss': 6.9825, 'learning_rate': 4.0600000000000004e-05, 'epoch': 0.46}
[WARNING|modeling_utils.py:388] 2022-03-02 22:31:31,616 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9807, 'learning_rate': 4.07e-05, 'epoch': 0.46} 9%|███████▏ | 408/4460 [40:15<7:49:14, 6.95s/it] {'loss': 6.8985, 'learning_rate': 4.08e-05, 'epoch': 0.46}
[WARNING|modeling_utils.py:388] 2022-03-02 22:31:45,931 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.1197, 'learning_rate': 4.09e-05, 'epoch': 0.46} 9%|███████▏ | 410/4460 [40:29<7:51:31, 6.99s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:31:56,356 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:31:59,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9675, 'learning_rate': 4.11e-05, 'epoch': 0.46} 9%|███████▏ | 412/4460 [40:43<7:46:18, 6.91s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:32:08,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
9%|███████▏ | 413/4460 [40:50<7:45:33, 6.90s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:32:20,255 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
9%|███████▎ | 415/4460 [41:04<7:41:03, 6.84s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:32:30,451 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 416/4460 [41:10<7:39:07, 6.81s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:32:38,858 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
9%|███████▎ | 417/4460 [41:17<7:37:54, 6.80s/it] 9%|███████▎ | 418/4460 [41:24<7:37:06, 6.79s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:32:49,065 >> Could not estimate the number of tokens of the
input, floating-point operations will not be computed 9%|███████▎ | 419/4460 [41:30<7:34:56, 6.75s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:32:57,368 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 420/4460 [41:37<7:33:01, 6.73s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:33:05,574 >> Could
not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 421/4460 [41:44<7:29:16, 6.67s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:33:13,742 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0222, 'learning_rate': 4.22e-05, 'epoch': 0.47}
[WARNING|modeling_utils.py:388] 2022-03-02 22:33:20,305 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9353, 'learning_rate': 4.23e-05, 'epoch': 0.47} 10%|███████▍ | 424/4460 [42:03<7:23:41, 6.60s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:33:31,694 >> Could not estimate the number of tokens of
the input, floating-point operations will not be computed-02 22:32:49,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▍ | 425/4460 [42:10<7:30:33, 6.70s/it]g-point operations will not be computed-02 22:32:49,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▍ | 425/4460 [42:10<7:30:33, 6.70s/it]g-point operations will not be computed-02 22:32:49,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9778, 'learning_rate': 4.25e-05, 'epoch': 0.48} 10%|███████▍ | 425/4460 [42:10<7:30:33, 6.70s/it]g-point operations will not be computed-02 22:32:49,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:33:40,284 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:32:49,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:33:40,284 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:32:49,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0824, 'learning_rate': 4.26e-05, 'epoch': 0.48} [WARNING|modeling_utils.py:388] 2022-03-02 22:33:40,284 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:32:49,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:33:46,647 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:32:49,065 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:33:46,647 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:32:49,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8798, 'learning_rate': 4.27e-05, 'epoch': 0.48} [WARNING|modeling_utils.py:388] 2022-03-02 22:33:46,647 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:32:49,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:33:46,647 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:32:49,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:33:46,647 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:32:49,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▍ | 428/4460 [42:29<7:16:13, 6.49s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:33:54,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▍ | 428/4460 [42:29<7:16:13, 6.49s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:33:54,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▍ | 428/4460 [42:29<7:16:13, 6.49s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:33:54,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▍ | 428/4460 [42:29<7:16:13, 6.49s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:33:54,646 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed 10%|███████▌ | 429/4460 [42:36<7:11:05, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:00,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 429/4460 [42:36<7:11:05, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:00,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 429/4460 [42:36<7:11:05, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:00,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 429/4460 [42:36<7:11:05, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:00,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 430/4460 [42:42<7:07:07, 6.36s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:07,106 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 430/4460 [42:42<7:07:07, 6.36s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:07,106 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 430/4460 [42:42<7:07:07, 6.36s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:07,106 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 430/4460 [42:42<7:07:07, 6.36s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:07,106 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 431/4460 [42:48<7:03:06, 6.30s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:13,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 431/4460 [42:48<7:03:06, 
6.30s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:13,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 431/4460 [42:48<7:03:06, 6.30s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:13,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 431/4460 [42:48<7:03:06, 6.30s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:13,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 432/4460 [42:54<6:58:47, 6.24s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:13,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:34:20,829 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:34:13,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:34:20,829 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:34:13,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:34:20,829 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:34:13,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 433/4460 [43:00<6:55:25, 6.19s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:25,383 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 433/4460 [43:00<6:55:25, 6.19s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:25,383 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed 10%|███████▌ | 433/4460 [43:00<6:55:25, 6.19s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:25,383 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 433/4460 [43:00<6:55:25, 6.19s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:25,383 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 434/4460 [43:06<6:51:43, 6.14s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:31,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 434/4460 [43:06<6:51:43, 6.14s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:31,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:34:35,559 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:34:31,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:34:35,559 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:34:31,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8328, 'learning_rate': 4.35e-05, 'epoch': 0.49} [WARNING|modeling_utils.py:388] 2022-03-02 22:34:35,559 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:34:31,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:34:41,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:34:31,336 >> Could not estimate the number 
of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:34:41,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:34:31,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0409, 'learning_rate': 4.36e-05, 'epoch': 0.49} [WARNING|modeling_utils.py:388] 2022-03-02 22:34:45,622 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:34:31,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▋ | 437/4460 [43:23<6:32:18, 5.85s/it]g-point operations will not be computed-02 22:34:31,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▋ | 437/4460 [43:23<6:32:18, 5.85s/it]g-point operations will not be computed-02 22:34:31,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:34:49,745 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:34:31,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:34:49,745 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:34:31,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:34:49,745 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:34:31,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▋ | 438/4460 [43:29<6:23:09, 
5.72s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:53,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▋ | 438/4460 [43:29<6:23:09, 5.72s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:34:53,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:34:57,654 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:34:53,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:34:57,654 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:34:53,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:00,275 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:34:53,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:00,275 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:34:53,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▋ | 440/4460 [43:39<6:03:46, 5.43s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:35:04,027 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▋ | 440/4460 [43:39<6:03:46, 5.43s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:35:04,027 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9306, 'learning_rate': 4.4000000000000006e-05, 'epoch': 0.49} 
[WARNING|modeling_utils.py:388] 2022-03-02 22:35:07,666 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:04,027 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:07,666 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:04,027 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:10,063 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:04,027 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:12,292 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:04,027 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:12,292 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:04,027 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8843, 'learning_rate': 4.4200000000000004e-05, 'epoch': 0.5} [WARNING|modeling_utils.py:388] 2022-03-02 22:35:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:04,027 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:04,027 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 10%|███████▋ | 443/4460 [43:53<5:25:03, 4.86s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:35:17,742 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:19,757 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:17,742 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:19,757 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:17,742 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 444/4460 [43:57<5:10:20, 4.64s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:35:21,825 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:23,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:21,825 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:23,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:21,825 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 445/4460 [44:01<4:54:00, 4.39s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:35:25,529 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 446/4460 [44:05<4:35:46, 4.12s/it]g-point operations will not be computed-02 22:35:25,529 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
10%|███████▊ | 446/4460 [44:05<4:35:46, 4.12s/it]g-point operations will not be computed-02 22:35:25,529 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:30,396 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:28,929 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 447/4460 [44:08<4:14:05, 3.80s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:35:31,890 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 447/4460 [44:08<4:14:05, 3.80s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:35:31,890 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 448/4460 [44:10<3:52:36, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:35:34,539 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 448/4460 [44:10<3:52:36, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:35:34,539 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 449/4460 [44:13<3:30:14, 3.14s/it]g-point operations will not be computed-02 22:35:34,539 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 449/4460 [44:13<3:30:14, 3.14s/it]g-point operations will not be computed-02 22:35:34,539 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:37,823 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:36,829 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:37,823 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:36,829 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 450/4460 [44:15<3:20:44, 3.00s/it]g-point operations will not be computed-02 22:35:36,829 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 450/4460 [44:15<3:20:44, 3.00s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:35:40,946 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:44,616 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:40,946 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:44,616 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:40,946 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 451/4460 [44:23<4:52:41, 4.38s/it]g-point operations will not be computed-02 22:35:40,946 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 451/4460 [44:23<4:52:41, 4.38s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 451/4460 [44:23<4:52:41, 4.38s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:52,014 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:52,014 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 452/4460 [44:30<5:51:20, 5.26s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 452/4460 [44:30<5:51:20, 5.26s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:59,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:35:59,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 453/4460 [44:37<6:30:39, 5.85s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 453/4460 [44:37<6:30:39, 5.85s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 453/4460 [44:37<6:30:39, 5.85s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 453/4460 [44:37<6:30:39, 5.85s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 453/4460 [44:37<6:30:39, 5.85s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 454/4460 [44:45<6:54:38, 6.21s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:36:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:36:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:36:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 455/4460 [44:52<7:11:05, 6.46s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 455/4460 [44:52<7:11:05, 6.46s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 455/4460 
[44:52<7:11:05, 6.46s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:36:22,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:36:22,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0871, 'learning_rate': 4.5600000000000004e-05, 'epoch': 0.51} [WARNING|modeling_utils.py:388] 2022-03-02 22:36:22,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:36:22,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:36:22,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 457/4460 [45:06<7:29:03, 6.73s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 457/4460 [45:06<7:29:03, 6.73s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 457/4460 [45:06<7:29:03, 6.73s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:36:36,093 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:36:36,093 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9567, 'learning_rate': 4.58e-05, 'epoch': 0.51} [WARNING|modeling_utils.py:388] 2022-03-02 22:36:36,093 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:36:36,093 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:36:36,093 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 459/4460 [45:20<7:38:11, 6.87s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 459/4460 [45:20<7:38:11, 
6.87s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:36:48,309 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:36:48,309 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 460/4460 [45:26<7:37:46, 6.87s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 460/4460 [45:26<7:37:46, 6.87s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 460/4460 [45:26<7:37:46, 6.87s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:36:56,730 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:36:56,730 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.852, 'learning_rate': 4.61e-05, 'epoch': 0.52} 
[WARNING|modeling_utils.py:388] 2022-03-02 22:36:56,730 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:36:56,730 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:36:56,730 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 462/4460 [45:40<7:34:36, 6.82s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 462/4460 [45:40<7:34:36, 6.82s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:37:08,603 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 463/4460 [45:47<7:32:30, 6.79s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 463/4460 [45:47<7:32:30, 6.79s/it]g-point operations will not be computed-02 22:35:48,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8578, 'learning_rate': 
4.630000000000001e-05, 'epoch': 0.52} 10%|████████ | 463/4460 [45:47<7:32:30, 6.79s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:37:17,028 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.836, 'learning_rate': 4.64e-05, 'epoch': 0.52} [WARNING|modeling_utils.py:388] 2022-03-02 22:37:17,028 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▏ | 465/4460 [46:00<7:30:46, 6.77s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:37:28,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▏ | 466/4460 [46:07<7:30:16, 6.76s/it] {'loss': 6.9268, 'learning_rate': 4.660000000000001e-05, 'epoch': 0.52} 10%|████████▏ | 466/4460 [46:07<7:30:16, 6.76s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:37:37,215 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8603, 'learning_rate': 4.6700000000000003e-05, 'epoch': 0.52} [WARNING|modeling_utils.py:388] 2022-03-02 22:37:37,215 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
10%|████████▏ | 468/4460 [46:20<7:27:28, 6.73s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:37:47,315 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▏ | 469/4460 [46:27<7:27:09, 6.72s/it] {'loss': 6.8866, 'learning_rate': 4.69e-05, 'epoch': 0.53} [WARNING|modeling_utils.py:388] 2022-03-02 22:37:55,565 >> Could not estimate the number of tokens of the input, floating-point operations will not be
computed 11%|████████▏ | 470/4460 [46:34<7:24:55, 6.69s/it] {'loss': 6.9621, 'learning_rate': 4.7e-05, 'epoch': 0.53} 11%|████████▏ | 470/4460 [46:34<7:24:55, 6.69s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:38:03,799 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8627, 'learning_rate': 4.71e-05, 'epoch': 0.53} [WARNING|modeling_utils.py:388] 2022-03-02 22:38:03,799 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:38:10,449 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8953, 'learning_rate': 4.72e-05, 'epoch': 0.53} [WARNING|modeling_utils.py:388] 2022-03-02 22:38:10,449 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▎ | 473/4460 [46:53<7:19:34, 6.62s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:38:20,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▎ | 474/4460 [47:00<7:15:16,
6.55s/it] {'loss': 6.8248, 'learning_rate': 4.74e-05, 'epoch': 0.53} [WARNING|modeling_utils.py:388] 2022-03-02 22:38:28,162 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▎ | 475/4460 [47:07<7:22:39, 6.66s/it] {'loss': 6.9785, 'learning_rate': 4.75e-05, 'epoch': 0.53} 11%|████████▎ | 475/4460 [47:07<7:22:39, 6.66s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:38:36,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9762,
'learning_rate': 4.76e-05, 'epoch': 0.53} [WARNING|modeling_utils.py:388] 2022-03-02 22:38:36,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:38:42,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.073, 'learning_rate': 4.77e-05, 'epoch': 0.53} [WARNING|modeling_utils.py:388] 2022-03-02 22:38:42,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▎ | 478/4460 [47:26<7:06:13, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:38:50,834 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
11%|████████▍ | 479/4460 [47:32<7:01:02, 6.35s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:38:56,994 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▍ | 480/4460 [47:38<6:56:45, 6.28s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:39:03,111 >>
Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▍ | 481/4460 [47:44<6:52:20, 6.22s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:39:09,166 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:39:13,515 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8619, 'learning_rate': 4.82e-05, 'epoch': 0.54} [WARNING|modeling_utils.py:388] 2022-03-02 22:39:13,515 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:39:19,351 >> Could not estimate the number
of tokens of the input, floating-point operations will not be computed {'loss': 6.8868, 'learning_rate': 4.83e-05, 'epoch': 0.54} [WARNING|modeling_utils.py:388] 2022-03-02 22:39:23,659 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▍ | 484/4460 [48:02<6:33:22, 5.94s/it] {'loss': 6.9498, 'learning_rate': 4.8400000000000004e-05, 'epoch': 0.54} [WARNING|modeling_utils.py:388] 2022-03-02 22:39:29,240 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▍ | 485/4460 [48:07<6:25:34, 5.82s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:39:32,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9514, 'learning_rate': 4.85e-05, 'epoch': 0.54} 11%|████████▍ | 485/4460 [48:07<6:25:34, 5.82s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:39:32,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
11%|████████▍ | 486/4460 [48:13<6:21:36, 5.76s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:39:37,667 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:39:40,344 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▌ | 487/4460 [48:18<6:15:50, 5.68s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:39:44,419 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
11%|████████▌ | 488/4460 [48:23<6:07:47, 5.56s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:39:48,485 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:39:52,258 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:39:54,844 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▌ | 490/4460 [48:34<5:53:07, 5.34s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:39:58,550 >> Could not estimate the number of
tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:40:00,823 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▌ | 491/4460 [48:38<5:39:42, 5.14s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:40:04,239 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:40:06,361 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388]
2022-03-02 22:40:08,566 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:40:10,501 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:40:12,480 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:40:14,290 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:40:16,100 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:40:17,744 >> Could not
estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:40:19,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:40:22,440 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:40:23,801 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:40:26,505 >> Could not estimate the number of tokens of the
input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:40:28,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:40:30,160 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:40:31,808 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:40:35,877 >> Could not estimate the number of tokens of the input, floating-point operations will
not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:40:39,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:40:43,365 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:40:47,029 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 7.0552, 'learning_rate': 5.02e-05, 'epoch': 0.56} [WARNING|modeling_utils.py:388] 2022-03-02 22:40:47,029 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▊ | 503/4460 [49:31<6:29:46, 5.91s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:41:01,461 >> Could not estimate the number
of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.242, 'learning_rate': 5.0400000000000005e-05, 'epoch': 0.57} [WARNING|modeling_utils.py:388] 2022-03-02 22:41:01,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:41:01,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:41:01,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▊ | 505/4460 [49:45<7:10:50, 6.54s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▊ | 505/4460 [49:45<7:10:50, 6.54s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:41:13,879 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▊ | 506/4460 [49:52<7:19:57, 6.68s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed 11%|████████▊ | 506/4460 [49:52<7:19:57, 6.68s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.1445, 'learning_rate': 5.0600000000000003e-05, 'epoch': 0.57} 11%|████████▊ | 506/4460 [49:52<7:19:57, 6.68s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▊ | 506/4460 [49:52<7:19:57, 6.68s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▊ | 506/4460 [49:52<7:19:57, 6.68s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▊ | 507/4460 [49:59<7:28:42, 6.81s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▊ | 507/4460 [49:59<7:28:42, 6.81s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:41:28,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 508/4460 [50:06<7:34:37, 6.90s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 508/4460 [50:06<7:34:37, 6.90s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed {'loss': 6.9375, 'learning_rate': 5.08e-05, 'epoch': 0.57} 11%|████████▉ | 508/4460 [50:06<7:34:37, 6.90s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 508/4460 [50:06<7:34:37, 6.90s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 508/4460 [50:06<7:34:37, 6.90s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 509/4460 [50:13<7:36:42, 6.94s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:41:40,428 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:41:40,428 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 510/4460 [50:20<7:38:51, 6.97s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 510/4460 [50:20<7:38:51, 6.97s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9671, 'learning_rate': 5.1000000000000006e-05, 'epoch': 
0.57} 11%|████████▉ | 510/4460 [50:20<7:38:51, 6.97s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:41:50,655 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:41:50,655 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9103, 'learning_rate': 5.11e-05, 'epoch': 0.57} [WARNING|modeling_utils.py:388] 2022-03-02 22:41:50,655 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:41:50,655 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:41:50,655 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 512/4460 [50:34<7:35:32, 6.92s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 512/4460 [50:34<7:35:32, 6.92s/it]g-point operations will not be computed-02 22:39:58,550 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:42:02,782 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|████████▉ | 513/4460 [50:41<7:33:43, 6.90s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|████████▉ | 513/4460 [50:41<7:33:43, 6.90s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0824, 'learning_rate': 5.130000000000001e-05, 'epoch': 0.58} 12%|████████▉ | 513/4460 [50:41<7:33:43, 6.90s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:42:11,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:42:11,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0204, 'learning_rate': 5.14e-05, 'epoch': 0.58} [WARNING|modeling_utils.py:388] 2022-03-02 22:42:11,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-02 22:42:11,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:42:11,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████ | 515/4460 [50:55<7:30:58, 6.86s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:42:21,502 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:42:21,502 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████ | 516/4460 [51:01<7:28:06, 6.82s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████ | 516/4460 [51:01<7:28:06, 6.82s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8061, 'learning_rate': 5.16e-05, 'epoch': 0.58} 12%|█████████ | 516/4460 [51:01<7:28:06, 6.82s/it]g-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations 
will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:42:31,422 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:42:31,422 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0333, 'learning_rate': 5.17e-05, 'epoch': 0.58} [WARNING|modeling_utils.py:388] 2022-03-02 22:42:31,422 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:42:31,422 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:42:31,422 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:39:58,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████ | 518/4460 [51:14<7:19:46, 6.69s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████ | 518/4460 [51:14<7:19:46, 6.69s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████ | 518/4460 [51:14<7:19:46, 6.69s/it][WARNING|modeling_utils.py:388] 2022-03-02 
22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████ | 519/4460 [51:21<7:18:12, 6.67s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████ | 519/4460 [51:21<7:18:12, 6.67s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:42:47,901 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:42:47,901 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████ | 520/4460 [51:28<7:14:24, 6.62s/it]g-point operations will not be computed-02 22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████ | 520/4460 [51:28<7:14:24, 6.62s/it]g-point operations will not be computed-02 22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0285, 'learning_rate': 5.2000000000000004e-05, 'epoch': 0.58} 12%|█████████ | 520/4460 [51:28<7:14:24, 6.62s/it]g-point operations will not be computed-02 22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:42:57,558 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:42:39,735 >> Could not estimate 
the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:42:57,558 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.872, 'learning_rate': 5.2100000000000006e-05, 'epoch': 0.58} [WARNING|modeling_utils.py:388] 2022-03-02 22:42:57,558 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:43:04,009 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:43:04,009 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8672, 'learning_rate': 5.22e-05, 'epoch': 0.59} [WARNING|modeling_utils.py:388] 2022-03-02 22:43:04,009 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:43:10,463 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:43:10,463 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed-02 22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.769, 'learning_rate': 5.2300000000000004e-05, 'epoch': 0.59} [WARNING|modeling_utils.py:388] 2022-03-02 22:43:10,463 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:43:10,463 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:43:10,463 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:42:39,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▏ | 524/4460 [51:53<7:04:28, 6.47s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▏ | 524/4460 [51:53<7:04:28, 6.47s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▏ | 524/4460 [51:53<7:04:28, 6.47s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▏ | 525/4460 [52:00<7:11:20, 6.58s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▏ | 525/4460 [52:00<7:11:20, 
6.58s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0126, 'learning_rate': 5.25e-05, 'epoch': 0.59} [WARNING|modeling_utils.py:388] 2022-03-02 22:43:28,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▏ | 526/4460 [52:07<7:08:21, 6.53s/it]g-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▏ | 526/4460 [52:07<7:08:21, 6.53s/it]g-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8324, 'learning_rate': 5.2600000000000005e-05, 'epoch': 0.59} [WARNING|modeling_utils.py:388] 2022-03-02 22:43:34,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▏ | 527/4460 [52:13<7:03:38, 6.46s/it]g-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▏ | 527/4460 [52:13<7:03:38, 6.46s/it]g-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9212, 'learning_rate': 5.270000000000001e-05, 'epoch': 0.59} 12%|█████████▏ | 527/4460 [52:13<7:03:38, 6.46s/it]g-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-02 22:43:42,640 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:43:42,640 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9677, 'learning_rate': 5.28e-05, 'epoch': 0.59} [WARNING|modeling_utils.py:388] 2022-03-02 22:43:42,640 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:43:48,791 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:43:48,791 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9959, 'learning_rate': 5.2900000000000005e-05, 'epoch': 0.59} [WARNING|modeling_utils.py:388] 2022-03-02 22:43:48,791 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:43:54,879 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:43:18,479 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:43:54,879 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0283, 'learning_rate': 5.300000000000001e-05, 'epoch': 0.59} [WARNING|modeling_utils.py:388] 2022-03-02 22:43:54,879 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:44:00,923 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:44:00,923 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9021, 'learning_rate': 5.31e-05, 'epoch': 0.6} [WARNING|modeling_utils.py:388] 2022-03-02 22:44:05,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▎ | 532/4460 [52:43<6:40:50, 6.12s/it]g-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▎ | 532/4460 [52:43<6:40:50, 6.12s/it]g-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed {'loss': 6.8198, 'learning_rate': 5.3200000000000006e-05, 'epoch': 0.6} [WARNING|modeling_utils.py:388] 2022-03-02 22:44:11,287 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▎ | 533/4460 [52:49<6:35:29, 6.04s/it]g-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▎ | 533/4460 [52:49<6:35:29, 6.04s/it]g-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.836, 'learning_rate': 5.330000000000001e-05, 'epoch': 0.6} [WARNING|modeling_utils.py:388] 2022-03-02 22:44:17,130 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:44:17,130 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▎ | 534/4460 [52:55<6:30:18, 5.97s/it]g-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:44:21,448 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:43:18,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:44:21,448 
>> Could not estimate the number of tokens of the input, floating-point operations will not be computed
12%|█████████▎ | 535/4460 [53:01<6:25:51, 5.90s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:44:25,781 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8605, 'learning_rate': 5.3500000000000006e-05, 'epoch': 0.6}
[WARNING|modeling_utils.py:388] 2022-03-02 22:44:29,869 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.9906, 'learning_rate': 5.360000000000001e-05, 'epoch': 0.6}
[WARNING|modeling_utils.py:388] 2022-03-02 22:44:34,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
12%|█████████▍ | 537/4460 [53:12<6:15:21, 5.74s/it]
{'loss': 6.9186, 'learning_rate': 5.3700000000000004e-05, 'epoch': 0.6}
[WARNING|modeling_utils.py:388] 2022-03-02 22:44:39,527 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
12%|█████████▍ | 538/4460 [53:17<6:08:05, 5.63s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:44:43,495 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:44:45,989 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.9883, 'learning_rate': 5.390000000000001e-05, 'epoch': 0.6}
[WARNING|modeling_utils.py:388] 2022-03-02 22:44:49,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
12%|█████████▍ | 540/4460 [53:27<5:46:14, 5.30s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:44:52,103 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:44:55,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:44:57,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:44:59,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:45:01,839 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:45:03,724 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:45:05,630 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:45:07,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:45:10,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:45:12,174 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:45:13,607 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:45:16,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:45:17,873 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:45:20,436 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:45:22,716 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:45:24,320 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:45:28,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:45:31,951 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:45:35,729 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:45:39,384 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:45:44,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
12%|█████████▋ | 553/4460 [54:23<6:23:07, 5.88s/it]
12%|█████████▋ | 554/4460 [54:30<6:48:47, 6.28s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:45:59,368 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
12%|█████████▋ | 555/4460 [54:38<7:05:58, 6.54s/it]
{'loss': 7.0603, 'learning_rate': 5.550000000000001e-05, 'epoch': 0.62}
12%|█████████▋ | 556/4460 [54:45<7:17:59, 6.73s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:46:13,596 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
12%|█████████▋ | 557/4460 [54:52<7:24:20, 6.83s/it]
{'loss': 6.9658, 'learning_rate': 5.5700000000000005e-05, 'epoch': 0.62}
13%|█████████▊ | 558/4460 [54:59<7:28:22, 6.89s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:46:24,168 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
13%|█████████▊ | 559/4460 [55:06<7:30:32, 6.93s/it]
13%|█████████▊ | 560/4460 [55:13<7:31:29, 6.95s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:46:38,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
13%|█████████▊ | 561/4460 [55:20<7:30:46, 6.94s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:46:48,470 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
13%|█████████▊ | 562/4460 [55:27<7:29:07, 6.91s/it]
{'loss': 6.8454, 'learning_rate': 5.620000000000001e-05, 'epoch': 0.63}
13%|█████████▊ | 563/4460 [55:33<7:26:30, 6.87s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
13%|█████████▊ | 564/4460 [55:40<7:24:02, 6.84s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:47:08,799 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|█████████▉ | 565/4460 [55:47<7:22:24, 6.82s/it]g-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|█████████▉ | 565/4460 [55:47<7:22:24, 6.82s/it]g-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0916, 'learning_rate': 5.65e-05, 'epoch': 0.63} 13%|█████████▉ | 565/4460 [55:47<7:22:24, 6.82s/it]g-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:47:17,145 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:47:17,145 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7813, 'learning_rate': 5.66e-05, 'epoch': 0.63} [WARNING|modeling_utils.py:388] 2022-03-02 22:47:17,145 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:47:17,145 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:47:17,145 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|█████████▉ | 567/4460 [56:00<7:18:10, 6.75s/it]g-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:47:27,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:47:27,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:47:27,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|█████████▉ | 568/4460 [56:07<7:16:37, 6.73s/it]g-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|█████████▉ | 568/4460 [56:07<7:16:37, 6.73s/it]g-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:47:35,477 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|█████████▉ | 569/4460 [56:14<7:13:03, 6.68s/it]g-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|█████████▉ | 569/4460 [56:14<7:13:03, 6.68s/it]g-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8687, 'learning_rate': 5.69e-05, 'epoch': 0.64} [WARNING|modeling_utils.py:388] 2022-03-02 22:47:42,061 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|█████████▉ | 570/4460 [56:20<7:11:33, 6.66s/it]g-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|█████████▉ | 570/4460 [56:20<7:11:33, 6.66s/it]g-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.998, 'learning_rate': 5.6999999999999996e-05, 'epoch': 0.64} 13%|█████████▉ | 570/4460 [56:20<7:11:33, 6.66s/it]g-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:47:50,315 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:47:50,315 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8671, 'learning_rate': 5.71e-05, 'epoch': 0.64} [WARNING|modeling_utils.py:388] 2022-03-02 22:47:50,315 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:47:50,315 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:47:50,315 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 572/4460 [56:33<7:08:36, 6.61s/it]g-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:48:00,121 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:48:00,121 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 573/4460 [56:40<7:04:51, 6.56s/it]g-point operations will not be computed-02 22:46:58,713 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 573/4460 [56:40<7:04:51, 6.56s/it]g-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8657, 'learning_rate': 5.73e-05, 'epoch': 0.64} [WARNING|modeling_utils.py:388] 2022-03-02 22:48:08,154 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 574/4460 [56:46<7:03:42, 6.54s/it]g-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 574/4460 [56:46<7:03:42, 6.54s/it]g-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7437, 'learning_rate': 5.74e-05, 'epoch': 0.64} [WARNING|modeling_utils.py:388] 2022-03-02 22:48:14,631 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:48:14,631 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:48:14,631 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9499, 
'learning_rate': 5.7499999999999995e-05, 'epoch': 0.64} [WARNING|modeling_utils.py:388] 2022-03-02 22:48:14,631 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:48:14,631 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:48:14,631 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:46:58,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 576/4460 [57:00<7:07:04, 6.60s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:48:24,874 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 576/4460 [57:00<7:07:04, 6.60s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:48:24,874 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 576/4460 [57:00<7:07:04, 6.60s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:48:24,874 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 577/4460 [57:06<7:02:16, 6.52s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:48:24,874 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 577/4460 [57:06<7:02:16, 6.52s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:48:24,874 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 
22:48:32,678 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 578/4460 [57:12<6:56:08, 6.43s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:48:38,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 579/4460 [57:18<6:52:10, 6.37s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:48:45,146 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 580/4460 [57:25<6:48:11, 6.31s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:48:51,249 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 581/4460 [57:31<6:44:12, 6.25s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:48:57,411 >> Could not estimate the number of tokens 
of the input, floating-point operations will not be computed 13%|██████████▏ | 582/4460 [57:37<6:41:06, 6.21s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:49:03,450 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 583/4460 [57:43<6:37:27, 6.15s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:49:09,323 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 13%|██████████▏ | 584/4460 [57:49<6:30:29, 6.04s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:49:13,719 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:49:17,970 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9344, 'learning_rate': 5.85e-05, 'epoch': 0.66} [WARNING|modeling_utils.py:388] 2022-03-02 22:49:17,970 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:49:23,692 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9409, 'learning_rate': 5.86e-05, 'epoch': 0.66} [WARNING|modeling_utils.py:388] 2022-03-02 22:49:27,889 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▎ | 587/4460 [58:06<6:13:33, 5.79s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:49:32,018 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▎ | 588/4460 [58:11<6:06:49, 
5.68s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:49:36,098 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8768, 'learning_rate': 5.88e-05, 'epoch': 0.66} [WARNING|modeling_utils.py:388] 2022-03-02 22:49:39,947 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:49:42,583 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▎ | 590/4460 [58:22<5:52:42, 5.47s/it] 
[WARNING|modeling_utils.py:388] 2022-03-02 22:49:47,714 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:49:50,091 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:49:52,524 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▎ | 592/4460 [58:31<5:30:34, 5.13s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:49:56,039 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:49:58,209 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed 13%|██████████▎ | 593/4460 [58:36<5:17:11, 4.92s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:50:00,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:50:02,387 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▍ | 594/4460 [58:40<5:00:47, 4.67s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:50:04,373 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:50:06,201 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
13%|██████████▍ | 595/4460 [58:44<4:42:48, 4.39s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:50:08,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▍ | 596/4460 [58:47<4:23:06, 4.09s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:50:12,856 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▍ | 597/4460 [58:50<4:04:07, 3.79s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:50:14,349 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▍ | 598/4460 [58:53<3:42:11, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:50:16,944 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 13%|██████████▍ | 599/4460 [58:55<3:21:55, 3.14s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:50:20,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▍ | 600/4460 [58:58<3:14:33, 3.02s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:50:23,483 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▌ | 601/4460 [59:06<4:45:20, 4.44s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:50:31,037 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:50:34,664 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▌ | 602/4460 [59:13<5:41:11, 5.31s/it] 14%|██████████▌ | 603/4460 [59:20<6:19:08, 5.90s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:50:47,405 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▌ | 604/4460 [59:28<6:45:43, 6.31s/it] 14%|██████████▌ | 605/4460 [59:35<7:02:36, 6.58s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:51:01,892 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▌ | 606/4460 [59:42<7:14:42, 6.77s/it] {'loss': 6.8571, 'learning_rate': 6.06e-05, 'epoch': 0.68} 14%|██████████▌ | 606/4460 [59:42<7:14:42, 6.77s/it] 14%|██████████▌ | 607/4460 [59:49<7:19:59, 6.85s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:51:16,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▋ | 608/4460 [59:56<7:24:12, 6.92s/it] 14%|██████████▍ | 609/4460 [1:00:03<7:27:02, 6.96s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:51:30,143 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▍ | 610/4460 [1:00:10<7:26:05, 6.95s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:51:40,438 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9091, 'learning_rate': 6.110000000000001e-05, 'epoch': 0.68} [WARNING|modeling_utils.py:388] 2022-03-02 22:51:40,438 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▍ | 612/4460 [1:00:24<7:24:13, 6.93s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:51:50,773 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
14%|██████████▍ | 613/4460 [1:00:31<7:21:09, 6.88s/it] {'loss': 6.846, 'learning_rate': 6.13e-05, 'epoch': 0.69} 14%|██████████▍ | 613/4460 [1:00:31<7:21:09, 6.88s/it] [WARNING|modeling_utils.py:388] 2022-03-02 22:52:00,930 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8987, 'learning_rate': 6.14e-05, 'epoch': 0.69} [WARNING|modeling_utils.py:388] 2022-03-02 22:52:00,930 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed-02 22:50:31,037 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▍ | 615/4460 [1:00:44<7:17:42, 6.83s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:52:09,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▍ | 615/4460 [1:00:44<7:17:42, 6.83s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:52:09,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▍ | 615/4460 [1:00:44<7:17:42, 6.83s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:52:09,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▍ | 615/4460 [1:00:44<7:17:42, 6.83s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:52:09,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▍ | 616/4460 [1:00:51<7:16:04, 6.81s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:52:09,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:52:17,819 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:52:09,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:52:17,819 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:52:09,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▌ | 617/4460 [1:00:58<7:12:25, 6.75s/it]g-point operations will not be computed-02 22:52:09,445 >> Could not estimate the number of tokens of the input, floating-point operations will not 
be computed 14%|██████████▌ | 617/4460 [1:00:58<7:12:25, 6.75s/it]g-point operations will not be computed-02 22:52:09,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8996, 'learning_rate': 6.170000000000001e-05, 'epoch': 0.69} 14%|██████████▌ | 617/4460 [1:00:58<7:12:25, 6.75s/it]g-point operations will not be computed-02 22:52:09,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:52:27,662 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:52:09,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:52:27,662 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:52:09,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7595, 'learning_rate': 6.18e-05, 'epoch': 0.69} [WARNING|modeling_utils.py:388] 2022-03-02 22:52:27,662 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:52:09,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:52:27,662 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:52:09,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:52:27,662 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:52:09,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▌ | 619/4460 
[1:01:11<7:08:17, 6.69s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:52:36,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▌ | 619/4460 [1:01:11<7:08:17, 6.69s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:52:36,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▌ | 619/4460 [1:01:11<7:08:17, 6.69s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:52:36,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▌ | 619/4460 [1:01:11<7:08:17, 6.69s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:52:36,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▌ | 620/4460 [1:01:17<7:07:40, 6.68s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:52:36,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:52:44,333 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:52:36,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:52:44,333 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:52:36,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:52:44,333 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:52:36,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▌ | 621/4460 [1:01:24<7:05:27, 6.65s/it]g-point operations will not be computed-02 22:52:36,057 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▌ | 621/4460 [1:01:24<7:05:27, 6.65s/it]g-point operations will not be computed-02 22:52:36,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▌ | 621/4460 [1:01:24<7:05:27, 6.65s/it]g-point operations will not be computed-02 22:52:36,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:52:54,049 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:52:36,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:52:54,049 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:52:36,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8547, 'learning_rate': 6.220000000000001e-05, 'epoch': 0.7} [WARNING|modeling_utils.py:388] 2022-03-02 22:52:54,049 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:52:36,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:53:00,502 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:52:36,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:53:00,502 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:52:36,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
{'loss': 6.965, 'learning_rate': 6.23e-05, 'epoch': 0.7}
14%|██████████▋ | 624/4460 [1:01:43<6:57:03, 6.52s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:53:08,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
14%|██████████▋ | 625/4460 [1:01:51<7:07:50, 6.69s/it]
{'loss': 7.0048, 'learning_rate': 6.25e-05, 'epoch': 0.7}
[WARNING|modeling_utils.py:388] 2022-03-02 22:53:20,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8393, 'learning_rate': 6.26e-05, 'epoch': 0.7}
[WARNING|modeling_utils.py:388] 2022-03-02 22:53:26,684 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.9773, 'learning_rate': 6.27e-05, 'epoch': 0.7}
[WARNING|modeling_utils.py:388] 2022-03-02 22:53:32,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.958, 'learning_rate': 6.280000000000001e-05, 'epoch': 0.7}
[WARNING|modeling_utils.py:388] 2022-03-02 22:53:39,151 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.9277, 'learning_rate': 6.29e-05, 'epoch': 0.71}
[WARNING|modeling_utils.py:388] 2022-03-02 22:53:45,347 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 7.1124, 'learning_rate': 6.3e-05, 'epoch': 0.71}
[WARNING|modeling_utils.py:388] 2022-03-02 22:53:49,971 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
14%|██████████▊ | 631/4460 [1:02:28<6:39:03, 6.25s/it]
{'loss': 6.8434, 'learning_rate': 6.31e-05, 'epoch': 0.71}
[WARNING|modeling_utils.py:388] 2022-03-02 22:53:55,983 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
14%|██████████▊ | 632/4460 [1:02:34<6:34:19, 6.18s/it]
{'loss': 6.7448, 'learning_rate': 6.32e-05, 'epoch': 0.71}
[WARNING|modeling_utils.py:388] 2022-03-02 22:54:03,434 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.9454, 'learning_rate': 6.330000000000001e-05, 'epoch': 0.71}
[WARNING|modeling_utils.py:388] 2022-03-02 22:54:07,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
14%|██████████▊ | 634/4460 [1:02:46<6:26:59, 6.07s/it]
{'loss': 6.8538, 'learning_rate': 6.340000000000001e-05, 'epoch': 0.71}
[WARNING|modeling_utils.py:388] 2022-03-02 22:54:13,753 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
14%|██████████▊ | 635/4460 [1:02:52<6:21:29, 5.98s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:54:18,117 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
14%|██████████▊ | 636/4460 [1:02:57<6:16:00, 5.90s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.9616, 'learning_rate': 6.36e-05, 'epoch': 0.71}
[WARNING|modeling_utils.py:388] 2022-03-02 22:54:26,529 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.9076, 'learning_rate': 6.37e-05, 'epoch': 0.71}
[WARNING|modeling_utils.py:388] 2022-03-02 22:54:30,745 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
14%|██████████▊ | 638/4460 [1:03:09<6:06:00, 5.75s/it]
{'loss': 6.7661, 'learning_rate': 6.38e-05, 'epoch': 0.72}
[WARNING|modeling_utils.py:388] 2022-03-02 22:54:36,146 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
14%|██████████▉ | 639/4460 [1:03:14<5:58:26, 5.63s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:54:40,147 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:54:42,687 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.7776, 'learning_rate': 6.400000000000001e-05, 'epoch': 0.72}
[WARNING|modeling_utils.py:388] 2022-03-02 22:54:46,508 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
14%|██████████▉ | 641/4460 [1:03:24<5:42:48, 5.39s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:54:50,269 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:54:52,504 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:54:54,771 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:54:56,813 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:54:58,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:55:00,879 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:55:02,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:55:04,638 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:55:06,493 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:55:08,089 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:55:11,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:55:12,792 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:55:15,440 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:55:16,595 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:55:19,411 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.4109, 'learning_rate': 6.500000000000001e-05, 'epoch': 0.73}
[WARNING|modeling_utils.py:388] 2022-03-02 22:55:23,511 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:55:27,228 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:55:30,983 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
15%|███████████ | 652/4460 [1:04:11<5:40:48, 5.37s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:55:40,018 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
15%|███████████▏ | 653/4460 [1:04:18<6:15:25, 5.92s/it]
{'loss': 7.1023, 'learning_rate': 6.53e-05, 'epoch': 0.73}
15%|███████████▏ | 654/4460 [1:04:25<6:39:27, 6.30s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:55:54,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
15%|███████████▏ | 655/4460 [1:04:33<6:56:45, 6.57s/it]
{'loss': 7.0161, 'learning_rate': 6.55e-05, 'epoch': 0.73}
15%|███████████▏ | 656/4460 [1:04:40<7:07:25, 6.74s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:56:08,665 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
15%|███████████▏ | 657/4460 [1:04:47<7:13:36, 6.84s/it]
{'loss': 6.787, 'learning_rate': 6.570000000000001e-05, 'epoch': 0.74}
15%|███████████▏ | 658/4460 [1:04:54<7:17:44, 6.91s/it]
15%|███████████▏ | 
658/4460 [1:04:54<7:17:44, 6.91s/it]g-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:56:22,731 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▏ | 659/4460 [1:05:01<7:19:16, 6.93s/it]g-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▏ | 659/4460 [1:05:01<7:19:16, 6.93s/it]g-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8224, 'learning_rate': 6.59e-05, 'epoch': 0.74} 15%|███████████▏ | 659/4460 [1:05:01<7:19:16, 6.93s/it]g-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▏ | 659/4460 [1:05:01<7:19:16, 6.93s/it]g-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▏ | 659/4460 [1:05:01<7:19:16, 6.93s/it]g-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▏ | 660/4460 [1:05:08<7:20:52, 6.96s/it]g-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:56:34,971 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 
22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:56:34,971 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▎ | 661/4460 [1:05:15<7:18:56, 6.93s/it]g-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▎ | 661/4460 [1:05:15<7:18:56, 6.93s/it]g-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8718, 'learning_rate': 6.610000000000001e-05, 'epoch': 0.74} 15%|███████████▎ | 661/4460 [1:05:15<7:18:56, 6.93s/it]g-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:56:45,229 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:56:45,229 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8949, 'learning_rate': 6.620000000000001e-05, 'epoch': 0.74} [WARNING|modeling_utils.py:388] 2022-03-02 22:56:45,229 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:56:45,229 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:56:45,229 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▎ | 663/4460 [1:05:29<7:16:10, 6.89s/it]g-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:56:55,566 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:56:55,566 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:56:55,566 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▎ | 664/4460 [1:05:35<7:14:18, 6.86s/it]g-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▎ | 664/4460 [1:05:35<7:14:18, 6.86s/it]g-point operations will not be computed-02 22:54:22,392 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed 15%|███████████▎ | 664/4460 [1:05:35<7:14:18, 6.86s/it]g-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:57:05,605 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:57:05,605 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8473, 'learning_rate': 6.65e-05, 'epoch': 0.75} [WARNING|modeling_utils.py:388] 2022-03-02 22:57:05,605 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:57:05,605 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:57:05,605 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:54:22,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▎ | 666/4460 [1:05:49<7:08:43, 6.78s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:57:14,036 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▎ | 666/4460 
[1:05:49<7:08:43, 6.78s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:57:14,036 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▎ | 666/4460 [1:05:49<7:08:43, 6.78s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:57:14,036 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▎ | 667/4460 [1:05:55<7:07:07, 6.76s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:57:14,036 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▎ | 667/4460 [1:05:55<7:07:07, 6.76s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:57:14,036 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:57:22,400 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:57:14,036 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:57:22,400 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:57:14,036 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▍ | 668/4460 [1:06:02<7:05:07, 6.73s/it]g-point operations will not be computed-02 22:57:14,036 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▍ | 668/4460 [1:06:02<7:05:07, 6.73s/it]g-point operations will not be computed-02 22:57:14,036 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7926, 'learning_rate': 6.680000000000001e-05, 'epoch': 0.75} [WARNING|modeling_utils.py:388] 2022-03-02 22:57:30,593 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed-02 22:57:14,036 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▍ | 669/4460 [1:06:09<7:02:33, 6.69s/it]g-point operations will not be computed-02 22:57:14,036 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▍ | 669/4460 [1:06:09<7:02:33, 6.69s/it]g-point operations will not be computed-02 22:57:14,036 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9591, 'learning_rate': 6.690000000000001e-05, 'epoch': 0.75} 15%|███████████▍ | 669/4460 [1:06:09<7:02:33, 6.69s/it]g-point operations will not be computed-02 22:57:14,036 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:57:38,790 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:57:14,036 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:57:38,790 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:57:14,036 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8435, 'learning_rate': 6.7e-05, 'epoch': 0.75} [WARNING|modeling_utils.py:388] 2022-03-02 22:57:38,790 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:57:14,036 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:57:38,790 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:57:14,036 >> Could not estimate the number 
of tokens of the input, floating-point operations will not be computed 15%|███████████▍ | 671/4460 [1:06:22<6:57:40, 6.61s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:57:47,061 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▍ | 671/4460 [1:06:22<6:57:40, 6.61s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:57:47,061 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7617, 'learning_rate': 6.71e-05, 'epoch': 0.75} 15%|███████████▍ | 671/4460 [1:06:22<6:57:40, 6.61s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:57:47,061 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▍ | 672/4460 [1:06:28<6:56:10, 6.59s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:57:53,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▍ | 672/4460 [1:06:28<6:56:10, 6.59s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:57:53,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8906, 'learning_rate': 6.720000000000001e-05, 'epoch': 0.75} 15%|███████████▍ | 672/4460 [1:06:28<6:56:10, 6.59s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:57:53,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▍ | 673/4460 [1:06:35<6:54:44, 6.57s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:57:53,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▍ | 673/4460 [1:06:35<6:54:44, 6.57s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:57:53,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8486, 'learning_rate': 6.730000000000001e-05, 'epoch': 0.75} 
[WARNING|modeling_utils.py:388] 2022-03-02 22:58:03,303 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:57:53,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▍ | 674/4460 [1:06:41<6:53:20, 6.55s/it]g-point operations will not be computed-02 22:57:53,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▍ | 674/4460 [1:06:41<6:53:20, 6.55s/it]g-point operations will not be computed-02 22:57:53,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8124, 'learning_rate': 6.740000000000001e-05, 'epoch': 0.76} [WARNING|modeling_utils.py:388] 2022-03-02 22:58:09,724 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:57:53,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:58:09,724 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:57:53,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:58:09,724 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:57:53,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9221, 'learning_rate': 6.750000000000001e-05, 'epoch': 0.76} [WARNING|modeling_utils.py:388] 2022-03-02 22:58:09,724 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:57:53,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-02 22:58:09,724 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:57:53,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:58:09,724 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:57:53,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 676/4460 [1:06:55<6:55:14, 6.58s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:19,860 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 676/4460 [1:06:55<6:55:14, 6.58s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:19,860 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 676/4460 [1:06:55<6:55:14, 6.58s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:19,860 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 676/4460 [1:06:55<6:55:14, 6.58s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:19,860 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 677/4460 [1:07:01<6:49:33, 6.50s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:26,153 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 677/4460 [1:07:01<6:49:33, 6.50s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:26,153 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 677/4460 [1:07:01<6:49:33, 6.50s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:26,153 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 677/4460 [1:07:01<6:49:33, 6.50s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:26,153 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 678/4460 [1:07:07<6:44:38, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:32,404 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 678/4460 [1:07:07<6:44:38, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:32,404 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 678/4460 [1:07:07<6:44:38, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:32,404 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 678/4460 [1:07:07<6:44:38, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:32,404 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 679/4460 [1:07:14<6:41:51, 6.38s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:38,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 679/4460 [1:07:14<6:41:51, 6.38s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:38,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 679/4460 [1:07:14<6:41:51, 6.38s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:38,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 679/4460 [1:07:14<6:41:51, 6.38s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:38,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed 15%|███████████▌ | 680/4460 [1:07:20<6:37:24, 6.31s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:44,819 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 680/4460 [1:07:20<6:37:24, 6.31s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:44,819 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 680/4460 [1:07:20<6:37:24, 6.31s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:44,819 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 680/4460 [1:07:20<6:37:24, 6.31s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:44,819 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 681/4460 [1:07:26<6:33:15, 6.24s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:50,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 681/4460 [1:07:26<6:33:15, 6.24s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:50,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 681/4460 [1:07:26<6:33:15, 6.24s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:50,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 681/4460 [1:07:26<6:33:15, 6.24s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:50,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 682/4460 [1:07:32<6:29:57, 6.19s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:58:50,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:58:58,456 
>> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:58:50,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:58:58,456 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:58:50,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:58:58,456 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:58:50,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▋ | 683/4460 [1:07:38<6:26:34, 6.14s/it]g-point operations will not be computed-02 22:58:50,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:59:04,371 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:58:50,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:59:04,371 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:58:50,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:59:04,371 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:58:50,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▋ | 684/4460 [1:07:44<6:20:56, 6.05s/it][WARNING|modeling_utils.py:388] 2022-03-02 22:59:08,717 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:59:11,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:59:08,717 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▋ | 685/4460 [1:07:49<6:14:58, 5.96s/it]g-point operations will not be computed-02 22:59:08,717 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▋ | 685/4460 [1:07:49<6:14:58, 5.96s/it]g-point operations will not be computed-02 22:59:08,717 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9565, 'learning_rate': 6.850000000000001e-05, 'epoch': 0.77} [WARNING|modeling_utils.py:388] 2022-03-02 22:59:17,276 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:59:08,717 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:59:17,276 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 22:59:08,717 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▋ | 686/4460 [1:07:55<6:10:01, 5.88s/it]g-point operations will not be computed-02 22:59:08,717 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▋ | 686/4460 [1:07:55<6:10:01, 5.88s/it]g-point operations will not be computed-02 22:59:08,717 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 22:59:22,858 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 
22:59:08,717 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:59:22,858 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
15%|███████████▋ | 687/4460 [1:08:01<6:04:22, 5.79s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:59:25,733 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:59:29,733 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.9914, 'learning_rate': 6.879999999999999e-05, 'epoch': 0.77}
[WARNING|modeling_utils.py:388] 2022-03-02 22:59:33,732 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
15%|███████████▋ | 689/4460 [1:08:12<5:50:35, 5.58s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:59:37,712 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
15%|███████████▊ | 690/4460 [1:08:17<5:43:28, 5.47s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:59:41,572 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:59:45,229 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:59:47,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:59:49,926 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8301, 'learning_rate': 6.92e-05, 'epoch': 0.78}
[WARNING|modeling_utils.py:388] 2022-03-02 22:59:53,236 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|███████████▊ | 693/4460 [1:08:31<5:07:23, 4.90s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:59:55,379 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 22:59:57,281 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|███████████▊ | 694/4460 [1:08:35<4:49:10, 4.61s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 22:59:59,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|███████████▊ | 695/4460 [1:08:38<4:31:25, 4.33s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:00:02,836 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|███████████▊ | 696/4460 [1:08:42<4:12:40, 4.03s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:00:06,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|███████████▉ | 697/4460 [1:08:45<3:54:21, 3.74s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:00:09,053 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:00:10,369 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|███████████▉ | 698/4460 [1:08:48<3:35:45, 3.44s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:00:11,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|███████████▉ | 699/4460 [1:08:50<3:16:14, 3.13s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:00:14,049 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|███████████▉ | 700/4460 [1:08:53<3:08:24, 3.01s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:00:18,202 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:00:21,844 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|███████████▉ | 701/4460 [1:09:00<4:33:23, 4.36s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:00:25,531 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:00:29,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|███████████▉ | 702/4460 [1:09:07<5:28:10, 5.24s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:00:34,590 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|███████████▉ | 703/4460 [1:09:15<6:04:44, 5.82s/it]
{'loss': 6.9961, 'learning_rate': 7.03e-05, 'epoch': 0.79}
16%|███████████▉ | 704/4460 [1:09:22<6:29:13, 6.22s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:00:48,926 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|████████████ | 705/4460 [1:09:29<6:47:09, 6.51s/it]
{'loss': 6.9719, 'learning_rate': 7.05e-05, 'epoch': 0.79}
16%|████████████ | 706/4460 [1:09:36<6:56:29, 6.66s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:01:03,033 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|████████████ | 707/4460 [1:09:43<7:03:59, 6.78s/it]
16%|████████████ | 708/4460 [1:09:50<7:08:28, 6.85s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:01:15,370 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|████████████ | 709/4460 [1:09:57<7:11:46, 6.91s/it]
{'loss': 6.9168, 'learning_rate': 7.09e-05, 'epoch': 0.79}
[WARNING|modeling_utils.py:388] 2022-03-02 23:01:27,563 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8363, 'learning_rate': 7.1e-05, 'epoch': 0.8}
16%|████████████ | 711/4460 [1:10:11<7:12:29, 6.92s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:01:37,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|████████████▏ | 712/4460 [1:10:18<7:10:10, 6.89s/it]
{'loss': 6.743, 'learning_rate': 7.12e-05, 'epoch': 0.8}
[WARNING|modeling_utils.py:388] 2022-03-02 23:01:48,108 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8171, 'learning_rate': 7.13e-05, 'epoch': 0.8}
16%|████████████▏ | 714/4460 [1:10:32<7:09:57, 6.89s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:01:56,762 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|████████████▏ | 715/4460 [1:10:38<7:06:45, 6.84s/it]
{'loss': 6.9596, 'learning_rate': 7.15e-05, 'epoch': 0.8}
[WARNING|modeling_utils.py:388] 2022-03-02 23:02:08,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.9834, 'learning_rate': 7.16e-05, 'epoch': 0.8}
[WARNING|modeling_utils.py:388] 2022-03-02 23:02:15,095 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 7.1085, 'learning_rate': 7.17e-05, 'epoch': 0.8}
16%|████████████▏ | 718/4460 [1:10:58<6:58:40, 6.71s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:02:25,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|████████████▎ | 719/4460 [1:11:05<6:58:21, 6.71s/it]
{'loss': 6.7457, 'learning_rate': 7.19e-05, 'epoch': 0.81}
[WARNING|modeling_utils.py:388] 2022-03-02 23:02:33,422 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|████████████▎ | 720/4460 [1:11:12<6:57:28, 6.70s/it]
{'loss': 6.8237, 'learning_rate': 7.2e-05, 'epoch': 0.81}
[WARNING|modeling_utils.py:388] 2022-03-02 23:02:41,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8725, 'learning_rate': 7.21e-05, 'epoch': 0.81}
16%|████████████▎ | 722/4460 [1:11:25<6:55:29, 6.67s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:02:51,691 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|████████████▎ | 723/4460 [1:11:31<6:53:35, 6.64s/it]
{'loss': 6.786, 'learning_rate': 7.23e-05, 'epoch': 0.81}
[WARNING|modeling_utils.py:388] 2022-03-02 23:03:01,400 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.942, 'learning_rate': 7.24e-05, 'epoch': 0.81}
[WARNING|modeling_utils.py:388] 2022-03-02 23:03:06,278 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|████████████▎ | 725/4460 [1:11:45<6:57:51, 6.71s/it]
{'loss': 6.8645, 'learning_rate': 7.25e-05, 'epoch': 0.81}
16%|████████████▎ | 726/4460 [1:11:51<6:53:12, 6.64s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:03:16,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.9925, 'learning_rate': 7.26e-05, 'epoch': 0.81}
16%|████████████▍ | 727/4460 [1:11:58<6:47:42, 6.55s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:03:22,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8994, 'learning_rate': 7.27e-05, 'epoch': 0.82}
16%|████████████▍ | 728/4460 [1:12:04<6:42:39, 6.47s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:03:30,682 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|████████████▍ | 729/4460 [1:12:10<6:37:16, 6.39s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:03:36,888 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|████████████▍ | 730/4460 [1:12:16<6:35:28, 6.36s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:03:43,100 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|████████████▍ | 731/4460 [1:12:23<6:31:18, 6.30s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:03:49,264 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|████████████▍ | 732/4460 [1:12:29<6:27:13, 6.23s/it]g-point operations will not be computed-02 23:03:22,876 >> Could not estimate the number of tokens of the input, floating-point operations will
not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:03:55,289 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:03:22,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:03:55,289 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:03:22,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▍ | 733/4460 [1:12:35<6:23:10, 6.17s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:03:59,762 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▍ | 733/4460 [1:12:35<6:23:10, 6.17s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:03:59,762 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8379, 'learning_rate': 7.33e-05, 'epoch': 0.82} 16%|████████████▍ | 733/4460 [1:12:35<6:23:10, 6.17s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:03:59,762 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▌ | 734/4460 [1:12:41<6:17:15, 6.07s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▌ | 734/4460 [1:12:41<6:17:15, 6.07s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0602, 'learning_rate': 7.340000000000001e-05, 'epoch': 0.82} 16%|████████████▌ | 734/4460 [1:12:41<6:17:15, 6.07s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:09,844 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:09,844 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:09,844 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:09,844 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:15,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:15,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:19,731 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-02 23:04:19,731 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▌ | 737/4460 [1:12:58<5:59:20, 5.79s/it]g-point operations will not be computed-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:23,890 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:26,512 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:26,512 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8849, 'learning_rate': 7.38e-05, 'epoch': 0.83} [WARNING|modeling_utils.py:388] 2022-03-02 23:04:30,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:30,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▌ | 739/4460 
[1:13:08<5:43:40, 5.54s/it]g-point operations will not be computed-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:34,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:34,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:34,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:05,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▌ | 740/4460 [1:13:13<5:34:10, 5.39s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:04:38,119 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:40,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:38,119 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:40,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:38,119 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▋ | 741/4460 [1:13:18<5:25:58, 5.26s/it]g-point operations will not be computed-02 23:04:38,119 >> Could not estimate the number of tokens of the input, floating-point operations 
will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:44,258 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:38,119 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:46,554 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:38,119 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:46,554 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:38,119 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8951, 'learning_rate': 7.42e-05, 'epoch': 0.83} [WARNING|modeling_utils.py:388] 2022-03-02 23:04:50,004 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:38,119 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:50,004 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:38,119 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▋ | 743/4460 [1:13:28<5:06:39, 4.95s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:04:52,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:54,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:52,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-02 23:04:54,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:52,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▋ | 744/4460 [1:13:32<4:52:57, 4.73s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:04:56,394 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:58,258 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:56,394 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:04:58,258 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:04:56,394 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▋ | 745/4460 [1:13:36<4:36:46, 4.47s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:05:00,180 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▋ | 746/4460 [1:13:39<4:19:19, 4.19s/it]g-point operations will not be computed-02 23:05:00,180 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▋ | 746/4460 [1:13:39<4:19:19, 4.19s/it]g-point operations will not be computed-02 23:05:00,180 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:05:05,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:03,602 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-02 23:05:05,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:03,602 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▋ | 747/4460 [1:13:42<3:59:50, 3.88s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:05:06,647 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▋ | 748/4460 [1:13:45<3:38:50, 3.54s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:05:09,291 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▋ | 748/4460 [1:13:45<3:38:50, 3.54s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:05:09,291 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▊ | 749/4460 [1:13:48<3:18:48, 3.21s/it]g-point operations will not be computed-02 23:05:09,291 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▊ | 749/4460 [1:13:48<3:18:48, 3.21s/it]g-point operations will not be computed-02 23:05:09,291 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:05:12,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:11,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:05:12,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:11,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▊ | 750/4460 [1:13:51<3:13:41, 3.13s/it]g-point operations will 
not be computed-02 23:05:11,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▊ | 750/4460 [1:13:51<3:13:41, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:05:16,125 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▊ | 750/4460 [1:13:51<3:13:41, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:05:16,125 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:05:19,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:16,125 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:05:19,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:16,125 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▊ | 751/4460 [1:13:58<4:37:42, 4.49s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▊ | 751/4460 [1:13:58<4:37:42, 4.49s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:05:27,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▊ | 752/4460 [1:14:06<5:32:40, 5.38s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed 17%|████████████▊ | 752/4460 [1:14:06<5:32:40, 5.38s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0678, 'learning_rate': 7.52e-05, 'epoch': 0.84} 17%|████████████▊ | 752/4460 [1:14:06<5:32:40, 5.38s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▊ | 752/4460 [1:14:06<5:32:40, 5.38s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▊ | 752/4460 [1:14:06<5:32:40, 5.38s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▊ | 753/4460 [1:14:13<6:09:36, 5.98s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▊ | 753/4460 [1:14:13<6:09:36, 5.98s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▊ | 753/4460 [1:14:13<6:09:36, 5.98s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:05:43,758 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:05:43,758 >> Could not estimate the number 
of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9445, 'learning_rate': 7.54e-05, 'epoch': 0.85} [WARNING|modeling_utils.py:388] 2022-03-02 23:05:43,758 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:05:43,758 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:05:43,758 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▊ | 755/4460 [1:14:27<6:48:18, 6.61s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▊ | 755/4460 [1:14:27<6:48:18, 6.61s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▊ | 755/4460 [1:14:27<6:48:18, 6.61s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:05:58,117 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations 
will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:05:58,117 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0371, 'learning_rate': 7.560000000000001e-05, 'epoch': 0.85} [WARNING|modeling_utils.py:388] 2022-03-02 23:05:58,117 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:05:58,117 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:05:58,117 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▉ | 757/4460 [1:14:42<7:03:48, 6.87s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▉ | 757/4460 [1:14:42<7:03:48, 6.87s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▉ | 757/4460 [1:14:42<7:03:48, 6.87s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:06:12,280 >> Could not estimate the number of tokens of the input, floating-point operations 
will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:06:12,280 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8103, 'learning_rate': 7.58e-05, 'epoch': 0.85} [WARNING|modeling_utils.py:388] 2022-03-02 23:06:12,280 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:06:12,280 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:06:12,280 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▉ | 759/4460 [1:14:56<7:10:20, 6.98s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▉ | 759/4460 [1:14:56<7:10:20, 6.98s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▉ | 759/4460 [1:14:56<7:10:20, 6.98s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 
2022-03-02 23:06:26,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:06:26,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8274, 'learning_rate': 7.6e-05, 'epoch': 0.85} [WARNING|modeling_utils.py:388] 2022-03-02 23:06:26,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:06:26,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:06:26,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▉ | 761/4460 [1:15:10<7:08:25, 6.95s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:06:36,823 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:06:36,823 >> Could not estimate 
the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▉ | 762/4460 [1:15:17<7:08:13, 6.95s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|████████████▉ | 762/4460 [1:15:17<7:08:13, 6.95s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8471, 'learning_rate': 7.620000000000001e-05, 'epoch': 0.85} [WARNING|modeling_utils.py:388] 2022-03-02 23:06:45,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████ | 763/4460 [1:15:24<7:06:23, 6.92s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████ | 763/4460 [1:15:24<7:06:23, 6.92s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7048, 'learning_rate': 7.630000000000001e-05, 'epoch': 0.86} 17%|█████████████ | 763/4460 [1:15:24<7:06:23, 6.92s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████ | 763/4460 [1:15:24<7:06:23, 6.92s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████ | 764/4460 [1:15:30<7:06:31, 6.92s/it]g-point operations 
will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████ | 764/4460 [1:15:30<7:06:31, 6.92s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7163, 'learning_rate': 7.64e-05, 'epoch': 0.86} [WARNING|modeling_utils.py:388] 2022-03-02 23:06:59,087 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████ | 765/4460 [1:15:37<7:04:17, 6.89s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████ | 765/4460 [1:15:37<7:04:17, 6.89s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8448, 'learning_rate': 7.65e-05, 'epoch': 0.86} 17%|█████████████ | 765/4460 [1:15:37<7:04:17, 6.89s/it]g-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:07:07,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:07:07,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:05:23,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7835, 'learning_rate': 
7.66e-05, 'epoch': 0.86}
[WARNING|modeling_utils.py:388] 2022-03-02 23:07:07,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
17%|█████████████ | 767/4460 [1:15:51<7:01:44, 6.85s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:07:17,816 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
17%|█████████████ | 768/4460 [1:15:58<6:58:36, 6.80s/it]
{'loss': 6.8388, 'learning_rate': 7.680000000000001e-05, 'epoch': 0.86}
[WARNING|modeling_utils.py:388] 2022-03-02 23:07:27,781 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.9558, 'learning_rate': 7.69e-05, 'epoch': 0.86}
17%|█████████████ | 770/4460 [1:16:11<6:52:49, 6.71s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:07:36,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.753, 'learning_rate': 7.7e-05, 'epoch': 0.86}
17%|█████████████▏ | 771/4460 [1:16:18<6:51:12, 6.69s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:07:44,366 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
17%|█████████████▏ | 772/4460 [1:16:24<6:49:33, 6.66s/it]
{'loss': 6.9966, 'learning_rate': 7.72e-05, 'epoch': 0.87}
[WARNING|modeling_utils.py:388] 2022-03-02 23:07:52,611 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
17%|█████████████▏ | 773/4460 [1:16:31<6:48:47, 6.65s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:08:00,701 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.7173, 'learning_rate': 7.740000000000001e-05, 'epoch': 0.87}
17%|█████████████▏ | 775/4460 [1:16:44<6:52:22, 6.71s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:08:11,019 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
17%|█████████████▏ | 776/4460 [1:16:51<6:46:36, 6.62s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:08:17,296 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
17%|█████████████▏ | 777/4460 [1:16:57<6:42:56, 6.56s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:08:25,244 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
17%|█████████████▎ | 778/4460 [1:17:03<6:37:28, 6.48s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:08:31,536 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
17%|█████████████▎ | 779/4460 [1:17:10<6:32:54, 6.40s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:08:37,692 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
17%|█████████████▎ | 780/4460 [1:17:16<6:28:06, 6.33s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:08:43,965 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▎ | 781/4460 [1:17:22<6:27:06, 6.31s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:08:51,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.7788, 'learning_rate': 7.82e-05, 'epoch': 0.88}
[WARNING|modeling_utils.py:388] 2022-03-02 23:08:56,082 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▎ | 783/4460 [1:17:34<6:17:53, 6.17s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:09:02,034 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▎ | 784/4460 [1:17:40<6:13:11, 6.09s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:09:07,774 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▍ | 785/4460 [1:17:46<6:06:18, 5.98s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:09:12,095 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▍ | 786/4460 [1:17:51<6:01:15, 5.90s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:09:16,342 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:09:20,400 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8262, 'learning_rate': 7.87e-05, 'epoch': 0.88}
18%|█████████████▍ | 788/4460 [1:18:03<6:07:47, 6.01s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:09:28,547 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:09:32,411 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8205, 'learning_rate': 7.890000000000001e-05, 'epoch': 0.88}
[WARNING|modeling_utils.py:388] 2022-03-02 23:09:36,427 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▍ | 790/4460 [1:18:14<5:47:13, 5.68s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:09:40,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:09:42,696 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.7078, 'learning_rate': 7.910000000000001e-05, 'epoch': 0.89}
[WARNING|modeling_utils.py:388] 2022-03-02 23:09:46,285 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▍ | 792/4460 [1:18:24<5:21:02, 5.25s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:09:48,661 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:09:50,842 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▌ | 793/4460 [1:18:28<5:06:36, 5.02s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:09:53,014 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:09:55,052 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▌ | 794/4460 [1:18:33<4:50:01, 4.75s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:09:57,036 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:09:58,856 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▌ | 795/4460 [1:18:36<4:31:39, 4.45s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:10:00,731 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▌ | 796/4460 [1:18:40<4:15:41, 4.19s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:10:04,232 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▌ | 797/4460 [1:18:43<3:54:47, 3.85s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:10:07,174 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▌ | 798/4460 [1:18:46<3:34:09, 3.51s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:10:09,831 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:10:10,993 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:10:12,218 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:10:13,241 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▋ | 800/4460 [1:18:51<3:06:47, 3.06s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:10:16,736 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:10:20,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▋ | 801/4460 [1:18:59<4:35:48, 4.52s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▋ | 802/4460 [1:19:06<5:30:31, 5.42s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:10:33,482 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▋ | 803/4460 [1:19:14<6:07:19, 6.03s/it]
{'loss': 7.05, 'learning_rate': 8.030000000000001e-05, 'epoch': 0.9}
18%|█████████████▋ | 804/4460 [1:19:21<6:28:16, 6.37s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:10:49,770 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▋ | 805/4460 [1:19:28<6:42:08, 6.60s/it]
{'loss': 6.7699, 'learning_rate': 8.05e-05, 'epoch': 0.9}
18%|█████████████▋ | 806/4460 [1:19:35<6:50:32, 6.74s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:11:02,164 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▊ | 807/4460 [1:19:42<6:56:15, 6.84s/it]
{'loss': 6.9359, 'learning_rate': 8.070000000000001e-05, 'epoch': 0.9}
18%|█████████████▊ | 808/4460 [1:19:49<6:59:54, 6.90s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:11:16,213 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|█████████████▊ | 809/4460 [1:19:56<7:00:02, 6.90s/it]
{'loss': 6.8661, 'learning_rate': 8.090000000000001e-05, 'epoch': 0.91}
{'loss': 6.7949, 'learning_rate': 8.1e-05, 'epoch': 0.91}
[WARNING|modeling_utils.py:388] 2022-03-02 23:11:26,528 >> Could not estimate the number of tokens of the input, floating-point operations will not be
computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:11:26,528 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:11:26,528 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 811/4460 [1:20:10<6:59:14, 6.89s/it]g-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 811/4460 [1:20:10<6:59:14, 6.89s/it]g-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:11:38,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 812/4460 [1:20:17<6:58:54, 6.89s/it]g-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 812/4460 [1:20:17<6:58:54, 6.89s/it]g-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9528, 'learning_rate': 8.120000000000001e-05, 'epoch': 0.91} 18%|█████████████▊ | 812/4460 [1:20:17<6:58:54, 6.89s/it]g-point operations will not be 
computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:11:47,140 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:11:47,140 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7941, 'learning_rate': 8.13e-05, 'epoch': 0.91} [WARNING|modeling_utils.py:388] 2022-03-02 23:11:47,140 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:11:47,140 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:11:47,140 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 814/4460 [1:20:30<6:56:58, 6.86s/it]g-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:11:57,342 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:10:24,204 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:11:57,342 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 815/4460 [1:20:37<6:53:47, 6.81s/it]g-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 815/4460 [1:20:37<6:53:47, 6.81s/it]g-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.897, 'learning_rate': 8.15e-05, 'epoch': 0.91} [WARNING|modeling_utils.py:388] 2022-03-02 23:12:05,654 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 816/4460 [1:20:44<6:51:05, 6.77s/it]g-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 816/4460 [1:20:44<6:51:05, 6.77s/it]g-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7847, 'learning_rate': 8.16e-05, 'epoch': 0.91} [WARNING|modeling_utils.py:388] 2022-03-02 23:12:12,289 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:12:12,289 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:12:12,289 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9361, 'learning_rate': 8.17e-05, 'epoch': 0.92} [WARNING|modeling_utils.py:388] 2022-03-02 23:12:12,289 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:12:20,473 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:12:20,473 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8414, 'learning_rate': 8.18e-05, 'epoch': 0.92} [WARNING|modeling_utils.py:388] 2022-03-02 23:12:20,473 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:12:20,473 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:10:24,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
18%|█████████████▉ | 819/4460 [1:21:03<6:41:32, 6.62s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 819/4460 [1:21:03<6:41:32, 6.62s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7702, 'learning_rate': 8.19e-05, 'epoch': 0.92} 18%|█████████████▉ | 819/4460 [1:21:03<6:41:32, 6.62s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 819/4460 [1:21:03<6:41:32, 6.62s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 820/4460 [1:21:10<6:39:30, 6.59s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 820/4460 [1:21:10<6:39:30, 6.59s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:12:38,387 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:12:38,387 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 821/4460 [1:21:16<6:37:27, 6.55s/it]g-point operations 
will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 821/4460 [1:21:16<6:37:27, 6.55s/it]g-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 821/4460 [1:21:16<6:37:27, 6.55s/it]g-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 821/4460 [1:21:16<6:37:27, 6.55s/it]g-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:12:46,354 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:12:46,354 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:12:46,354 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:12:52,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:12:52,752 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9144, 'learning_rate': 8.23e-05, 'epoch': 0.92} [WARNING|modeling_utils.py:388] 2022-03-02 23:12:52,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:12:59,068 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:12:59,068 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7956, 'learning_rate': 8.24e-05, 'epoch': 0.92} [WARNING|modeling_utils.py:388] 2022-03-02 23:12:59,068 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:12:59,068 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:12:59,068 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
18%|██████████████ | 825/4460 [1:21:43<6:37:46, 6.57s/it]g-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:09,263 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:09,263 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:09,263 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████ | 826/4460 [1:21:49<6:33:42, 6.50s/it]g-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████ | 826/4460 [1:21:49<6:33:42, 6.50s/it]g-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:16,926 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:16,926 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed 19%|██████████████ | 827/4460 [1:21:55<6:26:12, 6.38s/it]g-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████ | 827/4460 [1:21:55<6:26:12, 6.38s/it]g-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:23,039 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:23,039 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████ | 828/4460 [1:22:01<6:21:53, 6.31s/it]g-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████ | 828/4460 [1:22:01<6:21:53, 6.31s/it]g-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:29,185 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:29,185 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 19%|██████████████▏ | 829/4460 [1:22:07<6:19:34, 6.27s/it]g-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:33,808 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:33,808 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:33,808 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▏ | 830/4460 [1:22:13<6:14:21, 6.19s/it]g-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:39,815 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:39,815 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:39,815 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▏ | 831/4460 [1:22:19<6:11:53, 6.15s/it]g-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:45,847 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:45,847 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:45,847 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▏ | 832/4460 [1:22:25<6:09:11, 6.11s/it]g-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:51,781 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:51,781 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed [WARNING|modeling_utils.py:388] 2022-03-02 23:13:51,781 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:12:28,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▏ | 833/4460 [1:22:31<6:05:40, 6.05s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:13:56,236 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▏ | 833/4460 [1:22:31<6:05:40, 6.05s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:13:56,236 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:14:00,541 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:13:56,236 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:14:00,541 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:13:56,236 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8443, 'learning_rate': 8.34e-05, 'epoch': 0.93} [WARNING|modeling_utils.py:388] 2022-03-02 23:14:00,541 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:13:56,236 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:14:06,221 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:13:56,236 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:14:06,221 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed-02 23:13:56,236 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8043, 'learning_rate': 8.35e-05, 'epoch': 0.94} [WARNING|modeling_utils.py:388] 2022-03-02 23:14:10,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:13:56,236 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▏ | 836/4460 [1:22:48<5:51:57, 5.83s/it]g-point operations will not be computed-02 23:13:56,236 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▏ | 836/4460 [1:22:48<5:51:57, 5.83s/it]g-point operations will not be computed-02 23:13:56,236 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:14:14,682 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:13:56,236 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:14:14,682 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:13:56,236 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:14:14,682 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:13:56,236 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▎ | 837/4460 [1:22:54<5:44:02, 5.70s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:14:18,727 >> Could not estimate the number of tokens of the input, floating-point operations 
will not be computed
19%|██████████████▎ | 837/4460 [1:22:54<5:44:02, 5.70s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:14:18,727 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:14:22,556 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.9342, 'learning_rate': 8.38e-05, 'epoch': 0.94}
[WARNING|modeling_utils.py:388] 2022-03-02 23:14:26,517 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
19%|██████████████▎ | 839/4460 [1:23:04<5:29:01, 5.45s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:14:29,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.7495, 'learning_rate': 8.39e-05, 'epoch': 0.94}
[WARNING|modeling_utils.py:388] 2022-03-02 23:14:32,549 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:14:34,940 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:14:37,137 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.9511, 'learning_rate': 8.41e-05, 'epoch': 0.94}
[WARNING|modeling_utils.py:388] 2022-03-02 23:14:40,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
19%|██████████████▎ | 842/4460 [1:23:18<4:51:53, 4.84s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:14:42,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:14:44,538 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
19%|██████████████▎ | 843/4460 [1:23:22<4:37:00, 4.60s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:14:46,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
19%|██████████████▍ | 844/4460 [1:23:26<4:22:20, 4.35s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:14:50,248 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
19%|██████████████▍ | 845/4460 [1:23:29<4:05:07, 4.07s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:14:53,554 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:14:55,034 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
19%|██████████████▍ | 846/4460 [1:23:32<3:46:51, 3.77s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:14:56,554 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
19%|██████████████▍ | 847/4460 [1:23:35<3:28:30, 3.46s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:14:59,250 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
19%|██████████████▍ | 848/4460 [1:23:38<3:11:27, 3.18s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:15:01,719 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
19%|██████████████▍ | 849/4460 [1:23:40<2:57:05, 2.94s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:15:04,042 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:15:05,047 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
19%|██████████████▍ | 850/4460 [1:23:43<2:52:30, 2.87s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:15:08,215 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:15:11,845 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
19%|██████████████▌ | 851/4460 [1:23:50<4:18:01, 4.29s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:15:15,655 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:15:19,275 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
19%|██████████████▌ | 852/4460 [1:23:58<5:13:34, 5.21s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:15:22,990 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:15:28,358 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 7.2565, 'learning_rate': 8.53e-05, 'epoch': 0.96}
19%|██████████████▌ | 854/4460 [1:24:12<6:16:02, 6.26s/it]
19%|██████████████▌ | 855/4460 [1:24:19<6:32:25, 6.53s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:15:46,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
19%|██████████████▌ | 856/4460 [1:24:26<6:43:12, 6.71s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:15:56,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 7.0794, 'learning_rate': 8.57e-05, 'epoch': 0.96}
19%|██████████████▌ | 858/4460 [1:24:40<6:50:47, 6.84s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:16:07,371 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
19%|██████████████▋ | 859/4460 [1:24:47<6:52:00, 6.86s/it]
{'loss': 6.7794, 'learning_rate': 8.59e-05, 'epoch': 0.96}
[WARNING|modeling_utils.py:388] 2022-03-02 23:16:17,590 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8992, 'learning_rate': 8.6e-05, 'epoch': 0.96}
19%|██████████████▋ | 861/4460 [1:25:01<6:51:32, 6.86s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:16:29,592 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
19%|██████████████▋ | 862/4460 [1:25:08<6:49:26, 6.83s/it]
{'loss': 6.9089, 'learning_rate': 8.620000000000001e-05, 'epoch': 0.97}
[WARNING|modeling_utils.py:388] 2022-03-02 23:16:37,967 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8525, 'learning_rate': 8.63e-05, 'epoch': 0.97}
19%|██████████████▋ | 864/4460 [1:25:21<6:46:34, 6.78s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:16:46,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
19%|██████████████▋ | 865/4460 [1:25:28<6:43:23, 6.73s/it]
{'loss': 6.9645, 'learning_rate': 8.65e-05, 'epoch': 0.97}
[WARNING|modeling_utils.py:388] 2022-03-02 23:16:56,347 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:16:46,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▊ | 866/4460 [1:25:34<6:41:48, 6.71s/it]g-point operations will not be computed-02 23:16:46,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▊ | 866/4460 [1:25:34<6:41:48, 6.71s/it]g-point operations will not be computed-02 23:16:46,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7923, 'learning_rate': 8.66e-05, 'epoch': 0.97} 19%|██████████████▊ | 866/4460 [1:25:34<6:41:48, 6.71s/it]g-point operations will not be computed-02 23:16:46,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:17:04,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:16:46,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:17:04,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:16:46,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0068, 'learning_rate': 8.67e-05, 'epoch': 0.97} [WARNING|modeling_utils.py:388] 2022-03-02 23:17:04,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:16:46,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:17:04,535 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed-02 23:16:46,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:17:04,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:16:46,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▊ | 868/4460 [1:25:48<6:38:37, 6.66s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▊ | 868/4460 [1:25:48<6:38:37, 6.66s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▊ | 868/4460 [1:25:48<6:38:37, 6.66s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▊ | 868/4460 [1:25:48<6:38:37, 6.66s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▊ | 869/4460 [1:25:54<6:35:53, 6.61s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:17:20,990 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:17:20,990 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:17:12,916 
>> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:17:20,990 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|██████████████▊ | 870/4460 [1:26:01<6:32:25, 6.56s/it]g-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|██████████████▊ | 870/4460 [1:26:01<6:32:25, 6.56s/it]g-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:17:28,940 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|██████████████▊ | 871/4460 [1:26:07<6:29:19, 6.51s/it]g-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|██████████████▊ | 871/4460 [1:26:07<6:29:19, 6.51s/it]g-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8052, 'learning_rate': 8.71e-05, 'epoch': 0.98} [WARNING|modeling_utils.py:388] 2022-03-02 23:17:35,297 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|██████████████▊ | 872/4460 [1:26:13<6:25:40, 6.45s/it]g-point operations will not be computed-02 23:17:12,916 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed 20%|██████████████▊ | 872/4460 [1:26:13<6:25:40, 6.45s/it]g-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8843, 'learning_rate': 8.72e-05, 'epoch': 0.98} [WARNING|modeling_utils.py:388] 2022-03-02 23:17:41,633 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|██████████████▉ | 873/4460 [1:26:20<6:24:14, 6.43s/it]g-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|██████████████▉ | 873/4460 [1:26:20<6:24:14, 6.43s/it]g-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8487, 'learning_rate': 8.730000000000001e-05, 'epoch': 0.98} [WARNING|modeling_utils.py:388] 2022-03-02 23:17:47,896 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|██████████████▉ | 874/4460 [1:26:26<6:20:53, 6.37s/it]g-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|██████████████▉ | 874/4460 [1:26:26<6:20:53, 6.37s/it]g-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9423, 'learning_rate': 8.740000000000001e-05, 'epoch': 0.98} 20%|██████████████▉ | 874/4460 [1:26:26<6:20:53, 
6.37s/it]g-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:17:56,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:17:56,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9963, 'learning_rate': 8.75e-05, 'epoch': 0.98} [WARNING|modeling_utils.py:388] 2022-03-02 23:17:56,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:17:56,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:17:56,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:17:12,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|██████████████▉ | 876/4460 [1:26:39<6:22:11, 6.40s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:18:03,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|██████████████▉ | 876/4460 [1:26:39<6:22:11, 6.40s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:18:03,973 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:18:08,335 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:18:03,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:18:08,335 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:18:03,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.694, 'learning_rate': 8.77e-05, 'epoch': 0.98} [WARNING|modeling_utils.py:388] 2022-03-02 23:18:08,335 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:18:03,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:18:14,235 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:18:03,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:18:14,235 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:18:03,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8147, 'learning_rate': 8.78e-05, 'epoch': 0.98} [WARNING|modeling_utils.py:388] 2022-03-02 23:18:18,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:18:03,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|██████████████▉ | 879/4460 [1:26:57<6:00:22, 6.04s/it]g-point operations will not be computed-02 23:18:03,973 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed 20%|██████████████▉ | 879/4460 [1:26:57<6:00:22, 6.04s/it]g-point operations will not be computed-02 23:18:03,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8343, 'learning_rate': 8.790000000000001e-05, 'epoch': 0.99} [WARNING|modeling_utils.py:388] 2022-03-02 23:18:24,295 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:18:03,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|██████████████▉ | 880/4460 [1:27:02<5:54:01, 5.93s/it]g-point operations will not be computed-02 23:18:03,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|██████████████▉ | 880/4460 [1:27:02<5:54:01, 5.93s/it]g-point operations will not be computed-02 23:18:03,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:18:28,569 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:18:03,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:18:28,569 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:18:03,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████ | 881/4460 [1:27:08<5:47:29, 5.83s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:18:32,715 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████ | 881/4460 [1:27:08<5:47:29, 5.83s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:18:32,715 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6338, 'learning_rate': 8.81e-05, 'epoch': 0.99} [WARNING|modeling_utils.py:388] 2022-03-02 23:18:36,658 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8634, 'learning_rate': 8.82e-05, 'epoch': 0.99} [WARNING|modeling_utils.py:388] 2022-03-02 23:18:40,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████ | 883/4460 [1:27:18<5:29:57, 5.53s/it] [WARNING|modeling_utils.py:388] 2022-03-02 23:18:44,383 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:18:46,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:18:49,111 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:18:51,286 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:18:53,498 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:18:55,540 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8339, 'learning_rate': 8.86e-05, 'epoch': 0.99} 20%|███████████████ | 887/4460 [1:27:36<4:32:22, 4.57s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:19:00,466 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▏ | 888/4460 [1:27:40<4:14:13, 4.27s/it] [WARNING|modeling_utils.py:388] 2022-03-02 23:19:03,931 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:19:05,494 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▏ | 889/4460 [1:27:43<3:55:19, 3.95s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:19:07,051 >> Could not estimate the number of tokens of the input, floating-point operations will not be
computed 20%|███████████████▏ | 890/4460 [1:27:46<3:34:21, 3.60s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:19:09,725 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▏ | 891/4460 [1:27:48<3:12:19, 3.23s/it] [WARNING|modeling_utils.py:388] 2022-03-02 23:19:12,073 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▏ | 892/4460 [1:27:50<2:51:30, 2.88s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:19:15,662 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:19:19,356 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▏ | 893/4460 [1:27:58<4:17:24, 4.33s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:19:23,109 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:19:26,736 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▏ | 894/4460 [1:28:05<5:11:10, 5.24s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:19:33,950 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▎ | 895/4460 [1:28:12<5:46:08, 5.83s/it] {'loss': 6.84, 'learning_rate': 8.950000000000001e-05, 'epoch': 1.0} 20%|███████████████▎ | 896/4460 [1:28:19<6:08:42, 6.21s/it] [WARNING|modeling_utils.py:388] 2022-03-02 23:19:48,188 >> Could not estimate the number of tokens of the input,
floating-point operations will not be computed 20%|███████████████▎ | 897/4460 [1:28:26<6:24:29, 6.47s/it] {'loss': 6.8529, 'learning_rate': 8.970000000000001e-05, 'epoch': 1.01} 20%|███████████████▎ | 898/4460 [1:28:34<6:35:56, 6.67s/it] [WARNING|modeling_utils.py:388] 2022-03-02 23:20:02,394 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▎ | 899/4460 [1:28:41<6:42:33, 6.78s/it] {'loss': 6.8332, 'learning_rate': 8.99e-05, 'epoch': 1.01} 20%|███████████████▎ | 900/4460 [1:28:48<6:55:45, 7.01s/it] [WARNING|modeling_utils.py:388] 2022-03-02 23:20:18,627 >> Could not estimate the number of tokens of the input, floating-point
operations will not be computed {'loss': 6.9338, 'learning_rate': 9.010000000000001e-05, 'epoch': 1.01} 20%|███████████████▎ | 902/4460 [1:29:02<6:55:20, 7.00s/it] [WARNING|modeling_utils.py:388] 2022-03-02 23:20:29,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be
computed 20%|███████████████▍ | 903/4460 [1:29:09<6:53:28, 6.97s/it] {'loss': 6.7139, 'learning_rate': 9.030000000000001e-05, 'epoch': 1.01} [WARNING|modeling_utils.py:388] 2022-03-02 23:20:39,339 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8658, 'learning_rate': 9.04e-05, 'epoch': 1.01} [WARNING|modeling_utils.py:388] 2022-03-02 23:20:42,803 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 905/4460 [1:29:23<6:47:44, 6.88s/it] [WARNING|modeling_utils.py:388] 2022-03-02 23:20:51,195 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 906/4460 [1:29:29<6:44:43, 6.83s/it] {'loss': 6.7589, 'learning_rate': 9.06e-05, 'epoch': 1.02} [WARNING|modeling_utils.py:388] 2022-03-02 23:20:59,490 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9746, 'learning_rate': 9.070000000000001e-05, 'epoch': 1.02} [WARNING|modeling_utils.py:388] 2022-03-02 23:21:06,103 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9491, 'learning_rate': 9.080000000000001e-05, 'epoch': 1.02} [WARNING|modeling_utils.py:388] 2022-03-02 23:21:09,452 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 909/4460 [1:29:49<6:36:26, 6.70s/it] {'loss': 6.8044, 'learning_rate': 9.090000000000001e-05, 'epoch': 1.02} [WARNING|modeling_utils.py:388] 2022-03-02 23:21:17,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▌ | 910/4460 [1:29:56<6:34:17, 6.66s/it] {'loss': 6.6508, 'learning_rate': 9.1e-05, 'epoch': 1.02} [WARNING|modeling_utils.py:388] 2022-03-02 23:21:25,857 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7558, 'learning_rate': 9.11e-05, 'epoch': 1.02}
20%|███████████████▌ | 912/4460 [1:30:09<6:31:38, 6.62s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:21:34,182 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▌ | 913/4460 [1:30:15<6:29:44, 6.59s/it] [WARNING|modeling_utils.py:388] 2022-03-02 23:21:42,324 >> Could
not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▌ | 914/4460 [1:30:22<6:29:23, 6.59s/it] [WARNING|modeling_utils.py:388] 2022-03-02 23:21:48,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▌ | 915/4460 [1:30:28<6:26:16, 6.54s/it] {'loss': 6.6343, 'learning_rate': 9.15e-05, 'epoch': 1.03}
[WARNING|modeling_utils.py:388] 2022-03-02 23:21:56,798 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▌ | 916/4460 [1:30:35<6:23:27, 6.49s/it] {'loss': 6.7585, 'learning_rate': 9.16e-05, 'epoch': 1.03} [WARNING|modeling_utils.py:388] 2022-03-02 23:22:03,137 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 917/4460 [1:30:41<6:20:08, 6.44s/it] {'loss': 6.725, 'learning_rate': 9.17e-05, 'epoch': 1.03} [WARNING|modeling_utils.py:388] 2022-03-02 23:22:11,013 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8298, 'learning_rate': 9.180000000000001e-05, 'epoch': 1.03} 21%|███████████████▋ | 919/4460 [1:30:54<6:16:34, 6.38s/it][WARNING|modeling_utils.py:388]
2022-03-02 23:22:18,997 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 920/4460 [1:31:00<6:12:51, 6.32s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:25,139 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 920/4460 [1:31:00<6:12:51, 6.32s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:25,139 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 920/4460 [1:31:00<6:12:51, 6.32s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:25,139 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 920/4460 [1:31:00<6:12:51, 6.32s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:25,139 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 921/4460 [1:31:06<6:09:36, 6.27s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:31,281 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 921/4460 [1:31:06<6:09:36, 6.27s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:31,281 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 921/4460 [1:31:06<6:09:36, 6.27s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:31,281 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 921/4460 [1:31:06<6:09:36, 6.27s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:31,281 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 922/4460 [1:31:12<6:06:54, 6.22s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:37,402 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 922/4460 [1:31:12<6:06:54, 6.22s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:37,402 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 922/4460 [1:31:12<6:06:54, 6.22s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:37,402 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 922/4460 [1:31:12<6:06:54, 6.22s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:37,402 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 923/4460 [1:31:18<6:05:14, 6.20s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:43,515 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 923/4460 [1:31:18<6:05:14, 6.20s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:43,515 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 923/4460 [1:31:18<6:05:14, 6.20s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:43,515 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 923/4460 [1:31:18<6:05:14, 6.20s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:43,515 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 924/4460 [1:31:24<6:02:31, 6.15s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:49,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 924/4460 [1:31:24<6:02:31, 6.15s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:49,525 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed 21%|███████████████▋ | 924/4460 [1:31:24<6:02:31, 6.15s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:49,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▊ | 925/4460 [1:31:31<6:08:42, 6.26s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:49,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▊ | 925/4460 [1:31:31<6:08:42, 6.26s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:22:49,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:22:57,579 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:22:49,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:22:57,579 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:22:49,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▊ | 926/4460 [1:31:37<6:02:41, 6.16s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:23:01,913 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▊ | 926/4460 [1:31:37<6:02:41, 6.16s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:23:01,913 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7668, 'learning_rate': 9.260000000000001e-05, 'epoch': 1.04} [WARNING|modeling_utils.py:388] 2022-03-02 23:23:06,113 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:01,913 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:23:06,113 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:01,913 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8041, 'learning_rate': 9.27e-05, 'epoch': 1.04} [WARNING|modeling_utils.py:388] 2022-03-02 23:23:10,404 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:01,913 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▊ | 928/4460 [1:31:48<5:48:56, 5.93s/it]g-point operations will not be computed-02 23:23:01,913 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▊ | 928/4460 [1:31:48<5:48:56, 5.93s/it]g-point operations will not be computed-02 23:23:01,913 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7687, 'learning_rate': 9.28e-05, 'epoch': 1.04} [WARNING|modeling_utils.py:388] 2022-03-02 23:23:15,994 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:01,913 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▊ | 929/4460 [1:31:54<5:41:52, 5.81s/it]g-point operations will not be computed-02 23:23:01,913 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▊ | 929/4460 [1:31:54<5:41:52, 5.81s/it]g-point operations will not be computed-02 23:23:01,913 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:23:20,202 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:01,913 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:23:20,202 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:01,913 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▊ | 930/4460 [1:31:59<5:36:04, 5.71s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:23:24,251 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▊ | 930/4460 [1:31:59<5:36:04, 5.71s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:23:24,251 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9067, 'learning_rate': 9.300000000000001e-05, 'epoch': 1.04} [WARNING|modeling_utils.py:388] 2022-03-02 23:23:28,098 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:24,251 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:23:28,098 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:24,251 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7715, 'learning_rate': 9.310000000000001e-05, 'epoch': 1.04} [WARNING|modeling_utils.py:388] 2022-03-02 23:23:31,980 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:24,251 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▉ | 932/4460 [1:32:10<5:19:55, 
5.44s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:23:34,542 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▉ | 932/4460 [1:32:10<5:19:55, 5.44s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:23:34,542 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7018, 'learning_rate': 9.320000000000002e-05, 'epoch': 1.04} [WARNING|modeling_utils.py:388] 2022-03-02 23:23:38,109 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:34,542 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:23:38,109 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:34,542 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:23:40,484 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:34,542 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:23:42,714 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:34,542 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:23:42,714 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:34,542 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8784, 'learning_rate': 9.340000000000001e-05, 'epoch': 1.05} [WARNING|modeling_utils.py:388] 2022-03-02 23:23:46,079 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:34,542 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:23:46,079 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:34,542 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▉ | 935/4460 [1:32:24<4:46:46, 4.88s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:23:48,292 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:23:50,312 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:48,292 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:23:50,312 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:48,292 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▉ | 936/4460 [1:32:28<4:33:55, 4.66s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:23:52,364 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▉ | 937/4460 [1:32:32<4:18:13, 4.40s/it]g-point operations will not be computed-02 23:23:52,364 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▉ | 937/4460 [1:32:32<4:18:13, 4.40s/it]g-point operations will not be computed-02 23:23:52,364 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▉ | 937/4460 [1:32:32<4:18:13, 
4.40s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:23:56,033 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▉ | 937/4460 [1:32:32<4:18:13, 4.40s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:23:56,033 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|███████████████▉ | 938/4460 [1:32:35<4:00:38, 4.10s/it]g-point operations will not be computed-02 23:23:56,033 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:24:00,724 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:59,339 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:24:00,724 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:23:59,339 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 939/4460 [1:32:38<3:40:13, 3.75s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:24:02,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 939/4460 [1:32:38<3:40:13, 3.75s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:24:02,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 940/4460 [1:32:41<3:19:48, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:24:04,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 940/4460 [1:32:41<3:19:48, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:24:04,704 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 21%|████████████████ | 941/4460 [1:32:43<3:01:26, 3.09s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:24:07,023 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 941/4460 [1:32:43<3:01:26, 3.09s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:24:07,023 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 942/4460 [1:32:45<2:43:21, 2.79s/it]g-point operations will not be computed-02 23:24:07,023 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 942/4460 [1:32:45<2:43:21, 2.79s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:24:10,495 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 942/4460 [1:32:45<2:43:21, 2.79s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:24:10,495 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:24:14,157 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:24:10,495 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:24:14,157 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:24:10,495 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 943/4460 [1:32:52<4:06:08, 4.20s/it]g-point operations will not be computed-02 23:24:10,495 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 943/4460 [1:32:52<4:06:08, 
4.20s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:24:17,895 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:24:21,526 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:24:17,895 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:24:21,526 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:24:17,895 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 944/4460 [1:33:00<5:01:56, 5.15s/it]g-point operations will not be computed-02 23:24:17,895 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 944/4460 [1:33:00<5:01:56, 5.15s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 944/4460 [1:33:00<5:01:56, 5.15s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 944/4460 [1:33:00<5:01:56, 5.15s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 944/4460 [1:33:00<5:01:56, 5.15s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 945/4460 [1:33:07<5:38:39, 5.78s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:24:25,205 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 21%|████████████████ | 945/4460 [1:33:07<5:38:39, 5.78s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:24:35,869 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 946/4460 [1:33:14<6:01:11, 6.17s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 946/4460 [1:33:14<6:01:11, 6.17s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8965, 'learning_rate': 9.46e-05, 'epoch': 1.06} 21%|████████████████ | 946/4460 [1:33:14<6:01:11, 6.17s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 946/4460 [1:33:14<6:01:11, 6.17s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 946/4460 [1:33:14<6:01:11, 6.17s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 947/4460 [1:33:21<6:18:03, 6.46s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 947/4460 [1:33:21<6:18:03, 6.46s/it]g-point operations 
will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 947/4460 [1:33:21<6:18:03, 6.46s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 947/4460 [1:33:21<6:18:03, 6.46s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 947/4460 [1:33:21<6:18:03, 6.46s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 948/4460 [1:33:29<6:45:35, 6.93s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 948/4460 [1:33:29<6:45:35, 6.93s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:24:58,139 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:24:58,139 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 949/4460 [1:33:36<6:47:11, 6.96s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations 
will not be computed 21%|████████████████▏ | 949/4460 [1:33:36<6:47:11, 6.96s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 949/4460 [1:33:36<6:47:11, 6.96s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 949/4460 [1:33:36<6:47:11, 6.96s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 949/4460 [1:33:36<6:47:11, 6.96s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 950/4460 [1:33:44<6:56:56, 7.13s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 950/4460 [1:33:44<6:56:56, 7.13s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:25:12,648 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 951/4460 [1:33:51<6:54:21, 7.09s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 951/4460 [1:33:51<6:54:21, 7.09s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens 
of the input, floating-point operations will not be computed {'loss': 6.6696, 'learning_rate': 9.51e-05, 'epoch': 1.07} 21%|████████████████▏ | 951/4460 [1:33:51<6:54:21, 7.09s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 951/4460 [1:33:51<6:54:21, 7.09s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 951/4460 [1:33:51<6:54:21, 7.09s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 952/4460 [1:33:58<6:51:25, 7.04s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 952/4460 [1:33:58<6:51:25, 7.04s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:25:26,638 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 953/4460 [1:34:05<6:51:46, 7.05s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 953/4460 [1:34:05<6:51:46, 7.05s/it]g-point operations will not be computed-02 23:24:25,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8356, 'learning_rate': 9.53e-05, 'epoch': 1.07} 
21%|████████████████▎ | 954/4460 [1:34:12<6:48:12, 6.99s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:25:36,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
21%|████████████████▎ | 955/4460 [1:34:18<6:44:51, 6.93s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:25:47,092 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
21%|████████████████▎ | 956/4460 [1:34:25<6:42:30, 6.89s/it]
{'loss': 6.8157, 'learning_rate': 9.56e-05, 'epoch': 1.07}
[WARNING|modeling_utils.py:388] 2022-03-02 23:25:55,505 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8341, 'learning_rate': 9.57e-05, 'epoch': 1.07}
21%|████████████████▎ | 958/4460 [1:34:39<6:38:35, 6.83s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:26:05,742 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▎ | 959/4460 [1:34:46<6:36:07, 6.79s/it]
{'loss': 6.9325, 'learning_rate': 9.59e-05, 'epoch': 1.08}
[WARNING|modeling_utils.py:388] 2022-03-02 23:26:14,022 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▎ | 960/4460 [1:34:52<6:33:46, 6.75s/it]
{'loss': 6.7229, 'learning_rate': 9.6e-05, 'epoch': 1.08}
[WARNING|modeling_utils.py:388] 2022-03-02 23:26:22,233 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.7774, 'learning_rate': 9.61e-05, 'epoch': 1.08}
[WARNING|modeling_utils.py:388] 2022-03-02 23:26:22,233 >> Could not estimate the
number of tokens of the input, floating-point operations will not be computed-02 23:25:36,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|████████████████▍ | 962/4460 [1:35:05<6:27:56, 6.65s/it]g-point operations will not be computed-02 23:25:36,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:26:32,153 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:25:36,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:26:32,153 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:25:36,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|████████████████▍ | 963/4460 [1:35:12<6:26:05, 6.62s/it]g-point operations will not be computed-02 23:25:36,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|████████████████▍ | 963/4460 [1:35:12<6:26:05, 6.62s/it]g-point operations will not be computed-02 23:25:36,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8987, 'learning_rate': 9.63e-05, 'epoch': 1.08} [WARNING|modeling_utils.py:388] 2022-03-02 23:26:40,234 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:25:36,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|████████████████▍ | 964/4460 [1:35:18<6:23:02, 6.57s/it]g-point operations will not be computed-02 23:25:36,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|████████████████▍ 
| 964/4460 [1:35:18<6:23:02, 6.57s/it]g-point operations will not be computed-02 23:25:36,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8257, 'learning_rate': 9.64e-05, 'epoch': 1.08} 22%|████████████████▍ | 964/4460 [1:35:18<6:23:02, 6.57s/it]g-point operations will not be computed-02 23:25:36,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:26:48,233 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:25:36,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:26:48,233 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:25:36,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7984, 'learning_rate': 9.65e-05, 'epoch': 1.08} [WARNING|modeling_utils.py:388] 2022-03-02 23:26:48,233 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:25:36,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:26:54,528 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:25:36,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:26:54,528 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:25:36,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8233, 'learning_rate': 9.66e-05, 'epoch': 1.08} 
[WARNING|modeling_utils.py:388] 2022-03-02 23:26:54,528 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:27:00,984 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.7268, 'learning_rate': 9.67e-05, 'epoch': 1.08}
22%|████████████████▍ | 968/4460 [1:35:44<6:15:02, 6.44s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:27:09,047 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▌ | 969/4460 [1:35:50<6:10:54, 6.37s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:27:15,192 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▌ | 970/4460 [1:35:56<6:05:53, 6.29s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:27:22,816 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▌ | 971/4460 [1:36:02<6:02:46, 6.24s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:27:27,432 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▌ | 972/4460 [1:36:08<6:00:19, 6.20s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:27:33,503 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:27:37,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8162, 'learning_rate': 9.730000000000001e-05, 'epoch': 1.09}
[WARNING|modeling_utils.py:388] 2022-03-02 23:27:44,020 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.902, 'learning_rate': 9.74e-05, 'epoch': 1.09}
22%|████████████████▌ | 975/4460 [1:36:27<6:03:31, 6.26s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:27:52,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.7075, 'learning_rate': 9.75e-05, 'epoch': 1.09}
[WARNING|modeling_utils.py:388] 2022-03-02 23:27:56,424 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.7672, 'learning_rate': 9.76e-05, 'epoch': 1.09}
22%|████████████████▋ | 977/4460 [1:36:39<5:49:13, 6.02s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:03,702 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:07,798 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.6737, 'learning_rate': 9.78e-05, 'epoch': 1.1}
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:11,964 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▋ | 979/4460 [1:36:50<5:37:34, 5.82s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:16,171 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▋ | 980/4460 [1:36:55<5:28:28, 5.66s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:20,128 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:23,826 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:26,377 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:28,744 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:32,259 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▊ | 983/4460 [1:37:10<4:55:43, 5.10s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:34,512 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:36,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▊ | 984/4460 [1:37:14<4:41:13, 4.85s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:38,757 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:40,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▊ | 985/4460 [1:37:18<4:28:19, 4.63s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:42,803 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:44,644 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▊ | 986/4460 [1:37:22<4:13:26, 4.38s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:46,496 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▊ | 987/4460 [1:37:25<3:56:42, 4.09s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:49,818 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▊ | 988/4460 [1:37:28<3:38:14, 3.77s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:52,783 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▊ | 989/4460 [1:37:31<3:21:00, 3.47s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:55,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:56,628 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:57,848 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:28:58,929 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▉ | 991/4460 [1:37:36<2:47:14, 2.89s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:29:00,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▉ | 992/4460 [1:37:38<2:33:07, 2.65s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:29:03,487 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:29:07,165 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▉ | 993/4460 [1:37:45<3:55:50, 4.08s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:29:10,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:29:14,505 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▉ | 994/4460 [1:37:53<4:52:26, 5.06s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:29:18,238 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:29:23,582 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.9552, 'learning_rate': 9.95e-05, 'epoch': 1.12}
22%|████████████████▉ | 996/4460 [1:38:07<5:54:27, 6.14s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:29:35,984 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▉ | 997/4460 [1:38:14<6:09:26, 6.40s/it]
{'loss': 6.7764, 'learning_rate': 9.970000000000001e-05, 'epoch': 1.12}
22%|█████████████████ | 998/4460 [1:38:21<6:18:59, 6.57s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:29:48,242 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|█████████████████ | 999/4460 [1:38:28<6:26:05, 6.69s/it]
22%|████████████████▊ | 1000/4460 [1:38:36<6:41:14, 6.96s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:30:02,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:30:06,287 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8448, 'learning_rate': 9.997109826589596e-05, 'epoch': 1.12}
22%|████████████████▊ | 1002/4460 [1:38:50<6:38:34, 6.92s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▊ | 1002/4460 [1:38:50<6:38:34, 
6.92s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|████████████████▊ | 1003/4460 [1:38:56<6:36:12, 6.88s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:30:23,235 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:30:23,235 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|████████████████▉ | 1004/4460 [1:39:03<6:34:05, 6.84s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|████████████████▉ | 1004/4460 [1:39:03<6:34:05, 6.84s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8916, 'learning_rate': 9.988439306358382e-05, 'epoch': 1.13} 23%|████████████████▉ | 1004/4460 [1:39:03<6:34:05, 6.84s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:30:33,370 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 
2022-03-02 23:30:33,370 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9549, 'learning_rate': 9.985549132947977e-05, 'epoch': 1.13} [WARNING|modeling_utils.py:388] 2022-03-02 23:30:33,370 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:30:33,370 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:30:33,370 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|████████████████▉ | 1006/4460 [1:39:17<6:31:17, 6.80s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:30:43,554 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:30:43,554 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|████████████████▉ | 1007/4460 [1:39:23<6:30:30, 
6.79s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|████████████████▉ | 1007/4460 [1:39:23<6:30:30, 6.79s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7415, 'learning_rate': 9.979768786127168e-05, 'epoch': 1.13} 23%|████████████████▉ | 1007/4460 [1:39:23<6:30:30, 6.79s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:30:53,510 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:30:53,510 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8201, 'learning_rate': 9.976878612716763e-05, 'epoch': 1.13} [WARNING|modeling_utils.py:388] 2022-03-02 23:30:53,510 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:31:00,145 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:31:00,145 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8066, 'learning_rate': 9.973988439306359e-05, 'epoch': 1.13} [WARNING|modeling_utils.py:388] 2022-03-02 23:31:00,145 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:31:00,145 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:31:00,145 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|████████████████▉ | 1010/4460 [1:39:43<6:25:11, 6.70s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:31:10,154 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:31:10,154 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████ | 1011/4460 [1:39:50<6:23:00, 6.66s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate 
the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████ | 1011/4460 [1:39:50<6:23:00, 6.66s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8517, 'learning_rate': 9.968208092485549e-05, 'epoch': 1.13} [WARNING|modeling_utils.py:388] 2022-03-02 23:31:18,274 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████ | 1012/4460 [1:39:56<6:19:58, 6.61s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████ | 1012/4460 [1:39:56<6:19:58, 6.61s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9332, 'learning_rate': 9.965317919075145e-05, 'epoch': 1.13} 23%|█████████████████ | 1012/4460 [1:39:56<6:19:58, 6.61s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:31:26,362 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:31:26,362 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7091, 'learning_rate': 
9.962427745664741e-05, 'epoch': 1.14} [WARNING|modeling_utils.py:388] 2022-03-02 23:31:26,362 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:31:32,815 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:31:32,815 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8574, 'learning_rate': 9.959537572254337e-05, 'epoch': 1.14} [WARNING|modeling_utils.py:388] 2022-03-02 23:31:32,815 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:31:39,244 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:31:39,244 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:31:39,244 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number 
of tokens of the input, floating-point operations will not be computed {'loss': 6.6962, 'learning_rate': 9.95664739884393e-05, 'epoch': 1.14} [WARNING|modeling_utils.py:388] 2022-03-02 23:31:39,244 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████ | 1016/4460 [1:40:22<6:10:37, 6.46s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████ | 1016/4460 [1:40:22<6:10:37, 6.46s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:31:48,812 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:31:48,812 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████ | 1017/4460 [1:40:28<6:07:08, 6.40s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████ | 1017/4460 [1:40:28<6:07:08, 6.40s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:31:55,091 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:31:55,091 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████ | 1018/4460 [1:40:35<6:05:37, 6.37s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████ | 1018/4460 [1:40:35<6:05:37, 6.37s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:32:01,316 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:32:01,316 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▏ | 1019/4460 [1:40:41<6:02:19, 6.32s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▏ | 1019/4460 [1:40:41<6:02:19, 6.32s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:32:07,531 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:32:07,531 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▏ | 1020/4460 [1:40:47<6:00:23, 6.29s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▏ | 1020/4460 [1:40:47<6:00:23, 6.29s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9041, 'learning_rate': 9.942196531791907e-05, 'epoch': 1.14} [WARNING|modeling_utils.py:388] 2022-03-02 23:32:15,231 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▏ | 1021/4460 [1:40:53<5:58:35, 6.26s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▏ | 1021/4460 [1:40:53<5:58:35, 6.26s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8285, 'learning_rate': 9.939306358381504e-05, 'epoch': 1.14} [WARNING|modeling_utils.py:388] 2022-03-02 23:32:21,379 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 23%|█████████████████▏ | 1022/4460 [1:40:59<5:56:09, 6.22s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▏ | 1022/4460 [1:40:59<5:56:09, 6.22s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7863, 'learning_rate': 9.9364161849711e-05, 'epoch': 1.15} [WARNING|modeling_utils.py:388] 2022-03-02 23:32:27,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▏ | 1023/4460 [1:41:05<5:53:07, 6.16s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▏ | 1023/4460 [1:41:05<5:53:07, 6.16s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:32:31,948 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:32:31,948 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▏ | 1024/4460 [1:41:11<5:49:34, 6.10s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed 23%|█████████████████▏ | 1024/4460 [1:41:11<5:49:34, 6.10s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:32:37,962 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:32:37,962 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▏ | 1025/4460 [1:41:18<5:57:10, 6.24s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▏ | 1025/4460 [1:41:18<5:57:10, 6.24s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6102, 'learning_rate': 9.927745664739884e-05, 'epoch': 1.15} [WARNING|modeling_utils.py:388] 2022-03-02 23:32:45,894 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▎ | 1026/4460 [1:41:24<5:50:35, 6.13s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▎ | 1026/4460 [1:41:24<5:50:35, 6.13s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6934, 'learning_rate': 9.92485549132948e-05, 'epoch': 1.15} [WARNING|modeling_utils.py:388] 2022-03-02 23:32:51,583 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:32:51,583 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▎ | 1027/4460 [1:41:29<5:42:37, 5.99s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:32:55,817 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:32:58,524 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:32:58,524 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6877, 'learning_rate': 9.91907514450867e-05, 'epoch': 1.15} [WARNING|modeling_utils.py:388] 2022-03-02 23:32:58,524 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:33:04,055 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:33:04,055 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.5464, 'learning_rate': 9.916184971098267e-05, 'epoch': 1.15} [WARNING|modeling_utils.py:388] 2022-03-02 23:33:08,124 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▎ | 1030/4460 [1:41:46<5:22:25, 5.64s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▎ | 1030/4460 [1:41:46<5:22:25, 5.64s/it]g-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:33:12,080 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:33:14,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:33:14,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.638, 'learning_rate': 9.910404624277458e-05, 'epoch': 1.16} [WARNING|modeling_utils.py:388] 2022-03-02 23:33:18,422 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:30:14,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▎ | 1032/4460 [1:41:56<5:05:43, 5.35s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:33:20,907 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▎ | 1032/4460 [1:41:56<5:05:43, 5.35s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:33:20,907 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6992, 'learning_rate': 9.907514450867053e-05, 'epoch': 1.16} [WARNING|modeling_utils.py:388] 2022-03-02 23:33:24,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:33:20,907 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:33:24,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:33:20,907 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:33:26,770 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:33:20,907 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:33:28,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:33:20,907 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:33:28,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:33:20,907 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:33:31,215 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:33:20,907 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:33:33,273 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:33:20,907 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:33:33,273 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:33:20,907 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8435, 'learning_rate': 9.898843930635839e-05, 'epoch': 1.16} [WARNING|modeling_utils.py:388] 2022-03-02 23:33:36,349 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:33:20,907 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:33:36,349 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:33:20,907 >> Could not estimate 
the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▍ | 1036/4460 [1:42:14<4:19:55, 4.55s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:33:38,339 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▍ | 1036/4460 [1:42:14<4:19:55, 4.55s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:33:38,339 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▍ | 1037/4460 [1:42:17<4:04:30, 4.29s/it]g-point operations will not be computed-02 23:33:38,339 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:33:43,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:33:41,840 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:33:43,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:33:41,840 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▍ | 1038/4460 [1:42:21<3:45:46, 3.96s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:33:44,977 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▍ | 1038/4460 [1:42:21<3:45:46, 3.96s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:33:44,977 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▍ | 1039/4460 [1:42:24<3:28:09, 3.65s/it]g-point operations will not be computed-02 23:33:44,977 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-02 23:33:49,091 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:33:47,838 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:33:49,091 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:33:47,838 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:33:51,504 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:33:50,382 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:33:51,504 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:33:50,382 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1041/4460 [1:42:29<2:53:31, 3.05s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:33:52,677 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1041/4460 [1:42:29<2:53:31, 3.05s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:33:52,677 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1042/4460 [1:42:31<2:37:27, 2.76s/it]g-point operations will not be computed-02 23:33:52,677 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1042/4460 [1:42:31<2:37:27, 2.76s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:33:56,163 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed 23%|█████████████████▌ | 1042/4460 [1:42:31<2:37:27, 2.76s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:33:56,163 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1043/4460 [1:42:38<3:57:47, 4.18s/it]g-point operations will not be computed-02 23:33:56,163 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1043/4460 [1:42:38<3:57:47, 4.18s/it]g-point operations will not be computed-02 23:33:56,163 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1043/4460 [1:42:38<3:57:47, 4.18s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:34:03,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1043/4460 [1:42:38<3:57:47, 4.18s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:34:03,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:34:07,097 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:34:03,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:34:07,097 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:34:03,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1044/4460 [1:42:45<4:49:34, 5.09s/it]g-point operations will not be computed-02 23:34:03,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1044/4460 [1:42:45<4:49:34, 5.09s/it][WARNING|modeling_utils.py:388] 
2022-03-02 23:34:10,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1044/4460 [1:42:45<4:49:34, 5.09s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:34:10,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:34:15,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:34:10,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:34:15,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:34:10,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0778, 'learning_rate': 9.869942196531792e-05, 'epoch': 1.17} [WARNING|modeling_utils.py:388] 2022-03-02 23:34:15,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:34:10,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:34:15,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:34:10,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:34:15,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:34:10,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1046/4460 [1:43:00<5:49:33, 6.14s/it]g-point operations will not be computed-02 23:34:10,734 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 23%|█████████████████▌ | 1046/4460 [1:43:00<5:49:33, 6.14s/it]g-point operations will not be computed-02 23:34:10,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1046/4460 [1:43:00<5:49:33, 6.14s/it]g-point operations will not be computed-02 23:34:10,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1046/4460 [1:43:00<5:49:33, 6.14s/it]g-point operations will not be computed-02 23:34:10,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1046/4460 [1:43:00<5:49:33, 6.14s/it]g-point operations will not be computed-02 23:34:10,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1047/4460 [1:43:07<6:05:19, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1047/4460 [1:43:07<6:05:19, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1047/4460 [1:43:07<6:05:19, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1047/4460 [1:43:07<6:05:19, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 1048/4460 [1:43:14<6:14:57, 6.59s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:34:32,078 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 23%|█████████████████▌ | 1048/4460 [1:43:14<6:14:57, 6.59s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:34:42,482 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▋ | 1049/4460 [1:43:21<6:21:53, 6.72s/it]g-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▋ | 1049/4460 [1:43:21<6:21:53, 6.72s/it]g-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9212, 'learning_rate': 9.858381502890174e-05, 'epoch': 1.18} 24%|█████████████████▋ | 1049/4460 [1:43:21<6:21:53, 6.72s/it]g-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▋ | 1049/4460 [1:43:21<6:21:53, 6.72s/it]g-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▋ | 1049/4460 [1:43:21<6:21:53, 6.72s/it]g-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▋ | 1050/4460 [1:43:28<6:34:49, 6.95s/it]g-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▋ | 1050/4460 
[1:43:28<6:34:49, 6.95s/it]g-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▋ | 1050/4460 [1:43:28<6:34:49, 6.95s/it]g-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:34:58,775 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:34:58,775 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8042, 'learning_rate': 9.852601156069364e-05, 'epoch': 1.18} [WARNING|modeling_utils.py:388] 2022-03-02 23:34:58,775 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:34:58,775 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:34:58,775 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▋ | 1052/4460 [1:43:42<6:35:01, 6.95s/it]g-point operations will not be computed-02 23:34:32,078 
>> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:35:09,152 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:35:09,152 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:35:09,152 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▋ | 1053/4460 [1:43:49<6:33:18, 6.93s/it]g-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▋ | 1053/4460 [1:43:49<6:33:18, 6.93s/it]g-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▋ | 1053/4460 [1:43:49<6:33:18, 6.93s/it]g-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▋ | 1053/4460 [1:43:49<6:33:18, 6.93s/it]g-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▋ | 1053/4460 [1:43:49<6:33:18, 6.93s/it]g-point operations will not be computed-02 23:34:32,078 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed 24%|█████████████████▋ | 1054/4460 [1:43:56<6:31:00, 6.89s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▋ | 1054/4460 [1:43:56<6:31:00, 6.89s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▋ | 1054/4460 [1:43:56<6:31:00, 6.89s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▋ | 1054/4460 [1:43:56<6:31:00, 6.89s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▋ | 1055/4460 [1:44:03<6:28:38, 6.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▋ | 1055/4460 [1:44:03<6:28:38, 6.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:35:31,281 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▊ | 1056/4460 [1:44:09<6:29:07, 6.86s/it]g-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▊ | 1056/4460 [1:44:09<6:29:07, 6.86s/it]g-point operations will not be computed-02 
23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7968, 'learning_rate': 9.838150289017341e-05, 'epoch': 1.18} 24%|█████████████████▊ | 1056/4460 [1:44:09<6:29:07, 6.86s/it]g-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:35:39,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:35:39,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.961, 'learning_rate': 9.835260115606936e-05, 'epoch': 1.18} [WARNING|modeling_utils.py:388] 2022-03-02 23:35:39,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:35:39,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:35:39,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▊ | 1058/4460 [1:44:23<6:26:27, 6.82s/it]g-point operations will not be computed-02 23:35:21,101 
>> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▊ | 1058/4460 [1:44:23<6:26:27, 6.82s/it]g-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:35:51,642 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▊ | 1059/4460 [1:44:30<6:29:25, 6.87s/it]g-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▊ | 1059/4460 [1:44:30<6:29:25, 6.87s/it]g-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7562, 'learning_rate': 9.829479768786127e-05, 'epoch': 1.19} 24%|█████████████████▊ | 1059/4460 [1:44:30<6:29:25, 6.87s/it]g-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▊ | 1059/4460 [1:44:30<6:29:25, 6.87s/it]g-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▊ | 1059/4460 [1:44:30<6:29:25, 6.87s/it]g-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▊ | 1060/4460 [1:44:37<6:33:07, 6.94s/it]g-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not 
be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:36:03,988 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:36:03,988 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:36:03,988 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▊ | 1061/4460 [1:44:44<6:28:56, 6.87s/it]g-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▊ | 1061/4460 [1:44:44<6:28:56, 6.87s/it]g-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:36:12,363 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▊ | 1062/4460 [1:44:51<6:27:38, 6.84s/it]g-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▊ | 1062/4460 [1:44:51<6:27:38, 6.84s/it]g-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations 
will not be computed {'loss': 6.8379, 'learning_rate': 9.820809248554915e-05, 'epoch': 1.19} 24%|█████████████████▊ | 1062/4460 [1:44:51<6:27:38, 6.84s/it]g-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:36:20,658 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:36:20,658 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7659, 'learning_rate': 9.81791907514451e-05, 'epoch': 1.19} [WARNING|modeling_utils.py:388] 2022-03-02 23:36:20,658 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:36:27,164 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:36:27,164 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7486, 'learning_rate': 9.815028901734105e-05, 'epoch': 1.19} [WARNING|modeling_utils.py:388] 2022-03-02 23:36:27,164 >> Could not estimate the number of tokens of the input, floating-point operations 
will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:36:27,164 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:35:21,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▉ | 1065/4460 [1:45:10<6:14:53, 6.63s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:36:35,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▉ | 1065/4460 [1:45:10<6:14:53, 6.63s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:36:35,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8629, 'learning_rate': 9.812138728323699e-05, 'epoch': 1.19} 24%|█████████████████▉ | 1065/4460 [1:45:10<6:14:53, 6.63s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:36:35,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▉ | 1066/4460 [1:45:17<6:11:33, 6.57s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:36:35,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▉ | 1066/4460 [1:45:17<6:11:33, 6.57s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:36:35,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.78, 'learning_rate': 9.809248554913295e-05, 'epoch': 1.2} [WARNING|modeling_utils.py:388] 2022-03-02 23:36:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:36:35,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▉ | 1067/4460 
[1:45:23<6:10:28, 6.55s/it]g-point operations will not be computed-02 23:36:35,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▉ | 1067/4460 [1:45:23<6:10:28, 6.55s/it]g-point operations will not be computed-02 23:36:35,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7193, 'learning_rate': 9.80635838150289e-05, 'epoch': 1.2} [WARNING|modeling_utils.py:388] 2022-03-02 23:36:51,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:36:35,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▉ | 1068/4460 [1:45:29<6:06:11, 6.48s/it]g-point operations will not be computed-02 23:36:35,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|█████████████████▉ | 1068/4460 [1:45:29<6:06:11, 6.48s/it]g-point operations will not be computed-02 23:36:35,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7284, 'learning_rate': 9.803468208092485e-05, 'epoch': 1.2} 24%|█████████████████▉ | 1068/4460 [1:45:29<6:06:11, 6.48s/it]g-point operations will not be computed-02 23:36:35,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:36:59,233 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:36:35,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:36:59,233 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:36:35,322 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed {'loss': 6.9339, 'learning_rate': 9.800578034682082e-05, 'epoch': 1.2} [WARNING|modeling_utils.py:388] 2022-03-02 23:36:59,233 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:36:35,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:37:05,433 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:36:35,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:37:05,433 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:36:35,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.799, 'learning_rate': 9.797687861271677e-05, 'epoch': 1.2} [WARNING|modeling_utils.py:388] 2022-03-02 23:37:05,433 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:36:35,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:37:11,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:36:35,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:37:11,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:36:35,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.796, 'learning_rate': 9.794797687861273e-05, 'epoch': 1.2} [WARNING|modeling_utils.py:388] 2022-03-02 
23:37:11,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:37:17,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8769, 'learning_rate': 9.791907514450868e-05, 'epoch': 1.2} [WARNING|modeling_utils.py:388] 2022-03-02 23:37:17,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:37:24,523 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7186, 'learning_rate': 9.789017341040463e-05, 'epoch': 1.2} [WARNING|modeling_utils.py:388] 2022-03-02 23:37:24,523 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:37:30,522 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6335, 'learning_rate': 9.786127167630059e-05, 'epoch': 1.2} [WARNING|modeling_utils.py:388] 2022-03-02 23:37:30,522 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████ | 1075/4460 [1:46:14<5:57:28, 6.34s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:37:38,644 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:37:42,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7582, 'learning_rate': 9.780346820809248e-05, 'epoch': 1.21} [WARNING|modeling_utils.py:388] 2022-03-02 23:37:42,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:37:48,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:37:52,835 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
24%|██████████████████▏ | 1078/4460 [1:46:31<5:33:30, 5.92s/it] [WARNING|modeling_utils.py:388] 2022-03-02 23:37:56,948 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:37:59,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8127, 'learning_rate': 9.771676300578035e-05, 'epoch': 1.21} [WARNING|modeling_utils.py:388] 2022-03-02 23:38:03,612 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▏ | 1080/4460
[1:46:41<5:17:10, 5.63s/it] [WARNING|modeling_utils.py:388] 2022-03-02 23:38:07,555 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▏ | 1081/4460 [1:46:47<5:09:45, 5.50s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:38:11,469 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:38:13,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▏ | 1082/4460 [1:46:52<5:01:10, 5.35s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:38:17,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:38:19,831 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:38:22,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:38:24,344 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.649, 'learning_rate': 9.757225433526012e-05, 'epoch': 1.22} [WARNING|modeling_utils.py:388] 2022-03-02 23:38:27,522 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
24%|██████████████████▏ | 1085/4460 [1:47:05<4:25:49, 4.73s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:38:29,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:38:31,495 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▎ | 1086/4460 [1:47:09<4:11:51, 4.48s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:38:33,413 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:38:35,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▎ | 1087/4460 [1:47:12<3:56:35, 4.21s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:38:36,896 >>
Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▎ | 1088/4460 [1:47:16<3:40:39, 3.93s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:38:40,116 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▎ | 1089/4460 [1:47:19<3:25:33, 3.66s/it] [WARNING|modeling_utils.py:388] 2022-03-02 23:38:44,475 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▎ | 1090/4460 [1:47:22<3:11:31, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:38:45,834 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▎ | 1091/4460 [1:47:24<2:55:02, 3.12s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:38:48,221 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
24%|██████████████████▎ | 1092/4460 [1:47:26<2:39:36, 2.84s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:38:51,772 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:38:55,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▍ | 1093/4460 [1:47:34<3:58:07, 4.24s/it][WARNING|modeling_utils.py:388]
2022-03-02 23:38:59,191 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:39:02,773 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▍ | 1094/4460 [1:47:41<4:49:11, 5.15s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:39:06,403 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▍ | 1095/4460 [1:47:48<5:21:37, 5.73s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:39:06,403 >> Could not estimate the number of tokens of the input, floating-point
operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:39:18,741 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0257, 'learning_rate': 9.722543352601156e-05, 'epoch': 1.23} [WARNING|modeling_utils.py:388] 2022-03-02 23:39:18,741 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▍ | 1097/4460
[1:48:02<6:01:39, 6.45s/it] [WARNING|modeling_utils.py:388] 2022-03-02 23:39:32,980 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8981, 'learning_rate': 9.716763005780347e-05, 'epoch': 1.23} [WARNING|modeling_utils.py:388] 2022-03-02 23:39:32,980 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
25%|██████████████████▍ | 1099/4460 [1:48:16<6:17:56, 6.75s/it] [WARNING|modeling_utils.py:388] 2022-03-02 23:39:45,293 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▍ | 1100/4460 [1:48:24<6:32:08, 7.00s/it] {'loss': 7.0534, 'learning_rate': 9.710982658959538e-05, 'epoch': 1.23} 25%|██████████████████▍ | 1100/4460 [1:48:24<6:32:08, 7.00s/it] 25%|██████████████████▌ | 1101/4460 [1:48:31<6:33:31, 7.03s/it]
{'loss': 6.8323, 'learning_rate': 9.708092485549133e-05, 'epoch': 1.23} [WARNING|modeling_utils.py:388] 2022-03-02 23:39:59,869 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▌ | 1102/4460 [1:48:38<6:30:17, 6.97s/it] {'loss': 6.7655, 'learning_rate': 9.70520231213873e-05, 'epoch': 1.24} 25%|██████████████████▌ | 1102/4460 [1:48:38<6:30:17, 6.97s/it] [WARNING|modeling_utils.py:388] 2022-03-02 23:40:08,315 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6748, 'learning_rate': 9.702312138728325e-05, 'epoch': 1.24}
[WARNING|modeling_utils.py:388] 2022-03-02 23:40:11,825 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▌ | 1104/4460 [1:48:52<6:26:07, 6.90s/it] [WARNING|modeling_utils.py:388] 2022-03-02 23:40:18,657 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▌ | 1105/4460 [1:48:59<6:24:47, 6.88s/it]
{'loss': 6.8083, 'learning_rate': 9.696531791907514e-05, 'epoch': 1.24} 25%|██████████████████▌ | 1105/4460 [1:48:59<6:24:47, 6.88s/it] 25%|██████████████████▌ | 1106/4460 [1:49:05<6:22:16, 6.84s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:40:30,514 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▌ | 1107/4460 [1:49:12<6:20:13, 6.80s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:40:30,514 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:40:38,835 >> Could not estimate the number of tokens of the input, floating-point
operations will not be computed 25%|██████████████████▋ | 1108/4460 [1:49:19<6:16:42, 6.74s/it] {'loss': 6.7133, 'learning_rate': 9.6878612716763e-05, 'epoch': 1.24} [WARNING|modeling_utils.py:388] 2022-03-02 23:40:47,124 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▋ | 1109/4460 [1:49:25<6:16:09, 6.74s/it] {'loss': 6.7263, 'learning_rate': 9.684971098265896e-05, 'epoch': 1.24} 25%|██████████████████▋ |
1109/4460 [1:49:25<6:16:09, 6.74s/it] 25%|██████████████████▋ | 1110/4460 [1:49:32<6:13:59, 6.70s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:40:57,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▋ | 1111/4460 [1:49:39<6:13:10, 6.69s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:40:57,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:41:05,432 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
25%|██████████████████▋ | 1112/4460 [1:49:45<6:11:14, 6.65s/it] [WARNING|modeling_utils.py:388] 2022-03-02 23:41:13,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▋ | 1113/4460 [1:49:52<6:09:19, 6.62s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:41:21,659 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7696, 'learning_rate': 9.670520231213874e-05, 'epoch': 1.25} [WARNING|modeling_utils.py:388] 2022-03-02 23:41:21,659 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▊ | 1115/4460 [1:50:05<6:04:56, 6.55s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:41:29,802 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▊ |
1115/4460 [1:50:05<6:04:56, 6.55s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:41:29,802 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▊ | 1115/4460 [1:50:05<6:04:56, 6.55s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:41:29,802 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▊ | 1116/4460 [1:50:11<6:02:33, 6.51s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▊ | 1116/4460 [1:50:11<6:02:33, 6.51s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▊ | 1116/4460 [1:50:11<6:02:33, 6.51s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▊ | 1116/4460 [1:50:11<6:02:33, 6.51s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▊ | 1117/4460 [1:50:17<5:59:41, 6.46s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:41:44,143 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:41:44,143 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▊ | 1118/4460 [1:50:24<5:57:39, 6.42s/it]g-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▊ | 1118/4460 [1:50:24<5:57:39, 6.42s/it]g-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:41:50,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:41:50,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▊ | 1119/4460 [1:50:30<5:54:12, 6.36s/it]g-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▊ | 1119/4460 [1:50:30<5:54:12, 6.36s/it]g-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:41:56,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:41:56,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 
23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:41:56,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▊ | 1120/4460 [1:50:36<5:52:10, 6.33s/it]g-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:02,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:02,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:02,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▊ | 1121/4460 [1:50:42<5:48:52, 6.27s/it]g-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:08,923 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 
2022-03-02 23:42:08,923 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:08,923 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▊ | 1122/4460 [1:50:48<5:45:27, 6.21s/it]g-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:14,997 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:14,997 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:14,997 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▉ | 1123/4460 [1:50:54<5:42:23, 6.16s/it]g-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:20,972 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:20,972 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:20,972 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▉ | 1124/4460 [1:51:00<5:38:55, 6.10s/it]g-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:26,886 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:26,886 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:26,886 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▉ | 1125/4460 [1:51:07<5:44:27, 6.20s/it]g-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:33,291 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:33,291 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:33,291 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▉ | 1126/4460 [1:51:13<5:37:08, 6.07s/it]g-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:39,033 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:39,033 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:39,033 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:41:36,211 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▉ | 1127/4460 [1:51:18<5:31:02, 5.96s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:42:43,274 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed 25%|██████████████████▉ | 1127/4460 [1:51:18<5:31:02, 5.96s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:42:43,274 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:47,359 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:42:43,274 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:47,359 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:42:43,274 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:50,196 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:42:43,274 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:50,196 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:42:43,274 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:50,196 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:42:43,274 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|██████████████████▉ | 1129/4460 [1:51:29<5:19:32, 5.76s/it]g-point operations will not be computed-02 23:42:43,274 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:55,639 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed-02 23:42:43,274 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:55,639 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:42:43,274 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:42:55,639 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:42:43,274 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████ | 1130/4460 [1:51:35<5:11:55, 5.62s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:42:59,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████ | 1130/4460 [1:51:35<5:11:55, 5.62s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:42:59,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████ | 1130/4460 [1:51:35<5:11:55, 5.62s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:42:59,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:43:03,318 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:42:59,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:43:05,844 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:42:59,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-02 23:43:08,245 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:42:59,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:43:08,245 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:42:59,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6711, 'learning_rate': 9.61849710982659e-05, 'epoch': 1.27} [WARNING|modeling_utils.py:388] 2022-03-02 23:43:11,800 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:42:59,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:43:11,800 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:42:59,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████ | 1133/4460 [1:51:49<4:43:31, 5.11s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:14,130 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:43:16,207 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:43:14,130 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:43:16,207 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:43:14,130 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████ | 
1134/4460 [1:51:54<4:30:01, 4.87s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:18,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:43:20,320 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:43:18,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:43:20,320 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:43:18,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████ | 1135/4460 [1:51:58<4:16:21, 4.63s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:22,361 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:43:24,175 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:43:22,361 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:43:24,175 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:43:22,361 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████ | 1136/4460 [1:52:02<4:02:28, 4.38s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:26,055 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████ | 1137/4460 [1:52:05<3:46:46, 4.09s/it]g-point operations will not be computed-02 23:43:26,055 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
25%|███████████████████ | 1137/4460 [1:52:05<3:46:46, 4.09s/it]g-point operations will not be computed-02 23:43:26,055 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████ | 1137/4460 [1:52:05<3:46:46, 4.09s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:29,390 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████ | 1137/4460 [1:52:05<3:46:46, 4.09s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:29,390 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▏ | 1138/4460 [1:52:08<3:30:50, 3.81s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:32,427 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▏ | 1139/4460 [1:52:11<3:13:55, 3.50s/it]g-point operations will not be computed-02 23:43:32,427 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▏ | 1139/4460 [1:52:11<3:13:55, 3.50s/it]g-point operations will not be computed-02 23:43:32,427 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▏ | 1139/4460 [1:52:11<3:13:55, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:35,178 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▏ | 1139/4460 [1:52:11<3:13:55, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:35,178 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▏ | 1140/4460 [1:52:14<2:58:16, 3.22s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:37,697 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed 26%|███████████████████▏ | 1140/4460 [1:52:14<2:58:16, 3.22s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:37,697 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▏ | 1141/4460 [1:52:16<2:44:36, 2.98s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:40,064 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▏ | 1141/4460 [1:52:16<2:44:36, 2.98s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:40,064 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▏ | 1142/4460 [1:52:18<2:31:44, 2.74s/it]g-point operations will not be computed-02 23:43:40,064 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▏ | 1142/4460 [1:52:18<2:31:44, 2.74s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:43,601 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▏ | 1142/4460 [1:52:18<2:31:44, 2.74s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:43,601 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:43:47,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:43:43,601 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:43:47,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:43:43,601 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▏ | 1143/4460 [1:52:26<3:51:19, 4.18s/it]g-point 
operations will not be computed-02 23:43:43,601 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▏ | 1143/4460 [1:52:26<3:51:19, 4.18s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:51,070 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:43:54,660 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:43:51,070 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▏ | 1144/4460 [1:52:33<4:42:27, 5.11s/it]g-point operations will not be computed-02 23:43:51,070 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▏ | 1144/4460 [1:52:33<4:42:27, 5.11s/it]g-point operations will not be computed-02 23:43:51,070 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▏ | 1144/4460 [1:52:33<4:42:27, 5.11s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:58,316 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▏ | 1144/4460 [1:52:33<4:42:27, 5.11s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:58,316 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▏ | 1144/4460 [1:52:33<4:42:27, 5.11s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:58,316 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▏ | 1144/4460 [1:52:33<4:42:27, 5.11s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:58,316 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
26%|███████████████████▎ | 1145/4460 [1:52:40<5:16:57, 5.74s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:43:58,316 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:44:08,945 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|███████████████████▎ | 1146/4460 [1:52:47<5:38:11, 6.12s/it]
{'loss': 6.9757, 'learning_rate': 9.578034682080925e-05, 'epoch': 1.28}
26%|███████████████████▎ | 1147/4460 [1:52:54<5:56:35, 6.46s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:44:21,546 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|███████████████████▎ | 1148/4460 [1:53:02<6:08:07, 6.67s/it]
{'loss': 6.914, 'learning_rate': 9.572254335260116e-05, 'epoch': 1.29}
26%|███████████████████▎ | 1149/4460 [1:53:09<6:13:20, 6.77s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:44:35,588 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|███████████████████▎ | 1150/4460 [1:53:16<6:25:54, 7.00s/it]
{'loss': 6.8501, 'learning_rate': 9.566473988439308e-05, 'epoch': 1.29}
26%|███████████████████▎ | 1151/4460 [1:53:23<6:28:03, 7.04s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:44:50,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|███████████████████▎ | 1152/4460 [1:53:30<6:25:20, 6.99s/it]
{'loss': 6.6758, 'learning_rate': 9.560693641618498e-05, 'epoch': 1.29}
26%|███████████████████▍ | 1153/4460 [1:53:37<6:20:28, 6.90s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:45:02,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|███████████████████▍ | 1154/4460 [1:53:44<6:18:24, 6.87s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:45:10,556 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|███████████████████▍ | 1155/4460 [1:53:50<6:17:31, 6.85s/it]
{'loss': 6.8182, 'learning_rate': 9.552023121387283e-05, 'epoch': 1.29}
[WARNING|modeling_utils.py:388] 2022-03-02 23:45:20,666 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.7379, 'learning_rate': 9.549132947976878e-05, 'epoch': 1.3}
26%|███████████████████▍ | 1157/4460 [1:54:04<6:13:41, 6.79s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:45:29,106 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.7678, 'learning_rate': 9.546242774566474e-05, 'epoch': 1.3}
26%|███████████████████▍ | 1158/4460 [1:54:11<6:11:11, 6.74s/it]
{'loss': 7.0244, 'learning_rate': 9.54335260115607e-05, 'epoch': 1.3}
[WARNING|modeling_utils.py:388] 2022-03-02 23:45:38,990 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|███████████████████▍ | 1159/4460 [1:54:17<6:08:46, 6.70s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:45:47,209 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|███████████████████▌ | 1161/4460 [1:54:30<6:05:18, 6.64s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:45:57,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|███████████████████▌ | 1162/4460 [1:54:37<6:04:00, 6.62s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:46:03,621 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|███████████████████▌ | 1163/4460 [1:54:43<6:01:26, 6.58s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:46:11,671 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|███████████████████▌ | 1164/4460 [1:54:50<5:59:03, 6.54s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:46:19,660 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:46:26,012 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.612, 'learning_rate': 9.520231213872833e-05, 'epoch': 1.31}
[WARNING|modeling_utils.py:388] 2022-03-02 23:46:32,308 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.9583, 'learning_rate': 9.517341040462428e-05, 'epoch': 1.31}
26%|███████████████████▋ | 1168/4460 [1:55:15<5:49:20, 6.37s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:46:40,251 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:46:44,850 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.9203, 'learning_rate': 9.511560693641619e-05, 'epoch': 1.31}
26%|███████████████████▋ | 1170/4460 [1:55:27<5:43:53, 6.27s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:46:52,601 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|███████████████████▋ | 1171/4460 [1:55:34<5:40:52, 6.22s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:47:00,239 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|███████████████████▋ | 1172/4460 [1:55:40<5:40:20, 6.21s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:47:04,899 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|███████████████████▋ | 1173/4460 [1:55:46<5:38:50, 6.19s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:47:10,991 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|███████████████████▋ | 1174/4460 [1:55:52<5:34:24, 6.11s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:47:16,863 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|███████████████████▊ | 1175/4460 [1:55:58<5:40:00, 6.21s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:47:23,347 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:47:27,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8255, 'learning_rate': 9.491329479768787e-05, 'epoch': 1.32}
[WARNING|modeling_utils.py:388] 2022-03-02 23:47:33,215 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8335, 'learning_rate': 9.488439306358382e-05, 'epoch': 1.32}
[WARNING|modeling_utils.py:388] 2022-03-02 23:47:37,411 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|███████████████████▊ | 1178/4460 [1:56:15<5:18:51, 5.83s/it]
{'loss': 6.7124, 'learning_rate': 9.485549132947977e-05, 'epoch': 1.32}
[WARNING|modeling_utils.py:388] 2022-03-02 23:47:42,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|███████████████████▊ | 1179/4460 [1:56:21<5:13:20, 5.73s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:47:46,957 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:47:49,538 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.6875, 'learning_rate': 9.479768786127168e-05, 'epoch': 1.32}
[WARNING|modeling_utils.py:388] 2022-03-02 23:47:53,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|███████████████████▊ | 1181/4460 [1:56:31<4:58:12, 5.46s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:47:57,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:47:59,481 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:48:01,847 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
27%|███████████████████▉ | 1183/4460 [1:56:41<4:37:01, 5.07s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:05,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:48:07,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
27%|███████████████████▉ | 1184/4460 [1:56:45<4:25:38, 4.87s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:09,607 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:48:11,587 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
27%|███████████████████▉ | 1185/4460 [1:56:49<4:12:26, 4.62s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:13,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:48:15,474 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
27%|███████████████████▉ | 1186/4460 [1:56:53<3:59:21, 4.39s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:17,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
27%|███████████████████▉ | 1187/4460 [1:56:56<3:44:16, 4.11s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:48:20,756 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:48:22,282 >> Could not estimate the number
of tokens of the input, floating-point operations will not be computed 27%|███████████████████▉ | 1188/4460 [1:57:00<3:28:58, 3.83s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:23,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|███████████████████▉ | 1189/4460 [1:57:03<3:14:43, 3.57s/it]g-point operations will not be computed-02 23:48:23,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|███████████████████▉ | 1189/4460 [1:57:03<3:14:43, 3.57s/it]g-point operations will not be computed-02 23:48:23,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:48:28,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:48:26,779 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:48:28,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:48:26,779 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:48:30,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:48:29,307 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:48:30,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:48:29,307 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:48:32,520 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed-02 23:48:31,561 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:48:32,520 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:48:31,561 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████ | 1192/4460 [1:57:10<2:28:11, 2.72s/it]g-point operations will not be computed-02 23:48:31,561 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████ | 1192/4460 [1:57:10<2:28:11, 2.72s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:35,046 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:48:38,751 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:48:35,046 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████ | 1193/4460 [1:57:17<3:47:22, 4.18s/it]g-point operations will not be computed-02 23:48:35,046 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████ | 1193/4460 [1:57:17<3:47:22, 4.18s/it]g-point operations will not be computed-02 23:48:35,046 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████ | 1193/4460 [1:57:17<3:47:22, 4.18s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:42,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████ | 1193/4460 [1:57:17<3:47:22, 4.18s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:42,532 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:48:46,179 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:48:42,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████ | 1194/4460 [1:57:24<4:39:52, 5.14s/it]g-point operations will not be computed-02 23:48:42,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████ | 1194/4460 [1:57:24<4:39:52, 5.14s/it]g-point operations will not be computed-02 23:48:42,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████ | 1194/4460 [1:57:24<4:39:52, 5.14s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:49,873 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████ | 1194/4460 [1:57:24<4:39:52, 5.14s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:49,873 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████ | 1194/4460 [1:57:24<4:39:52, 5.14s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:49,873 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████ | 1194/4460 [1:57:24<4:39:52, 5.14s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:49,873 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████ | 1195/4460 [1:57:32<5:15:00, 5.79s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:57,165 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████ | 1195/4460 [1:57:32<5:15:00, 
5.79s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:57,165 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████ | 1195/4460 [1:57:32<5:15:00, 5.79s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:57,165 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████ | 1196/4460 [1:57:39<5:36:46, 6.19s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:57,165 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████ | 1196/4460 [1:57:39<5:36:46, 6.19s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:57,165 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.0197, 'learning_rate': 9.433526011560693e-05, 'epoch': 1.34} 27%|████████████████████ | 1196/4460 [1:57:39<5:36:46, 6.19s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:57,165 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████ | 1196/4460 [1:57:39<5:36:46, 6.19s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:57,165 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████ | 1196/4460 [1:57:39<5:36:46, 6.19s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:48:57,165 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▏ | 1197/4460 [1:57:46<5:52:03, 6.47s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▏ | 1197/4460 [1:57:46<5:52:03, 6.47s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:49:11,374 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 27%|████████████████████▏ | 1197/4460 [1:57:46<5:52:03, 6.47s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▏ | 1198/4460 [1:57:53<6:02:07, 6.66s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▏ | 1198/4460 [1:57:53<6:02:07, 6.66s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8511, 'learning_rate': 9.427745664739884e-05, 'epoch': 1.34} 27%|████████████████████▏ | 1198/4460 [1:57:53<6:02:07, 6.66s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:49:23,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:49:23,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9713, 'learning_rate': 9.424855491329481e-05, 'epoch': 1.34} [WARNING|modeling_utils.py:388] 2022-03-02 23:49:23,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:49:23,668 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▏ | 1200/4460 [1:58:08<6:22:41, 7.04s/it]g-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▏ | 1200/4460 [1:58:08<6:22:41, 7.04s/it]g-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.933, 'learning_rate': 9.421965317919076e-05, 'epoch': 1.35} 27%|████████████████████▏ | 1200/4460 [1:58:08<6:22:41, 7.04s/it]g-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▏ | 1200/4460 [1:58:08<6:22:41, 7.04s/it]g-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▏ | 1201/4460 [1:58:15<6:23:24, 7.06s/it]g-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▏ | 1201/4460 [1:58:15<6:23:24, 7.06s/it]g-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:49:41,953 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:49:41,953 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▏ | 1202/4460 [1:58:22<6:21:35, 7.03s/it]g-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▏ | 1202/4460 [1:58:22<6:21:35, 7.03s/it]g-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8906, 'learning_rate': 9.416184971098267e-05, 'epoch': 1.35} 27%|████████████████████▏ | 1202/4460 [1:58:22<6:21:35, 7.03s/it]g-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:49:52,244 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:49:52,244 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.817, 'learning_rate': 9.413294797687862e-05, 'epoch': 1.35} [WARNING|modeling_utils.py:388] 2022-03-02 23:49:52,244 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:49:52,244 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▏ | 1204/4460 [1:58:36<6:15:54, 6.93s/it]g-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▏ | 1204/4460 [1:58:36<6:15:54, 6.93s/it]g-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:50:02,547 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:50:02,547 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:50:02,547 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1205/4460 [1:58:42<6:15:01, 6.91s/it]g-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1205/4460 [1:58:42<6:15:01, 6.91s/it]g-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:50:11,074 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:50:11,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1206/4460 [1:58:49<6:13:17, 6.88s/it]g-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1206/4460 [1:58:49<6:13:17, 6.88s/it]g-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1206/4460 [1:58:49<6:13:17, 6.88s/it]g-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1206/4460 [1:58:49<6:13:17, 6.88s/it]g-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1206/4460 [1:58:49<6:13:17, 6.88s/it]g-point operations will not be computed-02 23:49:11,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1207/4460 [1:58:56<6:11:36, 6.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:50:21,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1207/4460 [1:58:56<6:11:36, 6.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:50:21,288 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 27%|████████████████████▎ | 1207/4460 [1:58:56<6:11:36, 6.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:50:21,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1207/4460 [1:58:56<6:11:36, 6.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:50:21,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1208/4460 [1:59:03<6:09:31, 6.82s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:50:21,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1208/4460 [1:59:03<6:09:31, 6.82s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:50:21,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:50:31,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:50:21,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:50:31,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:50:21,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1209/4460 [1:59:10<6:07:59, 6.79s/it]g-point operations will not be computed-02 23:50:21,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1209/4460 [1:59:10<6:07:59, 6.79s/it]g-point operations will not be computed-02 23:50:21,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1209/4460 [1:59:10<6:07:59, 
6.79s/it]g-point operations will not be computed-02 23:50:21,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1209/4460 [1:59:10<6:07:59, 6.79s/it]g-point operations will not be computed-02 23:50:21,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1209/4460 [1:59:10<6:07:59, 6.79s/it]g-point operations will not be computed-02 23:50:21,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1210/4460 [1:59:16<6:04:46, 6.73s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1210/4460 [1:59:16<6:04:46, 6.73s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1210/4460 [1:59:16<6:04:46, 6.73s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1210/4460 [1:59:16<6:04:46, 6.73s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▎ | 1211/4460 [1:59:23<6:02:39, 6.70s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:50:49,575 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:50:49,575 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:50:49,575 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▍ | 1212/4460 [1:59:29<6:01:41, 6.68s/it]g-point operations will not be computed-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▍ | 1212/4460 [1:59:29<6:01:41, 6.68s/it]g-point operations will not be computed-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:50:57,825 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▍ | 1213/4460 [1:59:36<6:00:04, 6.65s/it]g-point operations will not be computed-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▍ | 1213/4460 [1:59:36<6:00:04, 6.65s/it]g-point operations will not be computed-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8806, 'learning_rate': 9.384393063583816e-05, 'epoch': 1.36} 27%|████████████████████▍ | 1213/4460 [1:59:36<6:00:04, 6.65s/it]g-point operations will not be computed-02 23:50:41,350 >> Could not estimate the number 
of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:51:05,964 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:51:05,964 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7568, 'learning_rate': 9.38150289017341e-05, 'epoch': 1.36} [WARNING|modeling_utils.py:388] 2022-03-02 23:51:05,964 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:51:12,433 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:51:12,433 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9281, 'learning_rate': 9.378612716763006e-05, 'epoch': 1.36} [WARNING|modeling_utils.py:388] 2022-03-02 23:51:15,772 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:51:15,772 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:51:15,772 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:50:41,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▍ | 1216/4460 [1:59:55<5:54:04, 6.55s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:51:20,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▍ | 1216/4460 [1:59:55<5:54:04, 6.55s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:51:20,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▍ | 1216/4460 [1:59:55<5:54:04, 6.55s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:51:20,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▍ | 1216/4460 [1:59:55<5:54:04, 6.55s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:51:20,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▍ | 1217/4460 [2:00:02<5:52:02, 6.51s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:51:20,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▍ | 1217/4460 [2:00:02<5:52:02, 6.51s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:51:20,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:51:30,167 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:51:20,634 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:51:30,167 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 27%|████████████████████▍ | 1218/4460 [2:00:08<5:49:14, 6.46s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:51:36,480 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 27%|████████████████████▍ | 1219/4460 [2:00:15<5:47:13, 6.43s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:51:42,822 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 27%|████████████████████▌ | 1220/4460 [2:00:21<5:45:20, 6.40s/it]
{'loss': 6.8054, 'learning_rate': 9.364161849710983e-05, 'epoch': 1.37}
[WARNING|modeling_utils.py:388] 2022-03-02 23:51:50,630 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.798, 'learning_rate': 9.361271676300578e-05, 'epoch': 1.37}
[WARNING|modeling_utils.py:388] 2022-03-02 23:51:56,806 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.6997, 'learning_rate': 9.358381502890174e-05, 'epoch': 1.37}
[WARNING|modeling_utils.py:388] 2022-03-02 23:52:03,010 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.7324, 'learning_rate': 9.355491329479769e-05, 'epoch': 1.37}
[WARNING|modeling_utils.py:388] 2022-03-02 23:52:09,079 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.7915, 'learning_rate': 9.352601156069364e-05, 'epoch': 1.37}
[WARNING|modeling_utils.py:388] 2022-03-02 23:52:13,606 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 27%|████████████████████▌ | 1225/4460 [2:00:52<5:40:16, 6.31s/it]
{'loss': 6.8616, 'learning_rate': 9.34971098265896e-05, 'epoch': 1.37}
[WARNING|modeling_utils.py:388] 2022-03-02 23:52:21,553 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8569, 'learning_rate': 9.346820809248555e-05, 'epoch': 1.37}
[WARNING|modeling_utils.py:388] 2022-03-02 23:52:25,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 28%|████████████████████▋ | 1227/4460 [2:01:04<5:26:13, 6.05s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:52:30,186 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 28%|████████████████████▋ | 1228/4460 [2:01:09<5:20:09, 5.94s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:52:35,812 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 28%|████████████████████▋ | 1229/4460 [2:01:15<5:14:12, 5.83s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:52:39,979 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:52:43,996 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.6154, 'learning_rate': 9.335260115606937e-05, 'epoch': 1.38}
[WARNING|modeling_utils.py:388] 2022-03-02 23:52:48,028 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 28%|████████████████████▋ | 1231/4460 [2:01:26<5:01:40, 5.61s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:52:50,725 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:52:54,484 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.7778, 'learning_rate': 9.329479768786129e-05, 'epoch': 1.38}
[WARNING|modeling_utils.py:388] 2022-03-02 23:52:58,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 28%|████████████████████▋ | 1233/4460 [2:01:36<4:45:26, 5.31s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:00,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:02,986 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 28%|████████████████████▊ | 1234/4460 [2:01:41<4:35:11, 5.12s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:06,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:08,391 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:10,490 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:12,380 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:14,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:16,095 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:17,923 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:21,186 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:22,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:24,180 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:26,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:29,248 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:30,245 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.1078, 'learning_rate': 9.300578034682082e-05, 'epoch': 1.39}
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:34,244 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:37,896 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:41,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:45,294 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 28%|████████████████████▉ | 1245/4460 [2:02:29<5:13:22, 5.85s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:53:56,196 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 28%|████████████████████▉ | 1246/4460 [2:02:36<5:35:26, 6.26s/it]
 28%|████████████████████▉ | 1247/4460 [2:02:43<5:49:36, 6.53s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:54:12,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 28%|████████████████████▉ | 1248/4460 [2:02:51<6:00:33, 6.74s/it]
{'loss': 6.8727, 'learning_rate': 9.283236994219654e-05, 'epoch': 1.4}
 28%|█████████████████████ | 1249/4460 [2:02:58<6:05:56, 6.84s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:54:26,555 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8564, 'learning_rate': 9.277456647398845e-05, 'epoch': 1.4}
 28%|█████████████████████ | 1251/4460 [2:03:13<6:19:45, 7.10s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:54:43,059 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.61, 'learning_rate': 9.271676300578035e-05, 'epoch': 1.4}
 28%|█████████████████████ | 1253/4460 [2:03:27<6:17:10, 7.06s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:54:53,539 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 28%|█████████████████████ | 1254/4460 [2:03:33<6:14:39, 7.01s/it]
{'loss': 6.8845, 'learning_rate': 9.265895953757225e-05, 'epoch': 1.41}
 28%|█████████████████████ | 1255/4460 [2:03:40<6:12:34, 6.97s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:55:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 28%|█████████████████████ | 1256/4460 [2:03:47<6:11:09, 6.95s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:55:15,891 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 28%|█████████████████████▏ | 1257/4460 [2:03:54<6:09:13, 6.92s/it]
{'loss': 6.7033, 'learning_rate': 9.257225433526012e-05, 'epoch': 1.41}
[WARNING|modeling_utils.py:388] 2022-03-02 23:55:24,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8093, 'learning_rate': 9.254335260115608e-05, 'epoch': 1.41}
 28%|█████████████████████▏ | 1259/4460 [2:04:08<6:04:30, 6.83s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:55:36,168 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 28%|█████████████████████▏ | 1260/4460 [2:04:14<6:02:52, 6.80s/it]
{'loss': 6.7379, 'learning_rate': 9.248554913294798e-05, 'epoch': 1.41}
[WARNING|modeling_utils.py:388] 2022-03-02 23:55:44,460 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.6826, 'learning_rate': 9.245664739884394e-05, 'epoch': 1.41}
 28%|█████████████████████▏ | 1262/4460 [2:04:28<5:59:21, 6.74s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:55:54,543 >> Could not estimate the number of tokens of the input, floating-point
operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:55:54,543 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:53:00,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▏ | 1263/4460 [2:04:34<5:58:18, 6.72s/it]g-point operations will not be computed-02 23:53:00,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▏ | 1263/4460 [2:04:34<5:58:18, 6.72s/it]g-point operations will not be computed-02 23:53:00,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.5806, 'learning_rate': 9.239884393063584e-05, 'epoch': 1.42} [WARNING|modeling_utils.py:388] 2022-03-02 23:56:02,844 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:53:00,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▎ | 1264/4460 [2:04:41<5:56:36, 6.69s/it]g-point operations will not be computed-02 23:53:00,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▎ | 1264/4460 [2:04:41<5:56:36, 6.69s/it]g-point operations will not be computed-02 23:53:00,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7554, 'learning_rate': 9.23699421965318e-05, 'epoch': 1.42} [WARNING|modeling_utils.py:388] 2022-03-02 23:56:09,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:53:00,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▎ | 1265/4460 [2:04:47<5:51:37, 6.60s/it]g-point 
operations will not be computed-02 23:53:00,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▎ | 1265/4460 [2:04:47<5:51:37, 6.60s/it]g-point operations will not be computed-02 23:53:00,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6997, 'learning_rate': 9.234104046242775e-05, 'epoch': 1.42} 28%|█████████████████████▎ | 1265/4460 [2:04:47<5:51:37, 6.60s/it]g-point operations will not be computed-02 23:53:00,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▎ | 1265/4460 [2:04:47<5:51:37, 6.60s/it]g-point operations will not be computed-02 23:53:00,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▎ | 1265/4460 [2:04:47<5:51:37, 6.60s/it]g-point operations will not be computed-02 23:53:00,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▎ | 1266/4460 [2:04:54<5:49:54, 6.57s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▎ | 1266/4460 [2:04:54<5:49:54, 6.57s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▎ | 1266/4460 [2:04:54<5:49:54, 6.57s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▎ | 1267/4460 [2:05:00<5:47:27, 6.53s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 28%|█████████████████████▎ | 1267/4460 [2:05:00<5:47:27, 6.53s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:56:27,014 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:56:27,014 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▎ | 1268/4460 [2:05:07<5:44:26, 6.47s/it]g-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▎ | 1268/4460 [2:05:07<5:44:26, 6.47s/it]g-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:56:33,382 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:56:33,382 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▎ | 1269/4460 [2:05:13<5:41:29, 6.42s/it]g-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed 28%|█████████████████████▎ | 1269/4460 [2:05:13<5:41:29, 6.42s/it]g-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:56:39,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:56:39,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▎ | 1270/4460 [2:05:19<5:39:12, 6.38s/it]g-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▎ | 1270/4460 [2:05:19<5:39:12, 6.38s/it]g-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7957, 'learning_rate': 9.219653179190752e-05, 'epoch': 1.42} [WARNING|modeling_utils.py:388] 2022-03-02 23:56:47,427 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▎ | 1271/4460 [2:05:25<5:36:38, 6.33s/it]g-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▎ | 1271/4460 [2:05:25<5:36:38, 6.33s/it]g-point operations will not be computed-02 23:56:19,084 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6207, 'learning_rate': 9.216763005780348e-05, 'epoch': 1.42} [WARNING|modeling_utils.py:388] 2022-03-02 23:56:53,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|█████████████████████▍ | 1272/4460 [2:05:32<5:34:46, 6.30s/it]g-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|█████████████████████▍ | 1272/4460 [2:05:32<5:34:46, 6.30s/it]g-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6815, 'learning_rate': 9.213872832369944e-05, 'epoch': 1.43} [WARNING|modeling_utils.py:388] 2022-03-02 23:56:59,848 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|█████████████████████▍ | 1273/4460 [2:05:38<5:33:06, 6.27s/it]g-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|█████████████████████▍ | 1273/4460 [2:05:38<5:33:06, 6.27s/it]g-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6672, 'learning_rate': 9.210982658959538e-05, 'epoch': 1.43} [WARNING|modeling_utils.py:388] 2022-03-02 23:57:05,983 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens 
of the input, floating-point operations will not be computed 29%|█████████████████████▍ | 1274/4460 [2:05:44<5:30:14, 6.22s/it]g-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|█████████████████████▍ | 1274/4460 [2:05:44<5:30:14, 6.22s/it]g-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6621, 'learning_rate': 9.208092485549133e-05, 'epoch': 1.43} [WARNING|modeling_utils.py:388] 2022-03-02 23:57:11,965 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:57:11,965 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|█████████████████████▍ | 1275/4460 [2:05:51<5:35:19, 6.32s/it]g-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|█████████████████████▍ | 1275/4460 [2:05:51<5:35:19, 6.32s/it]g-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|█████████████████████▍ | 1275/4460 [2:05:51<5:35:19, 6.32s/it]g-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|█████████████████████▍ | 1275/4460 [2:05:51<5:35:19, 6.32s/it]g-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:57:19,963 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:57:19,963 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:57:24,331 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:57:24,331 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|█████████████████████▍ | 1277/4460 [2:06:02<5:22:20, 6.08s/it]g-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:57:28,667 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:57:28,667 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 
2022-03-02 23:57:28,667 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:56:19,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|█████████████████████▍ | 1278/4460 [2:06:08<5:15:18, 5.95s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:57:32,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|█████████████████████▍ | 1278/4460 [2:06:08<5:15:18, 5.95s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:57:32,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|█████████████████████▍ | 1278/4460 [2:06:08<5:15:18, 5.95s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:57:32,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:57:36,935 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:32,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:57:36,935 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:32,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:57:41,051 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:32,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:57:41,051 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:32,865 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 29%|█████████████████████▌ | 1280/4460 [2:06:19<5:02:17, 5.70s/it]g-point operations will not be computed-02 23:57:32,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:57:45,062 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:32,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:57:45,062 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:32,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:57:45,062 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:32,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|█████████████████████▌ | 1281/4460 [2:06:24<4:56:12, 5.59s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:57:49,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|█████████████████████▌ | 1281/4460 [2:06:24<4:56:12, 5.59s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:57:49,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|█████████████████████▌ | 1281/4460 [2:06:24<4:56:12, 5.59s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:57:49,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:57:52,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:49,041 >> Could not estimate the number 
of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:57:55,417 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:49,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:57:55,417 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:49,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:57:55,417 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:49,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|█████████████████████▌ | 1283/4460 [2:06:34<4:42:26, 5.33s/it][WARNING|modeling_utils.py:388] 2022-03-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:01,505 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:01,505 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|█████████████████████▌ | 1284/4460 [2:06:39<4:34:56, 5.19s/it]g-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:05,042 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:05,042 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:07,193 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:09,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:09,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:11,270 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:13,202 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:13,202 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:14,980 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:16,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:16,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:18,424 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:21,630 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:21,630 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:23,124 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:23,124 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:24,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:24,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:26,935 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:26,935 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:29,137 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 23:58:29,137 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 23:57:59,155 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:58:32,996 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:58:36,661 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-02 23:58:40,432 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
29%|█████████████████████▊ | 1294/4460 [2:07:21<4:36:19, 5.24s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:58:47,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
29%|█████████████████████▊ | 1295/4460 [2:07:28<5:08:29, 5.85s/it]
29%|█████████████████████▊ | 1296/4460 [2:07:35<5:28:59, 6.24s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:59:00,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
29%|█████████████████████▊ | 1297/4460 [2:07:42<5:43:46, 6.52s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:59:12,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8277, 'learning_rate': 9.138728323699423e-05, 'epoch': 1.46}
29%|█████████████████████▊ | 1299/4460 [2:07:57<6:00:39, 6.85s/it]
29%|█████████████████████▊ | 1300/4460 [2:08:04<6:12:08, 7.07s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:59:34,717 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.9142, 'learning_rate': 9.130057803468209e-05, 'epoch': 1.46}
29%|█████████████████████▉ | 1302/4460 [2:08:18<6:11:59, 7.07s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:59:46,911 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
29%|█████████████████████▉ | 1303/4460 [2:08:25<6:10:17, 7.04s/it]
{'loss': 6.7351, 'learning_rate': 9.1242774566474e-05, 'epoch': 1.46}
29%|█████████████████████▉ | 1304/4460 [2:08:32<6:08:22, 7.00s/it]
[WARNING|modeling_utils.py:388] 2022-03-02 23:59:59,119 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
29%|█████████████████████▉ | 1305/4460 [2:08:39<6:06:06, 6.96s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:00:07,679 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
29%|█████████████████████▉ | 1306/4460 [2:08:46<6:04:38, 6.94s/it]
{'loss': 6.7902, 'learning_rate': 9.115606936416185e-05, 'epoch': 1.46}
29%|█████████████████████▉ | 1307/4460 [2:08:53<6:02:49, 6.90s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:00:19,645 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
29%|█████████████████████▉ | 1308/4460 [2:08:59<6:00:20, 6.86s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:00:29,680 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.6744, 'learning_rate': 9.106936416184971e-05, 'epoch': 1.47}
29%|██████████████████████ | 1310/4460 [2:09:13<6:03:19, 6.92s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:00:40,343 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
29%|██████████████████████ | 1311/4460 [2:09:20<6:00:42, 6.87s/it]
{'loss': 6.7324, 'learning_rate': 9.101156069364162e-05, 'epoch': 1.47}
[WARNING|modeling_utils.py:388] 2022-03-03 00:00:48,640 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
29%|██████████████████████ | 1312/4460 [2:09:27<5:57:15, 6.81s/it]
{'loss': 6.8473, 'learning_rate': 9.098265895953757e-05, 'epoch': 1.47}
[WARNING|modeling_utils.py:388] 2022-03-03 00:00:56,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.6559, 'learning_rate': 9.095375722543353e-05, 'epoch': 1.47}
29%|██████████████████████ | 1314/4460 [2:09:40<5:51:36, 6.71s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:01:06,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
29%|██████████████████████ | 1315/4460 [2:09:47<5:49:01, 6.66s/it]
{'loss': 6.7583, 'learning_rate': 9.089595375722544e-05, 'epoch': 1.47}
[WARNING|modeling_utils.py:388] 2022-03-03 00:01:14,938 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
30%|██████████████████████▏ | 1316/4460 [2:09:53<5:46:14, 6.61s/it]
{'loss': 6.6478, 'learning_rate': 9.086705202312139e-05, 'epoch': 1.48}
[WARNING|modeling_utils.py:388] 2022-03-03 00:01:21,394 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
30%|██████████████████████▏ | 1317/4460 [2:10:00<5:43:54, 6.57s/it]
{'loss': 6.7939, 'learning_rate': 9.083815028901734e-05, 'epoch': 1.48}
[WARNING|modeling_utils.py:388] 2022-03-03 00:01:29,294 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8075, 'learning_rate': 9.08092485549133e-05, 'epoch': 1.48}
[WARNING|modeling_utils.py:388] 2022-03-03 00:01:35,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8408, 'learning_rate': 9.078034682080925e-05, 'epoch': 1.48}
[WARNING|modeling_utils.py:388] 2022-03-03 00:01:41,846 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8165, 'learning_rate': 9.075144508670522e-05, 'epoch': 1.48}
[WARNING|modeling_utils.py:388] 2022-03-03 00:01:48,147 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.6541, 'learning_rate': 9.072254335260117e-05, 'epoch': 1.48}
[WARNING|modeling_utils.py:388] 2022-03-03 00:01:54,371 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.7952, 'learning_rate': 9.069364161849711e-05, 'epoch': 1.48}
[WARNING|modeling_utils.py:388] 2022-03-03 00:02:00,450 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.7428, 'learning_rate': 9.066473988439306e-05, 'epoch': 1.48}
30%|██████████████████████▎ | 1324/4460 [2:10:43<5:23:13, 6.18s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:02:08,079 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
30%|██████████████████████▎ | 1325/4460 [2:10:50<5:29:48, 6.31s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:02:16,238 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
30%|██████████████████████▎ | 1326/4460 [2:10:56<5:24:53, 6.22s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:02:20,637 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.7064, 'learning_rate': 9.057803468208092e-05, 'epoch': 1.49}
[WARNING|modeling_utils.py:388] 2022-03-03 00:02:24,880 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.8053, 'learning_rate': 9.054913294797688e-05, 'epoch': 1.49}
[WARNING|modeling_utils.py:388] 2022-03-03 00:02:30,529 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:02:34,764 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
30%|██████████████████████▎ | 1329/4460 [2:11:13<5:05:22, 5.85s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:02:38,927 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
30%|██████████████████████▎ | 1330/4460 [2:11:18<4:59:22, 5.74s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:02:43,073 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:02:46,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:02:50,838 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
30%|██████████████████████▍ | 1332/4460 [2:11:29<4:44:56, 5.47s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
30%|██████████████████████▍ | 
1332/4460 [2:11:29<4:44:56, 5.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:02:56,933 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:02:59,289 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:01,422 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:01,422 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:03,633 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:05,629 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:05,629 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:07,653 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:09,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:09,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:11,430 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:11,430 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:13,148 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:14,894 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:14,894 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:18,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:19,526 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:19,526 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:22,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:22,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:23,527 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:23,527 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:25,846 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:25,846 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:26,815 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:26,815 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:30,764 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:30,764 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:30,764 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:34,399 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:34,399 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:38,151 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:41,779 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:41,779 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7074, 'learning_rate': 9.00578034682081e-05, 'epoch': 1.51} [WARNING|modeling_utils.py:388] 2022-03-03 00:03:41,779 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:41,779 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:41,779 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|██████████████████████▌ | 1345/4460 [2:12:26<5:02:29, 5.83s/it]g-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|██████████████████████▌ | 1345/4460 [2:12:26<5:02:29, 5.83s/it]g-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|██████████████████████▌ | 1345/4460 [2:12:26<5:02:29, 5.83s/it]g-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:56,346 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:56,346 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7216, 'learning_rate': 9e-05, 'epoch': 1.51} [WARNING|modeling_utils.py:388] 2022-03-03 
00:03:56,346 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:56,346 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:03:56,346 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|██████████████████████▋ | 1347/4460 [2:12:40<5:39:20, 6.54s/it]g-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|██████████████████████▋ | 1347/4460 [2:12:40<5:39:20, 6.54s/it]g-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|██████████████████████▋ | 1347/4460 [2:12:40<5:39:20, 6.54s/it]g-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|██████████████████████▋ | 1347/4460 [2:12:40<5:39:20, 6.54s/it]g-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|██████████████████████▋ | 1347/4460 [2:12:40<5:39:20, 6.54s/it]g-point operations will not be computed-03 00:02:53,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|██████████████████████▋ | 1348/4460 [2:12:47<5:49:23, 
6.74s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:04:12,625 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|██████████████████████▋ | 1348/4460 [2:12:47<5:49:23, 6.74s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:04:12,625 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|██████████████████████▋ | 1348/4460 [2:12:47<5:49:23, 6.74s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:04:12,625 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|██████████████████████▋ | 1348/4460 [2:12:47<5:49:23, 6.74s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:04:12,625 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|██████████████████████▋ | 1349/4460 [2:12:54<5:55:52, 6.86s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:04:12,625 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|██████████████████████▋ | 1349/4460 [2:12:54<5:55:52, 6.86s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:04:12,625 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|██████████████████████▋ | 1349/4460 [2:12:54<5:55:52, 6.86s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:04:12,625 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|██████████████████████▋ | 1349/4460 [2:12:54<5:55:52, 6.86s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:04:12,625 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|██████████████████████▋ | 1349/4460 [2:12:54<5:55:52, 6.86s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:04:12,625 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
30%|██████████████████████▋ | 1350/4460 [2:13:02<6:06:58, 7.08s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:04:29,107 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
30%|██████████████████████▋ | 1351/4460 [2:13:09<6:06:51, 7.08s/it]
{'loss': 6.8023, 'learning_rate': 8.985549132947977e-05, 'epoch': 1.51}
30%|██████████████████████▋ | 1352/4460 [2:13:16<6:06:01, 7.07s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:04:43,109 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
30%|██████████████████████▊ | 1353/4460 [2:13:23<6:03:01, 7.01s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:04:53,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.7931, 'learning_rate': 8.976878612716763e-05, 'epoch': 1.52}
30%|██████████████████████▊ | 1355/4460 [2:13:37<5:59:17, 6.94s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:05:03,692 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
30%|██████████████████████▊ | 1356/4460 [2:13:44<5:56:40, 6.89s/it]
{'loss': 6.7739, 'learning_rate': 8.971098265895954e-05, 'epoch': 1.52}
[WARNING|modeling_utils.py:388] 2022-03-03 00:05:12,141 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
30%|██████████████████████▊ | 1357/4460 [2:13:50<5:55:24, 6.87s/it]
{'loss': 6.7204, 'learning_rate': 8.96820809248555e-05, 'epoch': 1.52}
30%|██████████████████████▊ | 1358/4460 [2:13:57<5:53:57, 6.85s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:05:24,019 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
30%|██████████████████████▊ | 1359/4460 [2:14:04<5:51:50, 6.81s/it]
{'loss': 6.6448, 'learning_rate': 8.96242774566474e-05, 'epoch': 1.52}
[WARNING|modeling_utils.py:388] 2022-03-03 00:05:32,371 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
30%|██████████████████████▊ | 1360/4460 [2:14:11<5:50:02, 6.77s/it]
{'loss': 6.5788, 'learning_rate': 8.959537572254337e-05, 'epoch': 1.52}
[WARNING|modeling_utils.py:388] 2022-03-03 00:05:40,662 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.4342, 'learning_rate': 8.956647398843932e-05, 'epoch': 1.53}
31%|██████████████████████▉ | 1362/4460 [2:14:24<5:46:31, 6.71s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:05:49,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
31%|██████████████████████▉ | 1363/4460 [2:14:30<5:43:51, 6.66s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:05:57,168 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
31%|██████████████████████▉ | 1364/4460 [2:14:37<5:40:53, 6.61s/it]
{'loss': 6.7101, 'learning_rate': 8.947976878612717e-05, 'epoch': 1.53} [WARNING|modeling_utils.py:388] 2022-03-03 00:06:05,341 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:05:49,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|██████████████████████▉ | 1365/4460 [2:14:43<5:39:57, 6.59s/it]g-point operations will not be computed-03 00:05:49,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|██████████████████████▉ | 1365/4460 [2:14:43<5:39:57, 6.59s/it]g-point operations will not be computed-03 00:05:49,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.773, 'learning_rate': 8.945086705202312e-05, 'epoch': 1.53} 31%|██████████████████████▉ | 1365/4460 [2:14:43<5:39:57, 6.59s/it]g-point operations will not be computed-03 00:05:49,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:06:13,311 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:05:49,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:06:13,311 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:05:49,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.829, 'learning_rate': 8.942196531791907e-05, 'epoch': 1.53} [WARNING|modeling_utils.py:388] 2022-03-03 00:06:13,311 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:05:49,084 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:06:19,724 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6826, 'learning_rate': 8.939306358381503e-05, 'epoch': 1.53} 31%|███████████████████████ | 1368/4460 [2:15:03<5:33:06,
6.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:06:27,757 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|███████████████████████ | 1369/4460 [2:15:09<5:30:00, 6.41s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:06:34,063 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|███████████████████████ | 1370/4460 [2:15:15<5:28:06, 6.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:06:40,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
31%|███████████████████████ | 1371/4460 [2:15:21<5:26:08, 6.33s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:06:46,564 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|███████████████████████ | 1372/4460 [2:15:28<5:22:40, 6.27s/it] [WARNING|modeling_utils.py:388] 2022-03-03 00:06:54,150 >> Could not estimate
the number of tokens of the input, floating-point operations will not be computed 31%|███████████████████████ | 1373/4460 [2:15:34<5:19:59, 6.22s/it] [WARNING|modeling_utils.py:388] 2022-03-03 00:07:00,223 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|███████████████████████ | 1374/4460
[2:15:40<5:16:37, 6.16s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:07:04,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|███████████████████████ | 1375/4460 [2:15:46<5:19:57, 6.22s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:07:11,124 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|███████████████████████▏ | 1376/4460 [2:15:52<5:13:49, 6.11s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:07:16,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:07:21,059 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.4398, 'learning_rate': 8.910404624277458e-05, 'epoch': 1.54} [WARNING|modeling_utils.py:388] 2022-03-03 00:07:25,321 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|███████████████████████▏ | 1378/4460 [2:16:03<5:02:06, 5.88s/it] [WARNING|modeling_utils.py:388] 2022-03-03 00:07:29,549 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|███████████████████████▏ | 1379/4460 [2:16:09<4:57:11, 5.79s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:07:33,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:07:37,713 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8101, 'learning_rate': 8.901734104046244e-05, 'epoch': 1.55} [WARNING|modeling_utils.py:388] 2022-03-03 00:07:41,731 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|███████████████████████▏ | 1381/4460 [2:16:19<4:45:09, 5.56s/it] [WARNING|modeling_utils.py:388] 2022-03-03 00:07:45,623 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:07:48,063 >> Could not estimate the number of tokens of the input,
floating-point operations will not be computed {'loss': 6.725, 'learning_rate': 8.895953757225434e-05, 'epoch': 1.55} [WARNING|modeling_utils.py:388] 2022-03-03 00:07:51,703 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|███████████████████████▎ | 1383/4460 [2:16:29<4:28:10, 5.23s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:07:54,070 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:07:56,246 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|███████████████████████▎ | 1384/4460 [2:16:34<4:16:05, 5.00s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:07:58,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss':
6.5428, 'learning_rate': 8.890173410404625e-05, 'epoch': 1.55} [WARNING|modeling_utils.py:388] 2022-03-03 00:08:01,499 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:08:03,604 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:08:05,485 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:08:07,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:08:10,771 >> Could not estimate the number of tokens of the input, floating-point
operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:08:12,291 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:08:13,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:08:16,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:08:17,781 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:08:20,154 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:08:22,335 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.5363, 'learning_rate': 8.867052023121388e-05, 'epoch': 1.56} [WARNING|modeling_utils.py:388] 2022-03-03 00:08:26,311 >> Could not estimate the number of tokens of the input, floating-point
operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:08:30,013 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9825, 'learning_rate': 8.864161849710983e-05, 'epoch': 1.56} [WARNING|modeling_utils.py:388] 2022-03-03 00:08:33,817 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9885, 'learning_rate': 8.861271676300578e-05, 'epoch': 1.56} [WARNING|modeling_utils.py:388] 2022-03-03 00:08:37,501 >> Could not estimate the number of tokens of the input, floating-point operations will
not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:08:44,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8796, 'learning_rate': 8.858381502890174e-05, 'epoch': 1.56} 31%|███████████████████████▍ | 1396/4460 [2:17:29<5:21:52, 6.30s/it] {'loss': 6.8141, 'learning_rate': 8.855491329479769e-05, 'epoch': 1.57} 31%|███████████████████████▍ | 1397/4460 [2:17:36<5:34:55, 6.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:09:01,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|███████████████████████▌ | 1398/4460 [2:17:43<5:45:26,
6.77s/it] [WARNING|modeling_utils.py:388] 2022-03-03 00:09:13,801 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8146, 'learning_rate': 8.846820809248555e-05, 'epoch': 1.57} 31%|███████████████████████▌ | 1400/4460 [2:17:58<6:03:32, 7.13s/it] 31%|███████████████████████▌ | 1401/4460 [2:18:05<6:04:27, 7.15s/it] [WARNING|modeling_utils.py:388] 2022-03-03 00:09:33,950 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|███████████████████████▌ | 1402/4460 [2:18:12<6:01:26, 7.09s/it] 31%|███████████████████████▌ | 1403/4460 [2:18:19<5:59:02, 7.05s/it] [WARNING|modeling_utils.py:388] 2022-03-03 00:09:46,096 >> Could not estimate the number of tokens of the input, floating-point operations will not
be computed 31%|███████████████████████▌ | 1404/4460 [2:18:26<5:56:22, 7.00s/it]g-point operations will not be computed-03 00:09:01,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|███████████████████████▌ | 1404/4460 [2:18:26<5:56:22, 7.00s/it]g-point operations will not be computed-03 00:09:01,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:09:54,645 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:09:01,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▋ | 1405/4460 [2:18:33<5:54:22, 6.96s/it]g-point operations will not be computed-03 00:09:01,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▋ | 1405/4460 [2:18:33<5:54:22, 6.96s/it]g-point operations will not be computed-03 00:09:01,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6941, 'learning_rate': 8.829479768786127e-05, 'epoch': 1.58} 32%|███████████████████████▋ | 1405/4460 [2:18:33<5:54:22, 6.96s/it]g-point operations will not be computed-03 00:09:01,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:10:03,083 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:09:01,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:10:03,083 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:09:01,265 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed {'loss': 6.794, 'learning_rate': 8.826589595375723e-05, 'epoch': 1.58} [WARNING|modeling_utils.py:388] 2022-03-03 00:10:03,083 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:09:01,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:10:03,083 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:09:01,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:10:03,083 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:09:01,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▋ | 1407/4460 [2:18:46<5:51:06, 6.90s/it]g-point operations will not be computed-03 00:09:01,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▋ | 1407/4460 [2:18:46<5:51:06, 6.90s/it]g-point operations will not be computed-03 00:09:01,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▋ | 1407/4460 [2:18:46<5:51:06, 6.90s/it]g-point operations will not be computed-03 00:09:01,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:10:16,758 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:09:01,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:10:16,758 
>> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:09:01,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6544, 'learning_rate': 8.820809248554913e-05, 'epoch': 1.58} [WARNING|modeling_utils.py:388] 2022-03-03 00:10:16,758 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:09:01,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:10:16,758 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:09:01,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:10:16,758 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:09:01,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▋ | 1409/4460 [2:19:00<5:46:52, 6.82s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▋ | 1409/4460 [2:19:00<5:46:52, 6.82s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▋ | 1409/4460 [2:19:00<5:46:52, 6.82s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▋ | 1409/4460 [2:19:00<5:46:52, 6.82s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:10:25,268 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed 32%|███████████████████████▋ | 1410/4460 [2:19:07<5:46:36, 6.82s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▋ | 1410/4460 [2:19:07<5:46:36, 6.82s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:10:35,291 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▋ | 1411/4460 [2:19:13<5:43:57, 6.77s/it]g-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▋ | 1411/4460 [2:19:13<5:43:57, 6.77s/it]g-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7691, 'learning_rate': 8.812138728323701e-05, 'epoch': 1.58} 32%|███████████████████████▋ | 1411/4460 [2:19:13<5:43:57, 6.77s/it]g-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:10:43,633 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:10:43,633 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 
00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8132, 'learning_rate': 8.809248554913295e-05, 'epoch': 1.58} [WARNING|modeling_utils.py:388] 2022-03-03 00:10:43,633 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:10:43,633 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:10:43,633 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▊ | 1413/4460 [2:19:27<5:40:18, 6.70s/it]g-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:10:53,637 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:10:53,637 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:10:53,637 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:10:25,268 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▊ | 1414/4460 [2:19:33<5:39:07, 6.68s/it]g-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:11:00,168 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:11:00,168 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▊ | 1415/4460 [2:19:40<5:36:11, 6.62s/it]g-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▊ | 1415/4460 [2:19:40<5:36:11, 6.62s/it]g-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7164, 'learning_rate': 8.800578034682081e-05, 'epoch': 1.59} 32%|███████████████████████▊ | 1415/4460 [2:19:40<5:36:11, 6.62s/it]g-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:11:09,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:11:09,793 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.58, 'learning_rate': 8.797687861271676e-05, 'epoch': 1.59} [WARNING|modeling_utils.py:388] 2022-03-03 00:11:09,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:11:09,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:11:09,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:10:25,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▊ | 1417/4460 [2:19:53<5:31:27, 6.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:11:17,929 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▊ | 1417/4460 [2:19:53<5:31:27, 6.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:11:17,929 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▊ | 1417/4460 [2:19:53<5:31:27, 6.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:11:17,929 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|███████████████████████▊ | 1417/4460 [2:19:53<5:31:27, 6.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:11:17,929 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed
32%|███████████████████████▊ | 1418/4460 [2:19:59<5:29:10, 6.49s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:11:24,315 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
32%|███████████████████████▊ | 1419/4460 [2:20:06<5:27:28, 6.46s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:11:32,199 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
32%|███████████████████████▉ | 1420/4460 [2:20:12<5:23:53, 6.39s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:11:38,412 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
32%|███████████████████████▉ | 1421/4460 [2:20:18<5:20:59, 6.34s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:11:44,618 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
32%|███████████████████████▉ | 1422/4460 [2:20:24<5:18:36, 6.29s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:11:50,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
32%|███████████████████████▉ | 1423/4460 [2:20:30<5:15:56, 6.24s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:11:56,880 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
32%|███████████████████████▉ | 1424/4460 [2:20:36<5:13:22, 6.19s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:12:02,981 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
32%|███████████████████████▉ | 1425/4460 [2:20:43<5:20:12, 6.33s/it]
{'loss': 6.5344, 'learning_rate': 8.771676300578036e-05, 'epoch': 1.6}
[WARNING|modeling_utils.py:388] 2022-03-03 00:12:11,169 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
32%|███████████████████████▉ | 1426/4460 [2:20:49<5:16:43, 6.26s/it]
{'loss': 6.5539, 'learning_rate': 8.768786127167631e-05, 'epoch': 1.6}
[WARNING|modeling_utils.py:388] 2022-03-03 00:12:17,002 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
32%|███████████████████████▉ | 1427/4460 [2:20:55<5:09:57, 6.13s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:12:21,379 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
32%|████████████████████████ | 1428/4460 [2:21:01<5:03:26, 6.00s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:12:25,650 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:12:29,781 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.5638, 'learning_rate': 8.760115606936417e-05, 'epoch': 1.6}
[WARNING|modeling_utils.py:388] 2022-03-03 00:12:35,227 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.6755, 'learning_rate': 8.757225433526012e-05, 'epoch': 1.6}
[WARNING|modeling_utils.py:388] 2022-03-03 00:12:39,331 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
32%|████████████████████████ | 1431/4460 [2:21:17<4:45:09, 5.65s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:12:42,004 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:12:45,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.7668, 'learning_rate': 8.751445086705203e-05, 'epoch': 1.61}
[WARNING|modeling_utils.py:388] 2022-03-03 00:12:49,503 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
32%|████████████████████████ | 1433/4460 [2:21:27<4:28:28, 5.32s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:12:51,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:12:55,324 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:12:57,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:12:59,758 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:13:01,836 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:13:03,711 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:13:05,682 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:13:07,393 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:13:09,128 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:13:12,245 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:13:13,660 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:13:16,258 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:13:17,508 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:13:19,882 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:13:20,834 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:13:24,691 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:13:28,387 >> Could not estimate the number of tokens of the input, floating-point operations
will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:13:28,387 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:12:51,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8402, 'learning_rate': 8.719653179190752e-05, 'epoch': 1.62} [WARNING|modeling_utils.py:388] 2022-03-03 00:13:32,202 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:12:51,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:13:32,202 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:12:51,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:13:35,824 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:12:51,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:13:35,824 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:12:51,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7606, 'learning_rate': 8.716763005780347e-05, 'epoch': 1.62} [WARNING|modeling_utils.py:388] 2022-03-03 00:13:35,824 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:12:51,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:13:35,824 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed-03 00:12:51,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:13:35,824 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:12:51,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|████████████████████████▎ | 1445/4460 [2:22:20<4:53:41, 5.84s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|████████████████████████▎ | 1445/4460 [2:22:20<4:53:41, 5.84s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|████████████████████████▎ | 1445/4460 [2:22:20<4:53:41, 5.84s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|████████████████████████▎ | 1446/4460 [2:22:27<5:12:17, 6.22s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|████████████████████████▎ | 1446/4460 [2:22:27<5:12:17, 6.22s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7477, 'learning_rate': 8.710982658959538e-05, 'epoch': 1.62} 32%|████████████████████████▎ | 1446/4460 [2:22:27<5:12:17, 6.22s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|████████████████████████▎ | 1446/4460 [2:22:27<5:12:17, 6.22s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:13:45,031 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed 32%|████████████████████████▎ | 1447/4460 [2:22:34<5:26:53, 6.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|████████████████████████▎ | 1447/4460 [2:22:34<5:26:53, 6.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8106, 'learning_rate': 8.708092485549133e-05, 'epoch': 1.62} [WARNING|modeling_utils.py:388] 2022-03-03 00:14:02,886 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|████████████████████████▎ | 1448/4460 [2:22:41<5:37:51, 6.73s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|████████████████████████▎ | 1448/4460 [2:22:41<5:37:51, 6.73s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6693, 'learning_rate': 8.705202312138728e-05, 'epoch': 1.62} 32%|████████████████████████▎ | 1448/4460 [2:22:41<5:37:51, 6.73s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|████████████████████████▎ | 1448/4460 [2:22:41<5:37:51, 6.73s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|████████████████████████▎ | 1448/4460 [2:22:41<5:37:51, 6.73s/it]g-point operations will not be 
computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|████████████████████████▎ | 1449/4460 [2:22:48<5:42:22, 6.82s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:14:15,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:14:15,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1450/4460 [2:22:56<5:54:29, 7.07s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1450/4460 [2:22:56<5:54:29, 7.07s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.863, 'learning_rate': 8.69942196531792e-05, 'epoch': 1.63} 33%|████████████████████████▍ | 1450/4460 [2:22:56<5:54:29, 7.07s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1450/4460 [2:22:56<5:54:29, 7.07s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1451/4460 [2:23:03<5:55:50, 7.10s/it]g-point operations will 
not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1451/4460 [2:23:03<5:55:50, 7.10s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:14:30,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:14:30,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:14:30,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1452/4460 [2:23:10<5:53:44, 7.06s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1452/4460 [2:23:10<5:53:44, 7.06s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1452/4460 [2:23:10<5:53:44, 7.06s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1452/4460 [2:23:10<5:53:44, 7.06s/it]g-point operations will not be 
computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1452/4460 [2:23:10<5:53:44, 7.06s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1453/4460 [2:23:17<5:52:53, 7.04s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:14:43,998 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:14:43,998 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:14:43,998 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1454/4460 [2:23:24<5:49:29, 6.98s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1454/4460 [2:23:24<5:49:29, 6.98s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:14:52,498 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:14:52,498 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1455/4460 [2:23:31<5:47:27, 6.94s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1455/4460 [2:23:31<5:47:27, 6.94s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1455/4460 [2:23:31<5:47:27, 6.94s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1455/4460 [2:23:31<5:47:27, 6.94s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1455/4460 [2:23:31<5:47:27, 6.94s/it]g-point operations will not be computed-03 00:13:45,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1456/4460 [2:23:38<5:45:31, 6.90s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:15:02,780 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1456/4460 [2:23:38<5:45:31, 6.90s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:15:02,780 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed 33%|████████████████████████▍ | 1456/4460 [2:23:38<5:45:31, 6.90s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:15:02,780 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▍ | 1456/4460 [2:23:38<5:45:31, 6.90s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:15:02,780 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▌ | 1457/4460 [2:23:44<5:43:13, 6.86s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:15:02,780 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▌ | 1457/4460 [2:23:44<5:43:13, 6.86s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:15:02,780 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▌ | 1457/4460 [2:23:44<5:43:13, 6.86s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:15:02,780 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▌ | 1457/4460 [2:23:44<5:43:13, 6.86s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:15:02,780 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:15:14,520 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:02,780 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:15:14,520 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:02,780 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-03 00:15:14,520 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:02,780 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:15:14,520 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:02,780 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:15:14,520 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:02,780 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▌ | 1459/4460 [2:23:58<5:39:54, 6.80s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▌ | 1459/4460 [2:23:58<5:39:54, 6.80s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▌ | 1459/4460 [2:23:58<5:39:54, 6.80s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▌ | 1459/4460 [2:23:58<5:39:54, 6.80s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▌ | 1460/4460 [2:24:04<5:38:29, 6.77s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-03 00:15:31,363 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:15:31,363 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:15:31,363 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▌ | 1461/4460 [2:24:11<5:36:32, 6.73s/it]g-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▌ | 1461/4460 [2:24:11<5:36:32, 6.73s/it]g-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▌ | 1461/4460 [2:24:11<5:36:32, 6.73s/it]g-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:15:41,183 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:15:41,183 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.749, 'learning_rate': 8.664739884393063e-05, 'epoch': 1.64} [WARNING|modeling_utils.py:388] 2022-03-03 00:15:41,183 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:15:47,708 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:15:47,708 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6374, 'learning_rate': 8.661849710982659e-05, 'epoch': 1.64} [WARNING|modeling_utils.py:388] 2022-03-03 00:15:47,708 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:15:47,708 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:15:47,708 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▌ | 1464/4460 [2:24:31<5:30:16, 6.61s/it]g-point 
operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:15:57,536 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:15:57,536 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:15:57,536 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▋ | 1465/4460 [2:24:37<5:27:11, 6.55s/it]g-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:16:03,921 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:16:03,921 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:16:03,921 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens 
of the input, floating-point operations will not be computed 33%|████████████████████████▋ | 1466/4460 [2:24:44<5:24:10, 6.50s/it]g-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▋ | 1466/4460 [2:24:44<5:24:10, 6.50s/it]g-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:16:11,815 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▋ | 1467/4460 [2:24:50<5:21:32, 6.45s/it]g-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▋ | 1467/4460 [2:24:50<5:21:32, 6.45s/it]g-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6369, 'learning_rate': 8.650289017341041e-05, 'epoch': 1.64} [WARNING|modeling_utils.py:388] 2022-03-03 00:16:18,115 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▋ | 1468/4460 [2:24:56<5:19:23, 6.40s/it]g-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▋ | 1468/4460 [2:24:56<5:19:23, 6.40s/it]g-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed {'loss': 6.7427, 'learning_rate': 8.647398843930637e-05, 'epoch': 1.65} 33%|████████████████████████▋ | 1468/4460 [2:24:56<5:19:23, 6.40s/it]g-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:16:25,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:16:25,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6899, 'learning_rate': 8.644508670520232e-05, 'epoch': 1.65} [WARNING|modeling_utils.py:388] 2022-03-03 00:16:25,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:16:32,130 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:16:32,130 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6439, 'learning_rate': 8.641618497109827e-05, 'epoch': 1.65} [WARNING|modeling_utils.py:388] 2022-03-03 00:16:32,130 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:16:38,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:16:38,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.5822, 'learning_rate': 8.638728323699423e-05, 'epoch': 1.65} [WARNING|modeling_utils.py:388] 2022-03-03 00:16:38,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:16:44,360 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:16:44,360 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.7477, 'learning_rate': 8.635838150289017e-05, 'epoch': 1.65} [WARNING|modeling_utils.py:388] 2022-03-03 00:16:44,360 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:16:50,526 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:16:50,526 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.5864, 'learning_rate': 8.632947976878613e-05, 'epoch': 1.65} [WARNING|modeling_utils.py:388] 2022-03-03 00:16:55,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▊ | 1474/4460 [2:25:33<5:06:00, 6.15s/it]g-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|████████████████████████▊ | 1474/4460 [2:25:33<5:06:00, 6.15s/it]g-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.5783, 'learning_rate': 8.630057803468209e-05, 'epoch': 1.65} 33%|████████████████████████▊ | 1474/4460 [2:25:33<5:06:00, 6.15s/it]g-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:17:02,945 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:15:23,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
{'loss': 6.4554, 'learning_rate': 8.627167630057804e-05, 'epoch': 1.65}
[WARNING|modeling_utils.py:388] 2022-03-03 00:17:07,345 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|████████████████████████▊ | 1476/4460 [2:25:45<5:03:09, 6.10s/it]
{'loss': 6.582, 'learning_rate': 8.6242774566474e-05, 'epoch': 1.65}
[WARNING|modeling_utils.py:388] 2022-03-03 00:17:13,063 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|████████████████████████▊ | 1477/4460 [2:25:51<4:57:10, 5.98s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:17:17,337 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|████████████████████████▊ | 1478/4460 [2:25:57<4:51:35, 5.87s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:17:22,867 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:17:25,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.6684, 'learning_rate': 8.615606936416185e-05, 'epoch': 1.66}
[WARNING|modeling_utils.py:388] 2022-03-03 00:17:29,605 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|████████████████████████▉ | 1480/4460 [2:26:07<4:39:42, 5.63s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:17:33,538 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|████████████████████████▉ | 1481/4460 [2:26:13<4:32:55, 5.50s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:17:37,447 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:17:41,139 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:17:43,574 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:17:45,907 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.4654, 'learning_rate': 8.604046242774567e-05, 'epoch': 1.66}
[WARNING|modeling_utils.py:388] 2022-03-03 00:17:49,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|████████████████████████▉ | 1484/4460 [2:26:27<4:09:32, 5.03s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:17:51,784 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:17:53,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|████████████████████████▉ | 1485/4460 [2:26:31<3:59:11, 4.82s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:17:56,011 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:17:57,945 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|████████████████████████▉ | 1486/4460 [2:26:35<3:46:29, 4.57s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:17:59,929 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:18:01,737 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|█████████████████████████ | 1487/4460 [2:26:39<3:34:18, 4.33s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:18:03,644 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|█████████████████████████ | 1488/4460 [2:26:43<3:21:49, 4.07s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:18:06,993 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|█████████████████████████ | 1489/4460 [2:26:46<3:05:58, 3.76s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:18:09,938 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|█████████████████████████ | 1490/4460 [2:26:48<2:50:38, 3.45s/it]
33%|█████████████████████████ | 1491/4460 [2:26:51<2:35:14, 3.14s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:18:16,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|█████████████████████████ | 1492/4460 [2:26:53<2:21:33, 2.86s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:18:18,508 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:18:22,258 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|█████████████████████████ | 1493/4460 [2:27:01<3:31:03, 4.27s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:18:26,030 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 00:18:29,659 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|█████████████████████████ | 1494/4460 [2:27:08<4:17:48, 5.22s/it]
{'loss': 6.7631, 'learning_rate': 8.572254335260116e-05, 'epoch': 1.67}
[WARNING|modeling_utils.py:388] 2022-03-03 00:18:38,806 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
34%|█████████████████████████▏ | 1496/4460 [2:27:23<5:08:51, 6.25s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:18:51,432 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
34%|█████████████████████████▏ | 1497/4460 [2:27:30<5:22:31, 6.53s/it]
{'loss': 6.8259, 'learning_rate': 8.563583815028902e-05, 'epoch': 1.68}
34%|█████████████████████████▏ | 1498/4460 [2:27:37<5:31:03, 6.71s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 00:19:05,662 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
34%|█████████████████████████▏ | 1499/4460 [2:27:44<5:37:02, 6.83s/it]
{'loss': 6.8608, 'learning_rate': 8.557803468208094e-05, 'epoch': 1.68}
[INFO|trainer.py:2366] 2022-03-03 00:19:15,221 >> Num examples = 2642
| 1500/4460 [2:27:52<5:48:23, 7.06s/it]
[INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.
[INFO|trainer.py:2366] 2022-03-03 00:19:15,221 >> Num examples = 2642 | 1500/4460 [2:27:52<5:48:23, 7.06s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2366] 2022-03-03 00:19:15,221 >> Num examples = 2642 | 1500/4460 [2:27:52<5:48:23, 7.06s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 1%|▊ | 3/331 [00:04<09:14, 1.69s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 1%|█ | 4/331 [00:07<10:34, 1.94s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 2%|█▎ | 5/331 [00:09<12:01, 2.21s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
2%|█▌ | 6/331 [00:12<12:56, 2.39s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 2%|█▊ | 7/331 [00:15<13:11, 2.44s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 2%|██ | 8/331 [00:17<13:30, 2.51s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 3%|██▎ | 9/331 [00:20<14:09, 2.64s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 3%|██▍ | 10/331 [00:23<15:02, 2.81s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
3%|██▋ | 11/331 [00:26<14:40, 2.75s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 4%|██▉ | 12/331 [00:29<14:27, 2.72s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 4%|███▏ | 13/331 [00:31<14:19, 2.70s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 4%|███▍ | 14/331 [00:34<14:14, 2.70s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 5%|███▋ | 15/331 [00:38<15:31, 2.95s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
5%|███▉ | 16/331 [00:41<16:35, 3.16s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 5%|████▏ | 17/331 [00:44<16:38, 3.18s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 5%|████▍ | 18/331 [00:47<15:19, 2.94s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 6%|████▋ | 19/331 [00:50<15:03, 2.90s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 6%|████▉ | 20/331 [00:52<14:03, 2.71s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
6%|█████▏ | 21/331 [00:55<14:40, 2.84s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 7%|█████▍ | 22/331 [00:59<15:47, 3.07s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 7%|█████▋ | 23/331 [01:03<16:59, 3.31s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 7%|█████▉ | 24/331 [01:06<17:54, 3.50s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 8%|██████▏ | 25/331 [01:10<17:21, 3.40s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
8%|██████▍ | 26/331 [01:12<16:04, 3.16s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 8%|██████▋ | 27/331 [01:16<16:16, 3.21s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 8%|██████▉ | 28/331 [01:18<15:44, 3.12s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 9%|███████▏ | 29/331 [01:21<15:21, 3.05s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 9%|███████▍ | 30/331 [01:24<14:43, 2.93s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
9%|███████▋ | 31/331 [01:27<14:03, 2.81s/it] 10%|███████▉ | 32/331 [01:29<13:49, 2.77s/it] 10%|████████▏ | 33/331 [01:32<13:55, 2.80s/it] 10%|████████▍ | 34/331 [01:35<13:53, 2.81s/it] 11%|████████▋ | 35/331 [01:38<13:56, 2.83s/it]
11%|████████▉ | 36/331 [01:41<14:40, 2.99s/it] 11%|█████████▏ | 37/331 [01:45<15:17, 3.12s/it] 11%|█████████▍ | 38/331 [01:48<15:25, 3.16s/it] 12%|█████████▋ | 39/331 [01:51<15:21, 3.16s/it] 12%|█████████▉ | 40/331 [01:53<14:13, 2.93s/it]
12%|██████████▏ | 41/331 [01:56<13:42, 2.84s/it] 13%|██████████▍ | 42/331 [01:59<14:28, 3.01s/it] 13%|██████████▋ | 43/331 [02:03<15:10, 3.16s/it] 13%|██████████▉ | 44/331 [02:06<15:29, 3.24s/it] 14%|███████████▏ | 45/331 [02:09<14:36, 3.06s/it]
14%|███████████▍ | 46/331 [02:11<13:34, 2.86s/it] 14%|███████████▋ | 47/331 [02:14<12:52, 2.72s/it] 15%|███████████▉ | 48/331 [02:17<13:03, 2.77s/it] 15%|████████████▏ | 49/331 [02:20<13:37, 2.90s/it] 15%|████████████▍ | 50/331 [02:23<13:37, 2.91s/it]
15%|████████████▋ | 51/331 [02:26<14:06, 3.02s/it] 16%|████████████▉ | 52/331 [02:29<13:30, 2.90s/it] 16%|█████████████▏ | 53/331 [02:32<13:28, 2.91s/it] 16%|█████████████▍ | 54/331 [02:34<12:46, 2.77s/it] 17%|█████████████▋ | 55/331 [02:38<13:54, 3.02s/it]
17%|█████████████▊ | 56/331 [02:41<13:37, 2.97s/it] 17%|██████████████ | 57/331 [02:43<13:10, 2.88s/it] 18%|██████████████▎ | 58/331 [02:47<13:39, 3.00s/it] 18%|██████████████▌ | 59/331 [02:49<12:57, 2.86s/it] 18%|██████████████▊ | 60/331 [02:52<12:41, 2.81s/it]
18%|███████████████ | 61/331 [02:55<13:11, 2.93s/it] 19%|███████████████▎ | 62/331 [02:58<13:01, 2.90s/it] 19%|███████████████▌ | 63/331 [03:02<14:06, 3.16s/it] 19%|███████████████▊ | 64/331 [03:04<13:39, 3.07s/it] 20%|████████████████ | 65/331 [03:07<13:17, 3.00s/it]
20%|████████████████▎ | 66/331 [03:11<14:26, 3.27s/it] 20%|████████████████▌ | 67/331 [03:15<15:07, 3.44s/it] 21%|████████████████▊ | 68/331 [03:18<15:09, 3.46s/it] 21%|█████████████████ | 69/331 [03:22<14:47, 3.39s/it] 21%|█████████████████▎ | 70/331 [03:25<14:34, 3.35s/it]
21%|█████████████████▌ | 71/331 [03:28<14:43, 3.40s/it] 22%|█████████████████▊ | 72/331 [03:32<14:38, 3.39s/it] 22%|██████████████████ | 73/331 [03:35<14:06, 3.28s/it] 22%|██████████████████▎ | 74/331 [03:38<13:44, 3.21s/it] 23%|██████████████████▌ | 75/331 [03:41<13:52, 3.25s/it]
23%|██████████████████▊ | 76/331 [03:44<13:15, 3.12s/it] 23%|███████████████████ | 77/331 [03:47<12:54, 3.05s/it] 24%|███████████████████▎ | 78/331 [03:50<12:22, 2.93s/it] 24%|███████████████████▌ | 79/331 [03:52<12:00, 2.86s/it] 24%|███████████████████▊ | 80/331 [03:55<11:53, 2.84s/it]
24%|████████████████████ | 81/331 [03:58<12:14, 2.94s/it] 25%|████████████████████▎ | 82/331 [04:01<11:58, 2.88s/it] 25%|████████████████████▌ | 83/331 [04:04<12:17, 2.98s/it] 25%|████████████████████▊ | 84/331 [04:08<13:03, 3.17s/it] 26%|█████████████████████ | 85/331 [04:10<12:11, 2.97s/it]
26%|█████████████████████▎ | 86/331 [04:14<12:50, 3.14s/it] 26%|█████████████████████▌ | 87/331 [04:17<12:29, 3.07s/it] 27%|█████████████████████▊ | 88/331 [04:20<12:07, 2.99s/it] 27%|██████████████████████ | 89/331 [04:22<11:19, 2.81s/it] 27%|██████████████████████▎ | 90/331 [04:24<10:48, 2.69s/it]
27%|██████████████████████▌ | 91/331 [04:27<11:14, 2.81s/it] 28%|██████████████████████▊ | 92/331 [04:30<10:30, 2.64s/it] 28%|███████████████████████ | 93/331 [04:33<10:41, 2.70s/it] 28%|███████████████████████▎ | 94/331 [04:36<11:05, 2.81s/it] 29%|███████████████████████▌ | 95/331 [04:39<11:08, 2.83s/it]
29%|███████████████████████▊ | 96/331 [04:41<11:07, 2.84s/it] 29%|████████████████████████ | 97/331 [04:44<10:43, 2.75s/it] 30%|████████████████████████▎ | 98/331 [04:47<11:09, 2.87s/it] 30%|████████████████████████▌ | 99/331 [04:50<11:01, 2.85s/it] 30%|████████████████████████▍ | 100/331 [04:52<10:33, 2.74s/it]
31%|████████████████████████▋ | 101/331 [04:55<10:26, 2.72s/it] 31%|████████████████████████▉ | 102/331 [04:59<11:16, 2.95s/it] 31%|█████████████████████████▏ | 103/331 [05:01<10:46, 2.83s/it] 31%|█████████████████████████▍ | 104/331 [05:04<10:37, 2.81s/it] 32%|█████████████████████████▋ | 105/331 [05:07<10:43, 2.85s/it]
32%|█████████████████████████▉ | 106/331 [05:10<10:44, 2.87s/it] 32%|██████████████████████████▏ | 107/331 [05:12<10:06, 2.71s/it] 33%|██████████████████████████▍ | 108/331 [05:15<09:53, 2.66s/it] 33%|██████████████████████████▋ | 109/331 [05:17<09:55, 2.68s/it] 33%|██████████████████████████▉ | 110/331 [05:20<10:19, 2.80s/it]
34%|███████████████████████████▏ | 111/331 [05:23<10:23, 2.83s/it] 34%|███████████████████████████▍ | 112/331 [05:26<10:23, 2.85s/it] 34%|███████████████████████████▋ | 113/331 [05:29<09:55, 2.73s/it] 34%|███████████████████████████▉ | 114/331 [05:31<09:55, 2.75s/it] 35%|████████████████████████████▏ | 115/331 [05:34<09:56, 2.76s/it]
35%|████████████████████████████▍ | 116/331 [05:37<10:10, 2.84s/it] 35%|████████████████████████████▋ | 117/331 [05:40<10:05, 2.83s/it] 36%|████████████████████████████▉ | 118/331 [05:43<09:52, 2.78s/it] 36%|█████████████████████████████ | 119/331 [05:46<09:50, 2.79s/it] 36%|█████████████████████████████▎ | 120/331 [05:48<09:50, 2.80s/it]
37%|█████████████████████████████▌ | 121/331 [05:52<10:18, 2.94s/it] 37%|█████████████████████████████▊ | 122/331 [05:54<10:01, 2.88s/it] 37%|██████████████████████████████ | 123/331 [05:58<10:39, 3.08s/it] 37%|██████████████████████████████▎ | 124/331 [06:01<10:31, 3.05s/it] 38%|██████████████████████████████▌ | 125/331 [06:05<11:05, 3.23s/it]
38%|██████████████████████████████▊ | 126/331 [06:08<11:08, 3.26s/it] 38%|███████████████████████████████ | 127/331 [06:12<11:29, 3.38s/it] 39%|███████████████████████████████▎ | 128/331 [06:15<11:31, 3.41s/it] 39%|███████████████████████████████▌ | 129/331 [06:18<11:12, 3.33s/it] 39%|███████████████████████████████▊ | 130/331 [06:22<11:20, 3.38s/it]
40%|████████████████████████████████ | 131/331 [06:25<11:30, 3.45s/it] 40%|████████████████████████████████▎ | 132/331 [06:28<10:53, 3.28s/it] 40%|████████████████████████████████▌ | 133/331 [06:31<10:12, 3.09s/it] 40%|████████████████████████████████▊ | 134/331 [06:34<09:49, 2.99s/it] 41%|█████████████████████████████████ | 135/331 [06:37<09:58, 3.06s/it]
41%|█████████████████████████████████▎ | 136/331 [06:40<10:14, 3.15s/it] 41%|█████████████████████████████████▌ | 137/331 [06:44<10:33, 3.27s/it] 42%|█████████████████████████████████▊ | 138/331 [06:47<10:48, 3.36s/it] 42%|██████████████████████████████████ | 139/331 [06:50<09:42, 3.04s/it] 42%|██████████████████████████████████▎ | 140/331 [06:53<10:27, 3.28s/it]
43%|██████████████████████████████████▌ | 141/331 [06:56<09:59, 3.15s/it] 43%|██████████████████████████████████▋ | 142/331 [06:59<09:40, 3.07s/it] 43%|██████████████████████████████████▉ | 143/331 [07:03<10:05, 3.22s/it] 44%|███████████████████████████████████▏ | 144/331 [07:05<09:38, 3.09s/it] 44%|███████████████████████████████████▍ | 145/331 [07:08<09:30, 3.06s/it]
44%|███████████████████████████████████▋ | 146/331 [07:12<09:59, 3.24s/it] 44%|███████████████████████████████████▉ | 147/331 [07:15<09:35, 3.13s/it] 45%|████████████████████████████████████▏ | 148/331 [07:18<09:01, 2.96s/it] 45%|████████████████████████████████████▍ | 149/331 [07:20<08:32, 2.82s/it]
46%|████████████████████████████████████▉ | 151/331 [07:26<08:46, 2.93s/it] 46%|█████████████████████████████████████▏ | 152/331 [07:29<08:17, 2.78s/it] 46%|█████████████████████████████████████▍ | 153/331 [07:31<08:07, 2.74s/it] 47%|█████████████████████████████████████▋ | 154/331 [07:35<08:32, 2.89s/it]
47%|█████████████████████████████████████▉ | 155/331 [07:38<08:53, 3.03s/it] 47%|██████████████████████████████████████▏ | 156/331 [07:41<09:06, 3.12s/it] 47%|██████████████████████████████████████▍ | 157/331 [07:45<09:28, 3.27s/it] 48%|██████████████████████████████████████▋ | 158/331 [07:48<09:31, 3.30s/it] 48%|██████████████████████████████████████▉ | 159/331 [07:52<09:36, 3.35s/it]
48%|███████████████████████████████████████▏ | 160/331 [07:54<09:02, 3.17s/it] 49%|███████████████████████████████████████▍ | 161/331 [07:57<08:51, 3.13s/it] 49%|███████████████████████████████████████▋ | 162/331 [08:01<09:12, 3.27s/it] 49%|███████████████████████████████████████▉ | 163/331 [08:04<09:17, 3.32s/it] 50%|████████████████████████████████████████▏ | 164/331 [08:07<08:49, 3.17s/it]
50%|████████████████████████████████████████▍ | 165/331 [08:10<08:33, 3.09s/it] 50%|████████████████████████████████████████▌ | 166/331 [08:13<08:21, 3.04s/it] 50%|████████████████████████████████████████▊ | 167/331 [08:16<08:29, 3.10s/it] 51%|█████████████████████████████████████████ | 168/331 [08:19<08:01, 2.95s/it] 51%|█████████████████████████████████████████▎ | 169/331 [08:22<08:12, 3.04s/it]
51%|█████████████████████████████████████████▌ | 170/331 [08:25<07:52, 2.94s/it] 52%|█████████████████████████████████████████▊ | 171/331 [08:28<07:49, 2.94s/it] 52%|██████████████████████████████████████████ | 172/331 [08:30<07:31, 2.84s/it] 52%|██████████████████████████████████████████▎ | 173/331 [08:34<07:37, 2.89s/it] 53%|██████████████████████████████████████████▌ | 174/331 [08:36<07:19, 2.80s/it]
53%|██████████████████████████████████████████▊ | 175/331 [08:39<07:25, 2.85s/it] 53%|███████████████████████████████████████████ | 176/331 [08:42<07:12, 2.79s/it] 53%|███████████████████████████████████████████▎ | 177/331 [08:45<07:35, 2.96s/it] 54%|███████████████████████████████████████████▌ | 178/331 [08:49<08:01, 3.15s/it] 54%|███████████████████████████████████████████▊ | 179/331 [08:52<08:21, 3.30s/it]
54%|████████████████████████████████████████████ | 180/331 [08:55<08:09, 3.24s/it] 55%|████████████████████████████████████████████▎ | 181/331 [08:59<08:02, 3.22s/it] 55%|████████████████████████████████████████████▌ | 182/331 [09:00<06:57, 2.80s/it] 55%|████████████████████████████████████████████▊ | 183/331 [09:03<06:35, 2.67s/it] 56%|█████████████████████████████████████████████ | 184/331 [09:05<06:14, 2.55s/it]
56%|█████████████████████████████████████████████▎ | 185/331 [09:07<05:52, 2.42s/it] 56%|█████████████████████████████████████████████▌ | 186/331 [09:10<06:05, 2.52s/it] 56%|█████████████████████████████████████████████▊ | 187/331 [09:13<06:38, 2.77s/it] 57%|██████████████████████████████████████████████ | 188/331 [09:16<06:39, 2.79s/it] 57%|██████████████████████████████████████████████▎ | 189/331 [09:19<06:20, 2.68s/it]
57%|██████████████████████████████████████████████▍ | 190/331 [09:21<06:09, 2.62s/it] 58%|██████████████████████████████████████████████▋ | 191/331 [09:24<06:04, 2.60s/it] 58%|██████████████████████████████████████████████▉ | 192/331 [09:26<05:54, 2.55s/it] 58%|███████████████████████████████████████████████▏ | 193/331 [09:29<06:17, 2.74s/it] 59%|███████████████████████████████████████████████▍ | 194/331 [09:32<05:59, 2.63s/it]
59%|███████████████████████████████████████████████▋ | 195/331 [09:34<05:49, 2.57s/it] 59%|███████████████████████████████████████████████▉ | 196/331 [09:37<05:58, 2.65s/it] 60%|████████████████████████████████████████████████▏ | 197/331 [09:40<06:12, 2.78s/it] 60%|████████████████████████████████████████████████▍ | 198/331 [09:42<05:54, 2.67s/it] 60%|████████████████████████████████████████████████▋ | 199/331 [09:45<05:56, 2.70s/it]
60%|████████████████████████████████████████████████▉ | 200/331 [09:47<05:38, 2.58s/it] 61%|█████████████████████████████████████████████████▏ | 201/331 [09:50<05:33, 2.56s/it] 61%|█████████████████████████████████████████████████▍ | 202/331 [09:53<05:39, 2.63s/it] 61%|█████████████████████████████████████████████████▋ | 203/331 [09:55<05:40, 2.66s/it]
62%|█████████████████████████████████████████████████▉ | 204/331 [09:59<06:04, 2.87s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 62%|██████████████████████████████████████████████████▏ | 205/331 [10:02<06:08, 2.92s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 62%|██████████████████████████████████████████████████▍ | 206/331 [10:05<06:02, 2.90s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 63%|██████████████████████████████████████████████████▋ | 207/331 [10:08<06:15, 3.03s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 63%|██████████████████████████████████████████████████▉ | 208/331 [10:11<06:18, 3.07s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. 
If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 63%|███████████████████████████████████████████████████▏ | 209/331 [10:13<05:45, 2.84s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 63%|███████████████████████████████████████████████████▍ | 210/331 [10:16<05:24, 2.69s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 64%|███████████████████████████████████████████████████▋ | 211/331 [10:19<05:30, 2.76s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 64%|███████████████████████████████████████████████████▉ | 212/331 [10:21<05:18, 2.68s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
64%|████████████████████████████████████████████████████ | 213/331 [10:24<05:17, 2.69s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 65%|████████████████████████████████████████████████████▎ | 214/331 [10:26<05:02, 2.59s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 65%|████████████████████████████████████████████████████▌ | 215/331 [10:29<04:50, 2.50s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 65%|████████████████████████████████████████████████████▊ | 216/331 [10:32<05:18, 2.77s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 66%|█████████████████████████████████████████████████████ | 217/331 [10:35<05:15, 2.77s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. 
If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 66%|█████████████████████████████████████████████████████▎ | 218/331 [10:38<05:28, 2.90s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 66%|█████████████████████████████████████████████████████▌ | 219/331 [10:41<05:23, 2.89s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 66%|█████████████████████████████████████████████████████▊ | 220/331 [10:43<05:09, 2.79s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 67%|██████████████████████████████████████████████████████ | 221/331 [10:46<05:09, 2.82s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
67%|██████████████████████████████████████████████████████▎ | 222/331 [10:49<04:54, 2.71s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 67%|██████████████████████████████████████████████████████▌ | 223/331 [10:52<04:56, 2.75s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 68%|██████████████████████████████████████████████████████▊ | 224/331 [10:54<04:59, 2.80s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 68%|███████████████████████████████████████████████████████ | 225/331 [10:57<04:56, 2.80s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 68%|███████████████████████████████████████████████████████▎ | 226/331 [11:01<05:08, 2.94s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. 
If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 69%|███████████████████████████████████████████████████████▌ | 227/331 [11:03<04:59, 2.88s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 69%|███████████████████████████████████████████████████████▊ | 228/331 [11:06<04:52, 2.84s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 69%|████████████████████████████████████████████████████████ | 229/331 [11:09<04:51, 2.86s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 69%|████████████████████████████████████████████████████████▎ | 230/331 [11:12<04:40, 2.78s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
70%|████████████████████████████████████████████████████████▌ | 231/331 [11:15<04:47, 2.87s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 70%|████████████████████████████████████████████████████████▊ | 232/331 [11:17<04:39, 2.83s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 70%|█████████████████████████████████████████████████████████ | 233/331 [11:21<04:47, 2.94s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 71%|█████████████████████████████████████████████████████████▎ | 234/331 [11:23<04:33, 2.82s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 71%|█████████████████████████████████████████████████████████▌ | 235/331 [11:26<04:20, 2.71s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. 
If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 71%|█████████████████████████████████████████████████████████▊ | 236/331 [11:29<04:46, 3.02s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 72%|█████████████████████████████████████████████████████████▉ | 237/331 [11:33<04:56, 3.15s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 72%|██████████████████████████████████████████████████████████▏ | 238/331 [11:36<04:51, 3.13s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 72%|██████████████████████████████████████████████████████████▍ | 239/331 [11:39<04:49, 3.15s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
73%|██████████████████████████████████████████████████████████▋ | 240/331 [11:42<04:52, 3.22s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 73%|██████████████████████████████████████████████████████████▉ | 241/331 [11:46<04:57, 3.30s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 73%|███████████████████████████████████████████████████████████▏ | 242/331 [11:49<04:54, 3.31s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length.
If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 74%|███████████████████████████████████████████████████████████▋ | 244/331 [11:56<04:57, 3.42s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 74%|███████████████████████████████████████████████████████████▉ | 245/331 [11:59<04:44, 3.30s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 74%|████████████████████████████████████████████████████████████▏ | 246/331 [12:03<04:56, 3.49s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 75%|████████████████████████████████████████████████████████████▍ | 247/331 [12:06<04:43, 3.37s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
75%|████████████████████████████████████████████████████████████▋ | 248/331 [12:09<04:22, 3.16s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 75%|████████████████████████████████████████████████████████████▉ | 249/331 [12:11<04:00, 2.94s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 76%|█████████████████████████████████████████████████████████████▏ | 250/331 [12:14<03:48, 2.82s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 76%|█████████████████████████████████████████████████████████████▍ | 251/331 [12:17<03:51, 2.90s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 76%|█████████████████████████████████████████████████████████████▋ | 252/331 [12:19<03:39, 2.78s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. 
If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 76%|█████████████████████████████████████████████████████████████▉ | 253/331 [12:23<03:48, 2.93s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 77%|██████████████████████████████████████████████████████████████▏ | 254/331 [12:25<03:40, 2.86s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 77%|██████████████████████████████████████████████████████████████▍ | 255/331 [12:29<03:43, 2.95s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 77%|██████████████████████████████████████████████████████████████▋ | 256/331 [12:31<03:34, 2.86s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
78%|██████████████████████████████████████████████████████████████▉ | 257/331 [12:34<03:38, 2.96s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 78%|███████████████████████████████████████████████████████████████▍ | 259/331 [12:40<03:19, 2.77s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 79%|███████████████████████████████████████████████████████████████▋ | 260/331 [12:43<03:23, 2.87s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length.
If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 79%|███████████████████████████████████████████████████████████████▊ | 261/331 [12:45<03:11, 2.74s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 79%|████████████████████████████████████████████████████████████████ | 262/331 [12:48<03:09, 2.74s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 79%|████████████████████████████████████████████████████████████████▎ | 263/331 [12:51<03:16, 2.89s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 80%|████████████████████████████████████████████████████████████████▌ | 264/331 [12:54<03:08, 2.81s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
80%|████████████████████████████████████████████████████████████████▊ | 265/331 [12:56<03:01, 2.76s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 81%|█████████████████████████████████████████████████████████████████▎ | 267/331 [13:02<03:04, 2.88s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 81%|█████████████████████████████████████████████████████████████████▌ | 268/331 [13:05<03:01, 2.88s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length.
If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 81%|█████████████████████████████████████████████████████████████████▊ | 269/331 [13:09<03:09, 3.05s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 82%|██████████████████████████████████████████████████████████████████ | 270/331 [13:12<03:04, 3.03s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 82%|██████████████████████████████████████████████████████████████████▎ | 271/331 [13:15<03:08, 3.14s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 82%|██████████████████████████████████████████████████████████████████▌ | 272/331 [13:18<02:58, 3.02s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
83%|███████████████████████████████████████████████████████████████████ | 274/331 [13:24<03:00, 3.17s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 83%|███████████████████████████████████████████████████████████████████▎ | 275/331 [13:28<03:00, 3.22s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 83%|███████████████████████████████████████████████████████████████████▌ | 276/331 [13:30<02:47, 3.04s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length.
If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|███████████████████████████████████████████████████████████████████▊ | 277/331 [13:33<02:40, 2.97s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|████████████████████████████████████████████████████████████████████ | 278/331 [13:36<02:36, 2.95s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|████████████████████████████████████████████████████████████████████▎ | 279/331 [13:40<02:46, 3.19s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████████▌ | 280/331 [13:43<02:39, 3.13s/it][INFO|trainer.py:560] 2022-03-03 00:19:15,215 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
85%|████████████████████████████████████████████████████████████████████▊ | 281/331 [13:46<02:40, 3.21s/it]
85%|█████████████████████████████████████████████████████████████████████ | 282/331 [13:49<02:37, 3.21s/it]
85%|█████████████████████████████████████████████████████████████████████▎ | 283/331 [13:53<02:37, 3.29s/it]
86%|█████████████████████████████████████████████████████████████████████▍ | 284/331 [13:56<02:39, 3.38s/it]
86%|█████████████████████████████████████████████████████████████████████▋ | 285/331 [14:00<02:39, 3.46s/it]
86%|█████████████████████████████████████████████████████████████████████▉ | 286/331 [14:04<02:37, 3.49s/it]
87%|██████████████████████████████████████████████████████████████████████▏ | 287/331 [14:07<02:37, 3.58s/it]
87%|██████████████████████████████████████████████████████████████████████▍ | 288/331 [14:11<02:33, 3.56s/it]
87%|██████████████████████████████████████████████████████████████████████▋ | 289/331 [14:14<02:20, 3.35s/it]
88%|██████████████████████████████████████████████████████████████████████▉ | 290/331 [14:16<02:10, 3.18s/it]
88%|███████████████████████████████████████████████████████████████████████▏ | 291/331 [14:19<02:01, 3.03s/it]
88%|███████████████████████████████████████████████████████████████████████▍ | 292/331 [14:22<01:54, 2.94s/it]
89%|███████████████████████████████████████████████████████████████████████▋ | 293/331 [14:25<01:51, 2.93s/it]
89%|███████████████████████████████████████████████████████████████████████▉ | 294/331 [14:27<01:43, 2.79s/it]
89%|████████████████████████████████████████████████████████████████████████▏ | 295/331 [14:30<01:37, 2.71s/it]
89%|████████████████████████████████████████████████████████████████████████▍ | 296/331 [14:32<01:32, 2.63s/it]
90%|████████████████████████████████████████████████████████████████████████▋ | 297/331 [14:36<01:39, 2.92s/it]
90%|████████████████████████████████████████████████████████████████████████▉ | 298/331 [14:39<01:43, 3.13s/it]
90%|█████████████████████████████████████████████████████████████████████████▏ | 299/331 [14:42<01:37, 3.04s/it]
91%|█████████████████████████████████████████████████████████████████████████▍ | 300/331 [14:45<01:33, 3.01s/it]
91%|█████████████████████████████████████████████████████████████████████████▋ | 301/331 [14:48<01:29, 2.98s/it]
91%|█████████████████████████████████████████████████████████████████████████▉ | 302/331 [14:51<01:24, 2.91s/it]
92%|██████████████████████████████████████████████████████████████████████████▏ | 303/331 [14:54<01:23, 2.99s/it]
92%|██████████████████████████████████████████████████████████████████████████▍ | 304/331 [14:58<01:28, 3.28s/it]
92%|██████████████████████████████████████████████████████████████████████████▋ | 305/331 [15:01<01:25, 3.29s/it]
92%|██████████████████████████████████████████████████████████████████████████▉ | 306/331 [15:05<01:24, 3.39s/it]
93%|███████████████████████████████████████████████████████████████████████████▏ | 307/331 [15:09<01:24, 3.51s/it]
93%|███████████████████████████████████████████████████████████████████████████▎ | 308/331 [15:13<01:23, 3.64s/it]
93%|███████████████████████████████████████████████████████████████████████████▌ | 309/331 [15:16<01:20, 3.64s/it]
94%|███████████████████████████████████████████████████████████████████████████▊ | 310/331 [15:19<01:11, 3.38s/it]
94%|████████████████████████████████████████████████████████████████████████████ | 311/331 [15:22<01:07, 3.35s/it]
94%|████████████████████████████████████████████████████████████████████████████▎ | 312/331 [15:25<00:59, 3.14s/it]
95%|████████████████████████████████████████████████████████████████████████████▌ | 313/331 [15:28<00:55, 3.07s/it]
95%|████████████████████████████████████████████████████████████████████████████▊ | 314/331 [15:31<00:52, 3.11s/it]
95%|█████████████████████████████████████████████████████████████████████████████ | 315/331 [15:35<00:51, 3.21s/it]
95%|█████████████████████████████████████████████████████████████████████████████▎ | 316/331 [15:38<00:48, 3.22s/it]
96%|█████████████████████████████████████████████████████████████████████████████▌ | 317/331 [15:41<00:46, 3.35s/it]
96%|█████████████████████████████████████████████████████████████████████████████▊ | 318/331 [15:44<00:41, 3.17s/it]
96%|██████████████████████████████████████████████████████████████████████████████ | 319/331 [15:47<00:36, 3.01s/it]
97%|██████████████████████████████████████████████████████████████████████████████▎ | 320/331 [15:50<00:33, 3.03s/it]
97%|██████████████████████████████████████████████████████████████████████████████▌ | 321/331 [15:53<00:30, 3.01s/it]
97%|██████████████████████████████████████████████████████████████████████████████▊ | 322/331 [15:56<00:28, 3.16s/it]
98%|███████████████████████████████████████████████████████████████████████████████ | 323/331 [15:59<00:24, 3.07s/it]
98%|███████████████████████████████████████████████████████████████████████████████▎ | 324/331 [16:03<00:22, 3.17s/it]
98%|███████████████████████████████████████████████████████████████████████████████▌ | 325/331 [16:06<00:19, 3.21s/it]
98%|███████████████████████████████████████████████████████████████████████████████▊ | 326/331 [16:09<00:16, 3.26s/it]
99%|████████████████████████████████████████████████████████████████████████████████ | 327/331 [16:13<00:13, 3.25s/it]
99%|████████████████████████████████████████████████████████████████████████████████▎| 328/331 [16:16<00:09, 3.27s/it]
99%|████████████████████████████████████████████████████████████████████████████████▌| 329/331 [16:19<00:06, 3.21s/it]
100%|████████████████████████████████████████████████████████████████████████████████▊| 330/331 [16:23<00:03, 3.39s/it]
100%|█████████████████████████████████████████████████████████████████████████████████| 331/331 [16:25<00:00, 2.92s/it]
03/03/2022 00:35:43 - INFO - datasets.metric - Removing /home/sanchit_huggingface_co/.cache/huggingface/metrics/wer/default/default_experiment-1-0.arrow
[INFO|configuration_utils.py:438] 2022-03-03 00:35:43,962 >> Configuration saved in ./checkpoint-1500/config.json
[INFO|feature_extraction_utils.py:324] 2022-03-03 00:35:48,828 >> Configuration saved in ./checkpoint-1500/preprocessor_config.json
03/03/2022 00:37:53 - WARNING - huggingface_hub.repository - Adding files tracked by Git LFS: ['wandb/run-20220302_215121-t49ehimo/run-t49ehimo.wandb']. This may take a bit of time if the files are large.